Thank you for the report. The reason is that older versions of WikidPad
only supported the current system codepage (often 1252). When updating
wikis the pages were not automatically converted to utf-8.
Instead they are only converted when they are modified, but the
international characters page was obviously never modified since these
ancient times.
In the next version this page will be encoded properly in UTF-8.
Further inconsistencies of encoding of other pages (there are much more
of them as Christian Ziemski told me) will be fixed in a later version
as I first have to write a little tool for that.
> I also noticed that Wikipad doesn't read UTF-16 LE, although this is a
> format that some Windows programs may default to (eg MS Word). Are
> there any plans to support this?
Next 2.0 version will be able to read UTF-16 LE and BE files with
appropriate BOMs.
Michael
> In the next version this page will be encoded properly in UTF-8.
> [...] as I first have to write a little tool for that.
You may want to use the Linux tools:
'iconv' (old) or 'recode' (newer)
Regarding the BOM in many of those files:
According to
- http://unicode.org/faq/utf_bom.html#bom5
- http://en.wikipedia.org/wiki/Byte_Order_Mark
and other sources it seems to be a good idea to
*not* using a BOM in UTF8 files.
(In the help files and in the Wiki pages generally.)
Christian
Unfortunately there may be some occasions where ANSI-encoded data (or
UTF-16 BE/LE encoded one) from other sources may go into the wiki. I
assume that some people may modify wiki pages outside of WikidPad or add
new ones. So I can't be sure about the encoding of a file without a BOM.
Michael
PO files must always be in UTF-8 even if no BOM is detected because they
are meant to be installed on many other systems with different default
encodings.
The .wiki files are more probable to stay on the same system so assuming
the default encoding for the BOM-less ones is a good guess.
Michael