I am currently working on a project that makes extensive use of the Private Use Area of Unicode to record different letter forms found in medieval texts.
I have tried using opening *.txt files (encoded in UTF-8) derived from this project in both Antconc 3.5.9 and 4.3.1 and changing the font to Junicode, the font we are using.
In Antconc 4.3.1, it is possible to get the text displaying properly:

In Antconc 3.5.9, many of the PUA codepoints decompose into multiple katakana characters:

Eventually, the corpus project I am involved with will probably need its own front end, but in the short to medium term being able to use Antconc to do corpus analysis of the data would be a significant boon, so if there is a way to resolve the issue in either 3.5.9 or 4.3.1, I would be very happy to hear it.
Thanks,
Mark
--
You received this message because you are subscribed to the Google Groups "AntConc-Discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to antconc+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/antconc/49dd2715-ff89-49b1-9ef8-7b2faf56678en%40googlegroups.com.