Dear Laurence,
As far as I know, the latest version of the application does not allow us to load text files which has Unicode characters, and the previous versions of the application generates weird characters in “corpus files” pane when the filenames contain characters like İ,Ş,ı etc. If it is possible and compatible with the backend programming language, could you please change the behaviour of that section?
Since we have to track some concordances and the related text file, it will be very useful for us. Otherwise, we will need to use the 3.2.4 version for the texts which has Unicode characters in their names.
Best regards,
Umut
--
You received this message because you are subscribed to the Google Groups "AntConc-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to antconc+u...@googlegroups.com.
To post to this group, send email to ant...@googlegroups.com.
Visit this group at http://groups.google.com/group/antconc.
For more options, visit https://groups.google.com/groups/opt_out.
Dear Laurence,
Let me show the problem via screenshots.
When I try to load all the text files in a directory, AntConc 3.2.4 generates such kind of invalid characters in filenames (see ss1.png).
In addition to that, if I try to load the text files in 3.2.5, it never lists the text files in “Corpus Files” pane, and generates an error message (see ss2.png).
If the backend programming language allows you to change the behaviour of that section, could you please change it also to UTF-8 encoding? By the way, all of my text files are processed in the tools supplied by AntConc such as Wordlist, Concordance, Collocation etc.
Best regards,
Umut
Dear Laurence,
We are still working on AntConc 3.2.4. My operating system is Windows 7 *64. The System Locale is Turkish (I’m not sure whether it uses ANSI or UTF-8, but probably ANSI).
However, I also tried to use 3.2.4 and 3.3.5 with Windows Server 2008 *64. It’s default language is English, and the problem still exists.
All my texts are encoded in UTF-8 since it supplies a wide range of character encoding.
Hope to see AntConc 4.0 soon.
Dear Laurence,
If I change the character encoding to ISO-8859-9 (Turkish), all the file names seem to be listed regularly in both versions. Moreover, I tried to generate concordance lines with ISO-8859 -9 (Turkish) encoding, and there is no problem at all.
It is also good to mention that all the texts are encoded in UTF-8. The problem is resolved.
Thank you for all.
Dear Laurence,
I closed and opened the application again. This time, both 3.2.4 and 3.3.5 generated the concordances with errors, but the filenames appear correctly. You are right, there is a problem with the tools as you expected. Probably, in the previous process, I just closed all the tools and files and reopened them after generating concordances.
Dear Laurence,
It’s great to hear that fixing is possible.
If you need to test the recent version before the release, I can test the application, and send feedback.