Lexique Pro document export does not print all entries

99 views
Skip to first unread message

Lucas Haiduck

unread,
Mar 11, 2015, 9:45:55 AM3/11/15
to shoeboxtoolbox-fiel...@googlegroups.com
Hello everyone,

I have a problem with the document export from Lexique Pro. From skipping through the topics in this group I assume there might be people here who can help me.

First off, I have a Toolbox lexicon which I imported into Lexique Pro. Works fine.
Now if I select an .rtf export, the printed version is incomplete: That is, it only prints entries up to î, which basically means over half of the entries are missing.

I tried .docx and .odt export, with slightly improved results: It gives me a print version with entries up to û.
Still, I can't export the full lexicon.
Tried to overwrite the exported documents multiple times, without any sucess.
The missing entries have a \lx marker in the lexicon's .txt, just like the ones LP prints. So the selection of entries left unprinted seems totally random, except for it's alphabetical order.

Has anyone ever witnessed this problem or something similar? Does anyone have a solution?

Best,
Lucas

Nick Thieberger

unread,
Mar 11, 2015, 12:37:14 PM3/11/15
to shoeboxtoolbox-field-linguists-toolbox
Hi,

I recall a similar problem and it was solved by looking at the character at the point at which the export failed. There were some weird, perhaps ex-MS Word character encodings as I recall that caused the problem.

Nick

--
You received this message because you are subscribed to the Google Groups "Shoebox/Toolbox Field Linguist's Toolbox" group.
To unsubscribe from this group and stop receiving emails from it, send an email to shoeboxtoolbox-field-ling...@googlegroups.com.
To post to this group, send email to shoeboxtoolbox-fiel...@googlegroups.com.
Visit this group at http://groups.google.com/group/shoeboxtoolbox-field-linguists-toolbox.
For more options, visit https://groups.google.com/d/optout.

Tony&Di Naden

unread,
Mar 12, 2015, 5:39:18 AM3/12/15
to shoeboxtoolbox-fiel...@googlegroups.com
Same here.  But these once-ff problems tent to get forgotten in the sigh of relif when you've worked round them.  Check that all your characters are entered in all the parts the language encodings. Maybe consistently change the special characters tou want to use for something else distinctive but less advanced, if it then works OK work backwards to restore codings that will give an output acceptable to your orthography.
--
Address: "Lost Marbles", 31, Reading Road,
Pangbourne, Berks., RG8 7HY  -

Tel.: 01189842368


Lucas Haiduck

unread,
Mar 12, 2015, 7:05:28 AM3/12/15
to shoeboxtoolbox-fiel...@googlegroups.com
Hi,

thanks for your answers. Nick, I checked the encoding at the point of abruption and it is viable unicode. But your idea makes sense, since the export stops exactly at î, which might cause encoding problems due to the combination of the character with the diacritic. Just to be on the sure side of things, I converted the lexicon to UTF-8 encoding with notepad++.

However; incomplete export of the lexicon only applies to the .rtf export!
After looking at the .docx and .odt exports for some time, I realized that they actually printed all entries from the lexicon -- only in an absolute obscure order...

So I have moved from working on a full .rtf export, to getting the .docx export in the correct order. You can see the current sort order of Lexique Pro in the attached screenshot:
It is displayed in the middle and on the right side of the shot. It leaves out letters like b and c, since there are no lexemes starting with those letters in Yali. So this is fine.
The problem is that i and u occur at the end of the list, and the single important differentiation between i and î and u and û is not displayed correctly.

At this point I should probably say that I am using the Toolbox MDF.typ template for the Lexique Pro lexicon version.
But also I tried moving away from the custom simple rules of the Toolbox sort order, and tried defining my own by using the ICU syntax in the "ICU Custom Collation rules" field under "Customize Languages".
I tried various rules, like e.g. "& I < i & Î < î " and others. It changed nothing...

At this point I don't know anymore, whether or not I'm doing something fundamentally wrong! Please help me :)

Lucas
sort_order.jpg

toolbox

unread,
Mar 12, 2015, 11:03:11 AM3/12/15
to shoeboxtoolbox-fiel...@googlegroups.com
Lucas,

Have you tried the "Check Unicode Validity" under the Checks menu in Toolbox? It will check the whole file. Sometimes the trouble spot is a bit beyond the point where the printing stops.

If that doesn't give you any useful results, then send me your project (data, and all the *.prj, *.typ, *.lng files), and I'll see if I can find anything strange. Send it to Toolbox @ sil.org (without the spaces).

Or, if you prefer, contact me on Skype and we can discuss using TeamViewer, which would let me see your computer. I'm Skype ID: KB9876

(For what it's worth, the classic approach to solving such a problem is to trim the data. What happens if you only have the plain i and u and the î and û as the data? What happens if you chop the file off after the first 10 entries after the printing ends? By eliminating the OK data, this type of process should eventually narrow the problem down to a single entry.)

Karen
Toolbox Support

Lucas Haiduck

unread,
Mar 13, 2015, 5:45:37 AM3/13/15
to shoeboxtoolbox-fiel...@googlegroups.com
Hey Karen,

thanks for your help. The Unicode validity check in Toolbox did not reveal any unvalid Unicode characters. I'll be sending you my files.

But I'd like to emphasize again, that all Unicode characters are shown properly in the .docx and .odt exports. Only the sorting order is messed up, and may seemingly not be rearranged/fixed.

Best,
Lucas

TonyNaden

unread,
Mar 24, 2021, 11:08:03 AM3/24/21
to Shoebox/Toolbox Field Linguist's Toolbox
Having advised, now need advice.  My lovely 8,400-word dictionary is perfect in LP on screen but only 2 entries exported. No special characters.

TonyNaden

unread,
Mar 24, 2021, 11:28:25 AM3/24/21
to Shoebox/Toolbox Field Linguist's Toolbox
OK, panic over: for some reason it didn't recognise I (capitaL I)  in my  I.A.1.a.i  system.
Reply all
Reply to author
Forward
0 new messages