clitic boundaries


Weijian Meng

Aug 24, 2020, 9:47:58 AM
to Shoebox/Toolbox Field Linguist's Toolbox
Hi! 

Does anyone know how to specify '=' as the clitic boundary for interlinearization? I have attached '=' to the clitics in the lexicon; however, clitic=host combinations won't parse unless they are separated by a space.

Example: 

lexicon: 
\lx pp
\a p=

\lx q
text: 
\po p=q: does not parse
\po p= q: parses

Thanks for any advice! 

Weijian

ToolBox Support

Aug 24, 2020, 2:36:01 PM
to shoeboxtoolbox-fiel...@googlegroups.com
The result I think you are looking for is:
\tx pq
\mb pp=    q
If that's not the result you wanted, let me know, and perhaps send some real data as your example.

Assuming the above is what you wanted, then you need to do three different things:

First, add the = to the Morpheme Break Characters:
Place your cursor in the text file
Do Database, Properties
Choose the Interlinear tab (1 below)
Select the Parse process (2 below)
[screenshot: Interlinear tab, with callouts 1-3]
Click on Modify (3 above)
Toolbox will open the dialog containing various parse information.
Add the = to the list of Morpheme Break characters
Click OK or Close until you are back at the main Toolbox window.

Second, tell the Language Encoding to ignore the =:
Place your cursor in the text line.
Do Project, Language Encodings. The Language Encoding of the text line will already be selected.
Click on Modify.
Toolbox will open the Language Encoding Properties dialog box and will default to the Sort Orders. 
Click on Modify 
Add the = to the Ignore sequence.
Again, click OK or Close to return to the main Toolbox window.

Third, modify your dictionary entry.
For the example you provided to produce a parse of pp= q, the dictionary entry needs to be modified as follows:
\lx pp=
\a p=
with the = added to the lexeme as well as the alternate.

If a parse of p= q is what you were really looking for:
\tx pq
\mb p= q
then you would just have:
\lx p=
with no need for the \a p=.
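
As a rough illustration of how the three settings fit together, here is a toy model in Python. It is not how Toolbox is implemented, and it covers only a single proclitic attached to a host. The names BREAK_CHARS, IGNORE_CHARS, strip_ignored, and parse are made up for this sketch, and treating "-" as an already-present break character is an assumption.

```python
# Toy model for illustration only; this is not Toolbox's actual parser.
BREAK_CHARS = "-="   # step 1: "=" added (the "-" is an assumed existing entry)
IGNORE_CHARS = "="   # step 2: "=" listed in the sort order's Ignore sequence


def strip_ignored(text):
    # Drop characters that matching should skip over.
    return "".join(ch for ch in text if ch not in IGNORE_CHARS)


def parse(word, lexicon):
    # lexicon maps each lexeme (\lx) to its list of alternate forms (\a).
    # Returns [clitic_lexeme, host_lexeme], or None if nothing matches.
    surface = strip_ignored(word)
    for lexeme, alternates in lexicon.items():
        if lexeme[-1] not in BREAK_CHARS:
            continue  # only entries ending in a break character attach to a host
        for form in [lexeme] + alternates:
            bare = strip_ignored(form.rstrip(BREAK_CHARS))
            rest = surface[len(bare):]
            if bare and surface.startswith(bare) and rest in lexicon:
                return [lexeme, rest]
    return None


# Step 3, first variant: \lx pp= with \a p=
print(parse("p=q", {"pp=": ["p="], "q": []}))  # ['pp=', 'q']
# Step 3, second variant: \lx p= with no alternate
print(parse("p=q", {"p=": [], "q": []}))       # ['p=', 'q']
```

In this sketch, an entry counts as a clitic only if its lexeme ends in a break character, which mirrors the advice above to put the = on the \lx field itself.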
I realize that you sent totally generic data, and I understand, but it makes it a bit hard to know how to advise on the dictionary.

Toolbox Support




