Context: I have a series of texts by a previous researcher who used a more or less phonetic orthography at a level of detail such that any given token of a word may be spelled many different ways. In addition, I have a variety of word lists in a variety of orthographies from a variety of researchers, including the transcriber of the texts.
The texts are linked to a word-level lexicon where the regularized spelling is the record marker, and the orthographic variants are separate fields, each followed by the gloss of that token, and a source citation. In essence, each variant is treated as a subentry. In the marker hierarchy, sources are under glosses, which are under orthographic variants, which are under the regularized spelling. A variant can have multiple glosses from different citations:
regularized spelling
[various fields, e.g. parse]
orthographic variant 1
source 1
source 2
gloss 2
orthographic variant 2
[etc]
Interlinearization consists solely of lookups, going from the orthographic variant to other data (gloss, citation, etc)
Problem: While that setup was fine for the original interlinearization, performing an updated interlinearization after extensive additions to the lexicon runs into the problem of the lookups not being restricted to the original data "subentries".
I.e. lookup on orthographic variant 2 will find the correct regularized spelling and e.g. return the parse, but the lookup from orthographic variant 2 to gloss will take the very first gloss in the main entry, gloss 1, instead of gloss 3, often producing errors as those were not necessarily formatted for interlinearization (e.g. characters not allowed). Lookup from orthographic variant 2 to source will pull up an unsortable list of all citations in the whole entry, rather than source 4.
Is there a way to tell the lookup to use the gloss from the orthographic variant looked up, rather than the first one in the main entry? Currently, I have to go into each entry and re-order the subentries so that the one needed is at the top, and re-order the fields in that subentry, etc. The next time another variant occurs in the text, I have to re-order the lexicon entry fields again.
Thanks,
- Craig