Hi,
I discovered recently that the LocateTfst program (which is the "automaton intersection" option of Text > Locate Pattern) has a strange behaviour since a revision of July 2010: the now here query (or a now --> here path in a graph) finds occurrences of nowhere, and the nowhere query finds now here. Word boundaries inside the query or inside the occurrence are not required to be identical. Such behaviour may be convenient for Korean and other languages where spaces between words are easily omitted or inserted, but it is counter-intuitive for English or Romance languages: now here does not mean nowhere, to get her has nothing to do with together. I suggest this feature should be limited to Korean and perhaps other languages.
This feature is a difference between LocateTfst and the Locate program (which is the default option of Text > Locate Pattern): by default, with Locate, now here and nowhere do not recognize each other. This difference makes it more complex to read and interpret graphs, and I don't see the advantage, except for Korean. LocateTfst was designed as a new version of Locate with more functionality (like dictionary-based morphological analysis, rule-based ambiguity resolution), but as much as possible of the functionality of Locate has been preserved.
The feature is misleading. With LocateTfst, even though the <now> query recognizes now and <here> recognises here, a <now> --> <here> path does not recognize nowhere.
The feature looks a little like Locate's morphological mode, but it is different in fact. In the morphological mode, a now --> here path in a graph finds nowhere, but a <now> --> <here> path finds it too, and a nowhere box does not find now here. All these differences are difficult to remember and add further confusion to the interpretation of graphs.
Does any user take benefit of this behaviour of LocateTfst? In which language?
Best,
Eric Laporte