Hi Unitex users,
We use richly annotated XML files as input, and Unitex
strips all XML annotations.
Inside Unitex they become "pure" .txt files,
which we then process.
Our local grammars for personal names and so on add new XML tags to the output files via transducer output.
But then we want to have our original XML tags
back in the file.
So we must merge the original XML files with the new annotations from Unitex. This is not trivial!
So my question: Are there solutions?
One possibility would be to hide some
index information in our XML files that is not removed by the Unitex XML input and is
not taken into account by the local grammars in Unitex.
This index information could then be used afterwards to merge the old XML annotations easily with the new Unitex annotations.
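To make the merge problem concrete, here is a minimal sketch in Python of a related idea. Instead of hiding an index inside the files, it records the character offset of every original tag in the stripped text and uses those offsets to put the tags back. This is not a Unitex feature, and the function names strip_tags and merge_annotations are made up for illustration. It assumes that the Unitex output differs from the stripped text only by the inserted tags (no whitespace normalization, no changed characters), that the text contains no literal "<", and that a simple regular expression is enough to recognize tags (comments and CDATA sections are not handled).

```python
import re

# Assumption: every tag matches this simple pattern.
TAG = re.compile(r"<[^>]+>")

def strip_tags(xml):
    """Return (plain_text, tags); tags is a list of (offset, tag) pairs,
    where offset is the position of the tag in the plain text."""
    pieces, tags = [], []
    pos = 0   # current offset in the plain text
    last = 0  # end of the previous tag in the XML string
    for m in TAG.finditer(xml):
        chunk = xml[last:m.start()]
        pieces.append(chunk)
        pos += len(chunk)
        tags.append((pos, m.group()))
        last = m.end()
    pieces.append(xml[last:])
    return "".join(pieces), tags

def merge_annotations(original_xml, unitex_output):
    """Re-insert the original tags into the Unitex output by offset."""
    plain, old_tags = strip_tags(original_xml)
    unitex_plain, new_tags = strip_tags(unitex_output)
    if plain != unitex_plain:
        raise ValueError("text differs, offsets cannot be aligned")
    events = []
    for seq, (offset, tag) in enumerate(old_tags):
        events.append((offset, 1, seq, tag))      # old tags keep their order
    for seq, (offset, tag) in enumerate(new_tags):
        rank = 0 if tag.startswith("</") else 2   # new close first, new open last
        events.append((offset, rank, seq, tag))
    events.sort()
    out, last = [], 0
    for offset, _rank, _seq, tag in events:
        out.append(plain[last:offset])
        out.append(tag)
        last = offset
    out.append(plain[last:])
    return "".join(out)

original = '<p id="1">Hello <b>Max</b> from Munich</p>'
annotated = 'Hello <PersName>Max</PersName> from Munich'
print(merge_annotations(original, annotated))
# <p id="1">Hello <b><PersName>Max</PersName></b> from Munich</p>
```

At equal offsets the new tags are nested inside the old ones. If a new annotation crosses the boundary of an old element, the result has overlapping tags and is no longer well-formed XML, so that case would need extra handling.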
Thanks for your help
Max Hadersbeck, Munich, CIS