Hi,
> On 9. Sep 2021, at 13:15, Yanir Marmor <yan...@gmail.com> wrote:
>
> The CATMA-TEI Export Format documentation is here.
>
> In addition, I've attached a short example of text and annotations I made before.
I believe a generic mapping from the CATMA data model to the INCEpTION data model
is currently not possible. In particular, the tag hierarchy in CATMA and the ability
of a property to have multiple values are problematic.
What also seems a bit inconvenient for an automated conversion process is that
the CATMA TEI file does not contain the name of the text file anywhere.
However, the file you provided looks rather simple with a flat list of tags that have
no properties - unless I am overlooking something. Such a structure should be mappable to
INCEpTION, either by mapping each tag to a separate annotation layer named as the tag
or by mapping all tags to the same annotation layer and using the tag name as a string
feature value.
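To make the two options concrete, here is a minimal sketch in plain Python. It does
not use catma-py or DKPro Cassis, and it does not parse CATMA TEI. The Span class,
the two function names, the feature name "value" and the sample text are all
hypothetical and only illustrate the shape of each mapping, assuming the tags have
already been extracted as (begin, end, tag) character offsets.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """Hypothetical stand-in for one flat CATMA tag instance without properties."""
    begin: int
    end: int
    tag: str


def map_to_layer_per_tag(spans):
    """Option 1: one annotation layer per tag, named after the tag."""
    layers = defaultdict(list)
    for span in spans:
        layers[span.tag].append((span.begin, span.end))
    return dict(layers)


def map_to_single_layer(spans, feature="value"):
    """Option 2: one shared layer, with the tag name as a string feature value."""
    return [{"begin": s.begin, "end": s.end, feature: s.tag} for s in spans]


# Made-up sample data, not taken from the attached file.
text = "Alice met Bob in Paris."
spans = [Span(0, 5, "person"), Span(10, 13, "person"), Span(17, 22, "place")]

per_tag = map_to_layer_per_tag(spans)
single = map_to_single_layer(spans)
```

With option 1, the number of layers grows with the tagset. With option 2, there is
a single layer and the tagset only shows up as feature values, which is probably
easier to maintain if tags are added later.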
There appears to be a catma-py Python library which could be used in conjunction with the
DKPro Cassis library to write a script which loads, maps, and saves the result.
Cheers,
-- Richard