Hi,
INCEpTION imports data from the CoNLL-U file to its internal format. It only imports the data it understands.
It does not understand the misc column. It only understands
* Part-of-speech tagging (built-in),
* Lemmatization (built-in),
* Morphological analysis (built-in),
* Dependency parsing (built-in),
* Text normalization (built-in)
See:
https://inception-project.github.io/releases/39.6/docs/user-guide.html#sect_formats_conllu
When when export, it is the same. It only exports the information it knows how to represent in CoNLL-U (see above).
So unsupported or custom layers are not exportable into CoNLL-U.
The misc column of CoNLL-U is not standardized. Different corpora use it differently.
Also it is rather restricted. It only has key-value pairs and no type information
(e.g. does a key accept a number or string, etc.).
So it is unclear what the information in the misc column means and which layer/feature INCEpTION should
map it to. Likewise, it is unclear how unsupported/custom layers/features would be reasonably mappable
into the misc column.
Theoretically, INCEpTION could "weave" updated annotations into the original CoNLL-U files. So if you
imported a CoNLL-U and exported it again, INCEpTION could try to just updated e.g. POS information in
the original file instead of writing out a fresh file. That would in theory be able to preserve the misc
column. However, this approach would require that token and sentence boundaries must not be changeable
in INCEpTION. While currently, they are not changeable by default, there is already experimental support
for making them changeable. And when this is used, it would be unclear how to handle the information
in the misc column again, e.g. if a token was split into two, what happens to the misc data?
What information is stored in the misc column in your case?
Cheers,
-- Richard