Dear Slawomir,
I guess there are no really good solutions.
In theory, it depends on what you want to do with it. You could have
THREE different corpora,
SYR, GRE, ARA, which would each have ALL the available text in that
language, and in addition the other translations (either incomplete or
with dummy sentences).
This would give you correct counts and easy searches in the primary
language. So, if you search for VerbX in all the texts available in
lang1, you search corpusLang1, and you get exactly what's there.
I think it will be difficult to get any better without complicated
dynamic measures.
Best,
Ruprecht
Am 01.11.21 um 11:58 schrieb Slavomír Čéplö: