Constrained Data

19 views
Skip to first unread message

Samuel Larkin

unread,
May 27, 2025, 11:43:43 AMMay 27
to LLMs with Limited Resources for Slavic Languages 2025
Hi,
  I would like to clarify what can be used to train a model.  Are we strictly limited to data from:
Are we allowed, for example, to use other resources from different languages (i.e., not DSB/HSB)?
Can I used another Question and Answer dataset that is not dsb/hsb/de?
Can I used, for example, de-pl bilingual text?


Thanks

Daryna Dementieva

unread,
May 28, 2025, 6:37:27 AMMay 28
to LLMs with Limited Resources for Slavic Languages 2025
Dear all,

Yes, we allow additional data usage. For fairness and reproducibility, the resources should, however, be publicly available. 

Best,
Organizers

вторник, 27 мая 2025 г. в 17:43:43 UTC+2, samuel...@gmail.com:
Reply all
Reply to author
Forward
0 new messages