--
You received this message because you are subscribed to the Google Groups "unimorph" group.
To unsubscribe from this group and stop receiving emails from it, send an email to unimorph+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/unimorph/CADYt9QG1e%2BQ2QSHYf8bFnXSTbTr499Hmn3wka%3Drg2xyk%2BKfgcQ%40mail.gmail.com.
--
Hi Kat,

Thank you very much for your informative and encouraging response.

I am excited to hear about the work done by UniMorph, especially in Semitic languages. The resources you shared are extremely valuable and I will dig into them in the upcoming days and weeks.

Regarding my existing data: I am currently involved with the Northeastern Neo-Aramaic Database Project at the University of Cambridge. Our database includes a comprehensive description of morphological features in most dialects. Additionally, extensive grammatical descriptions, each running to about 2000 pages, have been produced for several dialects. These descriptions often include texts and lexicons. This data is what made a complete corpus of surface forms for the Christian Urmi dialect possible. I will look at the resources you provided and confirm whether or not I can continue this work through UniMorph. However, this would require a conversation about how to support a language that is not recognized with appropriate ISO codes and exists as a spectrum of many dialects.

Regarding ISO 639-3 codes: I would appreciate any information on how to register new ISO codes, if Antonis or Pomak has any information.

Kindly,
Matthew

On Wed, Dec 13, 2023 at 11:38 PM Kat Vylomova <evyl...@gmail.com> wrote:

Dear Matthew,

Thank you for your interest! Wow, that's quite impressive! UniMorph has a group of annotators who work(ed) on Semitic languages; you may have a look at the languages that we have annotated so far, the feature set, and the issues we faced: https://docs.google.com/spreadsheets/d/1CEkZW2RdZpAFD6Go8SG3lQJFkhLsLeec_wBzCt8NdRI/edit#gid=0 (I CC'ed some annotators as they might be interested as well). You may also check our (more general) annotation instructions here: https://unimorph.github.io/doc/unimorph-schema.pdf and particular examples, e.g. for Hebrew: https://github.com/unimorph/heb

At some point, we also created a Google group for annotators of Semitic languages (for annotation-related discussions). I am not sure how active it is, but it is worth giving it a try: https://groups.google.com/g/unimorph-semitic

Unfortunately, I cannot access the data on the website. What does it provide? Texts in those languages, or morphological paradigms? Do you have any annotations already? I am happy to help or advise on further steps if you provide a bit more information on the data you have. So far, the UniMorph database has been enriched with data from the English edition of Wiktionary, various inflection tables (full paradigms), FSTs (full paradigms), and glossed texts (partial paradigms).

Regarding ISO 639-3 codes: As far as I know, it is possible to submit a request to register a language/dialect. I recall Antonis (also CC'ed) did this for Pomak (?); he might suggest something.

In any case, we would be happy to have you as a part of the team! :-)

Warm regards,
Kat

On Thu, Dec 14, 2023 at 2:37 PM Matthew Nazari <matthe...@college.harvard.edu> wrote:

There are over 150 dialects of Northeastern Neo-Aramaic (NENA), a diverse group of dialects spoken by marginalized Christian and Jewish communities from northwestern Iran, northern Iraq, and southeastern Turkiye.

The issue is that NENA is not like other languages that UniMorph supports. It does not have a prestige dialect that can represent all of them, and it does not even have appropriate ISO 639-3 codes.

What can the UniMorph project do to support languages like NENA, languages of communities like mine?
To view this discussion on the web visit https://groups.google.com/d/msgid/unimorph/CADYt9QEpzmY5E5cKN%3Dn_wUPoE1cEWUbGfgeS0N4KdWK_oPVB2Q%40mail.gmail.com.