Dear UniMorph Organizers,
I hope this message finds you well.
I am Kosuke Matsuzaki from Tohoku University in Japan.
I am writing to inform you that we have created a Japanese dataset in the UniMorph format. We believe it will be a valuable addition to the UniMorph project, and we would like to merge it into the main repository.
In addition, the existing Japanese dataset in UniMorph (automatically extracted from Wiktionary) is currently registered under the language code "jap". However, this code is considered outdated and potentially offensive in some contexts (https://en.wikipedia.org/wiki/Jap). Furthermore, changing the code to "jpn" would align it with the ISO 639-2 standard, ensuring consistency and accuracy in language representation.
I am also pleased to share that I will be presenting our dataset at the upcoming 21st SIGMORPHON Workshop at NAACL 2024.
Would it be acceptable for us to open a pull request that adds the new dataset and updates the language code?
Please let me know if there are any specific guidelines or requirements I should follow.
Thank you for your time and consideration.
Best regards,
Kosuke MATSUZAKI
Tohoku University, Japan