to EDICT-JMdict
Hi folks. A few days ago I downloaded the KRADFILE dataset to supplement my own language learning, and in the process of working with it I noticed a few things about the characters used to represent radicals which are not directly included in the JIS X 0208 encoding used in that file, but do exist in Unicode. Specifically, the following entries in the comments at the top of the file, which I'm annotating here with the corresponding Unicode characters:
Each of the former looked like it could easily be a typo or transcription error for the latter, given how close the code points are numerically, but it's also possible there's a deeper etymological significance I'm not aware of.
I also had a thought about this entry:
并 none available - upside-down ハ
Would 丷 U+4E37 CJK UNIFIED IDEOGRAPH-4E37 be a suitable character for that? It visually seems to fit the description, and I've seen it used as a composing character for e.g. 兑 = ⿱丷兄 on Wiktionary, but I don't know how "official" that is or if it's just an ad-hoc usage.
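For anyone who wants to check candidates like this themselves, here is a rough Python sketch. It prints a character's code point and Unicode name, and tests whether Python's `iso2022_jp` codec can encode it, which I'm using as a stand-in for "is in JIS X 0208". The helper names `describe` and `encodable_iso2022_jp` are my own, and the codec check is an assumption rather than an authoritative test against the standard.

```python
import unicodedata


def describe(ch: str) -> str:
    """Return the character, its code point, and its Unicode name."""
    return f"{ch} U+{ord(ch):04X} {unicodedata.name(ch, '<unnamed>')}"


def encodable_iso2022_jp(ch: str) -> bool:
    """True if Python's iso2022_jp codec can encode the character.

    Assumption: for kana and kanji this approximates membership in
    JIS X 0208. The codec also accepts plain ASCII.
    """
    try:
        ch.encode("iso2022_jp")
        return True
    except UnicodeEncodeError:
        return False


# Characters mentioned in this thread.
for ch in "并丷ハ":
    print(describe(ch), "| iso2022_jp:", encodable_iso2022_jp(ch))
```

A character that the codec rejects would need a placeholder in the JIS version of the file, which is the situation the "none available" comments describe.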
Jim Breen
Mar 28, 2023, 1:46:09 AM
to edict-...@googlegroups.com
Many thanks, Jim, for that feedback.
Yes, those three were typos. I have corrected them in that header
file. I also added the Unicode codepoint you suggested for 并. It seems
fine for that purpose.
Some day I may get to converting the whole thing to Unicode, but it's
rarely changed and there are quite a few legacy systems using the JIS
version, so I'll probably never get the time or energy.