Unicode radical mappings in KRADFILE

18 views
Skip to first unread message

Jim Wisniewski

unread,
Mar 28, 2023, 12:20:56 AM3/28/23
to EDICT-JMdict
Hi folks. A few days ago I downloaded the KRADFILE dataset to supplement my own language learning, and in the process of working with it I noticed a few things about the characters used to represent radicals which are not directly included in the JIS X 0208 encoding used in that file, but do exist in Unicode. Specifically, the following entries in the comments at the top of the file, which I'm annotating here with the corresponding Unicode characters:
Would the following be better matches instead?
Each of the former looked like they could easily be typos/transcription errors for the latter, given how close the code points are numerically, but it's also possible there's a deeper etymological significance I'm not aware of.

I also had a thought about this entry:
  • 并 none available - upside-down ハ
Would 丷 U+4E37 CJK UNIFIED IDEOGRAPH-4E37 be a suitable character for that? It visually seems to fit the decription, and I've seen it used as a composing character for e.g. 兑 = ⿱丷兄 on Wiktionary, but I don't know how "official" that is or if it's just an ad-hoc usage.

Jim Breen

unread,
Mar 28, 2023, 1:46:09 AM3/28/23
to edict-...@googlegroups.com
Many thanks, Jim, for that feedback.

Yes, those three were typos. I have corrected them in that header
file. I also added the Unicode codepoint you suggested for 并. It seems
fine for that purpose.

Some day I may get to converting the whole thing to Unicode, but It's
rarely changed and there are quite a few legacy systems using the JIS
version, so I'll probably never get the time or energy.

Thanks again.

Jim
> --
> You received this message because you are subscribed to the Google Groups "EDICT-JMdict" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to edict-jmdict...@googlegroups.com.
> To view this discussion on the web visit https://groups.google.com/d/msgid/edict-jmdict/3d46e0c5-6544-47dd-bad9-057a38dce637n%40googlegroups.com.



--
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
http://www.jimbreen.org/
http://nihongo.monash.edu/
Reply all
Reply to author
Forward
0 new messages