I do not know much about this (Ulrich is the man for this kind of
stuff) but AFAIK radical information is not chosen according to
graphical ressemblance, but rather according to ethymology/historical
matters. That's why you can find such things.
> This data is very useful, but might be better auto-generated by
> another means. I have written software to do stroke comparison and I
> am thinking to have it work through this database to try and match the
> strokes to radicals/other kanji by comparing the strokes instead of
> using the data in the XML file. If it turns out to be more accurate I
> would be more than happy to share the result, but I'd like to know how
> you arrived at the current data set first.
I am not sure if this is what we want to have - or maybe this could be
added to the existing data as a new attribute. Would like to hear what
Ulrich things about this, as I prefer to have him decide for this
stuff.
Alex.