常用國字標準字體筆順手冊 | tw | 常用國字標準字體筆順手冊 (Stroke order 14 rules), Taiwan Ministry of Education. Book available online (authoritative). ISBN 957-00-7082-X |
現代漢語通用字筆順規範 | cn | 現代漢語通用字筆順規範, 453 pages, 1997, publisher: 语文出版社, ISBN 7801262018 (authoritative) |
筆順指導の手びき | jp | 筆順指導の手びき (Hitsujun shidō no tebiki), 1958. (Authoritative from 1958 to 1977.) Note: nowadays the Japanese Ministry of Education lets publishers set a character's stroke order freely, provided they « follow commonsensical orders which are widely accepted in the society ». |
香港標準字形及筆順 | hk | 香港標準字形及筆順 - Stroke Order in Hong Kong, Hong Kong Department of Education's List of Commonly Used Characters (authoritative) |
File type | ROC | PRC | Japan |
---|---|---|---|
Sequences (b&w) | *-tbw.png (3) | *-bw.png (1,006) | *-jbw.png (50) |
Shades of red | (0) | *-red.png (192) | *-jred.png (8) |
Animations | *-torder.gif (8) | *-order.gif (300) | *-jorder.gif (8) |
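The suffix convention in the table above can be read mechanically. A minimal sketch, assuming each file is named `<character><suffix>` and that only the suffixes listed in the table occur; `classify_sop_file` and the region/kind labels are hypothetical names, not part of Commons SOP:

```python
# Suffixes copied from the table above; the (region, kind) labels are illustrative.
SOP_SUFFIXES = {
    "-tbw.png": ("ROC", "sequence"),
    "-bw.png": ("PRC", "sequence"),
    "-jbw.png": ("Japan", "sequence"),
    "-red.png": ("PRC", "red"),
    "-jred.png": ("Japan", "red"),
    "-torder.gif": ("ROC", "animation"),
    "-order.gif": ("PRC", "animation"),
    "-jorder.gif": ("Japan", "animation"),
}


def classify_sop_file(filename):
    """Return (character, region, kind), or None if no known suffix matches."""
    for suffix, (region, kind) in SOP_SUFFIXES.items():
        # The leading hyphen is part of each suffix, so "-tbw.png" never
        # matches "-bw.png" by accident.
        if filename.endswith(suffix):
            return filename[: -len(suffix)], region, kind
    return None
```

For example, `classify_sop_file("王-tbw.png")` yields the character together with the ROC/sequence labels, while a name with no listed suffix yields `None`.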
Personally, after 5 years working on the Commons SOP project (CC-BY), I now prefer a Share-Alike licence like KanjiVG's (CC-BY-SA). We share, they share. That is mutual help, and it is fairer.
As for getting stroke order data from Commons SOP, I don't see how: that project works with bitmaps, so there is no automatically extractable data.
There are only 1300 entries on Commons SOP, while KanjiVG has 6000+ (if I'm right). Since most stroke orders are identical across countries, the variants that would have to be added for multi-country coverage are likely just 5~10%. Not a big deal.
Long term:
So for me, progress toward a multi-country system is a long-term issue. Go slowly. What is needed:
* Continue on Japanese only for now ;
* Think calmly about the possibility of a multi-country system :
* * naming (<hanzi T>;<hanzi M>;...) ;
* * sources ;
* * what modifications KanjiVG's wiki would need ;
* * new problems ? (risk of mixing data)
* Go / Wait / Kill decision.
Then:
* Implement the multi-country approach.
* Look through Commons SOP for the characters with diverging stroke orders.
* Create variants: for <kanji>王</kanji>, the Chinese order is different, so we create <hanzi T>王</hanzi T>.
* Leave identical cases empty: when we have <kanji>三</kanji>, don't create <hanzi T>三</hanzi T> or <hanzi S>三</hanzi S>, since they have the same stroke order.
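The "create variants only where the order diverges" idea in the list above amounts to a lookup with a fallback. A minimal sketch, assuming entries are keyed by (character, region) and that the Japanese entry serves as the base; `lookup_stroke_order`, the region codes and the record strings are all hypothetical placeholders, not KanjiVG's actual format:

```python
# Hypothetical in-memory store: (character, region) -> stroke-order record.
# The record strings are placeholders, not real stroke data.
BASE_REGION = "jp"


def lookup_stroke_order(store, char, region):
    """Return the region-specific record if one exists, else the base (Japanese) one."""
    variant = store.get((char, region))
    if variant is not None:
        return variant
    # No variant was created, so the stroke order is taken to be identical.
    return store.get((char, BASE_REGION))


store = {
    ("王", "jp"): "王 (base record)",
    ("王", "T"): "王 (variant record for T)",  # diverging order: variant exists
    ("三", "jp"): "三 (base record)",          # same order everywhere: no variant
}
```

With this layout only the diverging characters cost an extra entry; a request for 三 in region `T` simply falls through to the base record.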
By the way:
Is there a KanjiVG wiki page where we could record some key points on this long-term "multi-country" possibility (sources, approach, etc.)?
OK, together with the hiragana and katakana models, let's use KanjiVG
as the main Japanese model in the next Tegaki release! I think I'll just
include your Ruby script in the model folder.
During the development of a recognition engine, KanjiVG is in my
opinion a more appropriate test set than Tomoe. So Tomoe can be used
as training set and KanjiVG as test set during the development phase
and KanjiVG (+Tomoe) can be used as training set for the released
model, when the engine has been developed.
Mathieu
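The dataset roles described in the message above can be written down as a small configuration helper. A minimal sketch; `choose_datasets` and the phase names are hypothetical and are not part of Tegaki:

```python
def choose_datasets(phase, include_tomoe_in_release=True):
    """Return (training_sets, test_sets) for the given phase.

    'development': train on Tomoe, test on KanjiVG.
    'release': train on KanjiVG, optionally plus Tomoe; no held-out test set.
    """
    if phase == "development":
        return ["Tomoe"], ["KanjiVG"]
    if phase == "release":
        training = ["KanjiVG"]
        if include_tomoe_in_release:
            training.append("Tomoe")
        return training, []
    raise ValueError("unknown phase: %r" % (phase,))
```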
The child just needed a reasonably handy name, and so it became KanjiVG. If this causes international problems, we should think about changing the name, or about also allowing project name variants such as HanziVG and so on. I think new tags aren't really necessary at this moment.
I don't have a solution yet for the naming problem. I will discuss it with Roger, who is also a member of this mailing list, and with other colleagues at Tübingen. I am planning to be in Tokyo in August. Perhaps I can meet with Alex, and we can finalize a naming convention.
It seems that I am getting involved in a project that has to deal with hentaigana and even Egyptian hieroglyphs. Perhaps this will lead to a more general character description language based on SVG.