This just came across on Corpora List, may be of interest (particularly
the part about IK Analyzer, q.g.*):
------------------
I have been doing corresponding google searches but nothing clear comes
out of the murky waters of the internet... Is there some corpus of
traditional chinese to be had, be it under a commercial or free license?
Or for the lack of it, at least a tool that can tokenizse traditional
chinese into words? I am aware of the existing tools for simplified
chinese such as IK Analyzer - and I know that they would likely work
from traditional chinese as well, provided some word lists - which leads
me to the first question.
Thank you in advance,
Stefan Bordag
--
-------------------------------------------
- Dr. Stefan Bordag -
- 0341 49 26 196 -
- sbo...@informatik.uni-leipzig.de -
-------------------------------------------
------------------
Mike Maxwell
*q.v.: quod vidē
q.g.: quod gōgle