I have debugged the code and found that the special characters 'Joined' and '|Broken|0|1' are added while the unicharset file is being generated.
But what is the function of these characters? Can anyone tell me at which stage of the training process they play a role? I can't find it. Thanks a lot.
As for the other special characters, such as 'cl', '|d|0|2', and '|d|1|2', what is their function? Are they added in the combine_lang_model stage?
Can you help me?
Thanks sincerely.
On Tuesday, August 15, 2017 at 1:47:10 PM UTC+8,
roberty...@gmail.com wrote: