Hello,everyone:
I want to recognize the content in the table image.(You can get it in the attach file).It contains Chinese characters and some English letters, the most troublesome problem is that it contain a ancient Greek character "Φ".
I do not how to train the model. I tried add Greek font but no use. The first step is error.
This is my command:
src/training/tesstrain.sh --fonts_dir /usr/share/fonts --training_text ../training_data/chi_sim_tuned.txt \
--langdata_dir ../langdata_lstm --tessdata_dir ./tessdata --lang chi_sim --linedata_only --noextract_font_properties --exposures "0" \
--workspace_dir ~/share/workspace/tmp \
--save_box_tiff \
--fontlist "NSimSun" \
"Times New Roman" \
"Arial Unicode MS" \
"SimSun" \
"Noto Sans CJK SC" \
"Noto Sans Mono CJK SC" \
"GFS Artemisia" \
--output_dir ~/tesstutorial/chi_sim_train \
--overwrite
Can someone help me,thanks in advances.