Dear All,
I am using Tesseract OCR to train on and recognize some Persian characters plus some shapes. Tesseract recognizes all of the characters and shapes well, except for one shape that has many children and is more complex than the others.
For example, my trained Tesseract recognizes the following characters well, with high confidence:



The confidence for the above characters and shapes is higher than 85, and the pointsize parameter returned by the WordFontAttributes API is 12 for all of them. But for the following shape (the only one that my trained Tesseract does not handle well), I get a pointsize of 4 and a very low confidence of about 35%.
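To make the symptom concrete, here is a minimal sketch of the kind of check involved. It assumes the per-symbol confidence and pointsize have already been read from the Tesseract result iterator. `SymbolResult` and `findSuspectSymbols` are hypothetical names used only for illustration; they are not part of the Tesseract API, and the thresholds are simply the values quoted above.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical record for one recognized symbol. In a real program these
// fields would be filled from the Tesseract result iterator (the confidence
// value and the pointsize reported by WordFontAttributes).
struct SymbolResult {
    std::string text;
    float confidence;  // 0..100, higher is better
    int pointsize;
};

// Returns the indices of symbols whose confidence is below minConfidence
// or whose reported pointsize differs from expectedPointsize.
std::vector<std::size_t> findSuspectSymbols(
        const std::vector<SymbolResult>& results,
        float minConfidence,
        int expectedPointsize) {
    std::vector<std::size_t> suspects;
    for (std::size_t i = 0; i < results.size(); ++i) {
        const SymbolResult& r = results[i];
        if (r.confidence < minConfidence || r.pointsize != expectedPointsize) {
            suspects.push_back(i);
        }
    }
    return suspects;
}
```

With a threshold of 85 and an expected pointsize of 12, only the complex shape (confidence about 35, pointsize 4) would be flagged by such a check.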

To solve this problem, I set the Tesseract API parameters that were proposed in this group for Chinese and Japanese, as follows:
tesseract.SetVariable("save_blob_choices", "T");
tesseract.SetVariable("chop_enable", "T");
tesseract.SetVariable("use_new_state_cost","F");
tesseract.SetVariable("segment_segcost_rating","F");
tesseract.SetVariable("enable_new_segsearch","0");
tesseract.SetVariable("language_model_ngram_on","0");
tesseract.SetVariable("textord_force_make_prop_words","F");
tesseract.SetVariable("edges_max_children_per_outline", "40"); // end of our part!
tesseract.SetPageSegMode(PSM_SINGLE_CHAR);
However, my results did not change.
It should be mentioned that if I don't use tesseract.SetPageSegMode(PSM_SINGLE_CHAR), Tesseract treats the complex shape as 4 to 5 different characters.
I would greatly appreciate any help.
Regards,
Hasan