You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to ductacesti
Iam developing a traindata for Bengali language.The problem is tesseract does not recognize most spaces in the input file and keep almost all the characters of the input image together to make one long word instead of several words and sentences.This is for a big traindata where it detects some spaces, for a small traindata, it detects nothing.I made an English traindata with only 26 English alphabets to test whether tesseract detects spacing for it and it can detect for English but not for Bengali.I am using 3.02.02 windows installer.Please tell me where to edit the configuration to make it work.I am giving some characters of Bengali to see.