Windows 7 Bangla Word Software Download

0 views

Skip to first unread message

Yahaira Petrov

unread,

Aug 4, 2024, 9:35:43 PM8/4/24

to ductacesti

Iam developing a traindata for Bengali language.The problem is tesseract does not recognize most spaces in the input file and keep almost all the characters of the input image together to make one long word instead of several words and sentences.This is for a big traindata where it detects some spaces, for a small traindata, it detects nothing.I made an English traindata with only 26 English alphabets to test whether tesseract detects spacing for it and it can detect for English but not for Bengali.I am using 3.02.02 windows installer.Please tell me where to edit the configuration to make it work.I am giving some characters of Bengali to see.