Hi,
Is there any configuration parameter or setting to detect the beginning of a paragraph when using Tesseract?
I am using Tess4J to convert a PDF to text. I am having trouble recognising when a paragraph continues across a page break, i.e., when the page changes in the PDF, I cannot tell whether the first line of the next page continues the last paragraph of the previous page or starts a new paragraph.
Thanks,
Ashish