Page_Separator


mit

Apr 11, 2020, 4:33:58 AM
to tesseract-ocr
Can anyone help me with a page separator issue?


I am trying to OCR multiple images into a single txt file and want to put a page separator after each line. What happens is that the content of the next line ends up on the same line as the page separator.

For instance, if I use the command

tesseract img.txt out -c page_separator="[*****]"

the page separator line in the output is [*****]5.3. where 5.3 is the content of the next line. But I want 5.3 to be on the next line, not on the same line as the page separator.


TIA

Lakshay Saini

Apr 11, 2020, 5:09:36 AM
to tesseract-ocr
Hello

Try page_separator="[*****]\n"

Regards
Lakshay

mit

Apr 11, 2020, 7:59:54 AM
to tesseract-ocr
That puts a literal \n in the output too, and my problem is unchanged.
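
A possible explanation, offered as an untested sketch rather than a confirmed fix: inside double quotes the shell leaves \n as two characters (a backslash and the letter n), and the literal \n in the output suggests that Tesseract uses the value of -c verbatim without interpreting escape sequences. That last point is an assumption. The snippet below only shows the difference between the two strings; the variable names literal, nl, and real are made up for the example.

```shell
# Inside double quotes, \n stays as two characters: a backslash and an n.
literal="[*****]\n"

# A real newline can be embedded by quoting an actual line break
# (portable POSIX sh; in bash, $'[*****]\n' builds the same string).
nl='
'
real="[*****]${nl}"

# Compare the lengths: the second string is one character shorter
# because \n has become a single newline character.
echo "literal: ${#literal} characters"
echo "real:    ${#real} characters"

# Assumed usage, not verified against Tesseract here:
# tesseract img.txt out -c page_separator="$real"
```

If the separator still runs into the following text with a real newline in place, the cause lies elsewhere and this sketch does not apply.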