Two column PDF text extraction mixes the columns

79 views
Skip to first unread message

Awolad Hossain

unread,
Sep 22, 2021, 11:14:31 AM9/22/21
to cloud-vision-discuss
I'm using Google vision with the DOCUMENT_TEXT_DETECTION option. It's extracting text well when the PDF is a single column. But it mixes the text of the columns when the PDF contains multiple columns. Very strange result. But it should extract text left column first and then right.
tts-pdf-41-1632302682929.txt
tts-pdf-41-1632302682929.pdf
Reply all
Reply to author
Forward
0 new messages