Data in Excel sheet

114 views
Skip to first unread message

INAM HAQ

unread,
Oct 9, 2019, 1:03:34 PM10/9/19
to tesseract-ocr
Hi Friends,

Please advise me how to get the table data from image in csv format using tesseract?

Inam

Murtuza Dahodwala

unread,
Feb 16, 2021, 7:16:34 AM2/16/21
to tesseract-ocr
+1

Kostas

unread,
Feb 16, 2021, 1:55:58 PM2/16/21
to tesseract-ocr
I just read the documentation, perhaps goes like that: 

Tables recognitions

It is known tesseract has problem to recognize text/data from tables (see issues tracker) without custom segmenation/layout analyze. You can try to use/test Sintun proposal or get idea for Text Extraction from a Table Image, using PyTesseract and OpenCV/code for Text-Extraction-Table-Image


Murtuza Dahodwala

unread,
Feb 18, 2021, 6:32:07 AM2/18/21
to tesseract-ocr
Thank you for your response @kostas.
I have already tried these approaches and they do not work for me as my tables do not have grids to classify each cells.

Reply all
Reply to author
Forward
0 new messages