How to read text from a table into arrays with Tesseract, given the coordinates of the column and row boundaries?

Bec Zhao

Aug 15, 2018, 7:04:52 AM
to tesseract-ocr
Hi, 

I want to extract the text from tables into arrays that represent the rows and columns of the table.
I have already used OpenCV to obtain the precise boundaries of the table.
Now I want to know which syntax I can use to extract the text from the table and put it into arrays according to the coordinates of the boundaries (or perhaps the joints of the boundaries).


Thanks!



Shree Devi Kumar

Aug 15, 2018, 8:25:51 AM
to tesser...@googlegroups.com
Check whether the hOCR or TSV outputs are useful. Both include the bounding-box coordinates of each recognized word.
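As a rough illustration of the TSV route, here is a minimal Python sketch that sorts recognized words into a 2-D array using boundary coordinates you already have. It assumes the TSV text comes from something like `tesseract table.png out tsv` (which writes `out.tsv`), that word rows carry `level` 5 with `left`, `top`, `width`, `height` and `text` columns, and that the boundaries are sorted pixel coordinates in the same image that was passed to Tesseract. The function name `tsv_to_table` and its parameters are made up for this example, and assigning a word by the center of its box is just one possible choice.

```python
import bisect
import csv
import io


def tsv_to_table(tsv_text, col_bounds, row_bounds):
    """Place each recognized word into table[row][col].

    col_bounds / row_bounds are sorted x / y pixel coordinates of the
    table lines (hypothetical inputs, e.g. from an OpenCV line detector).
    N boundaries describe N - 1 columns or rows.
    """
    n_rows = len(row_bounds) - 1
    n_cols = len(col_bounds) - 1
    cells = [[[] for _ in range(n_cols)] for _ in range(n_rows)]

    # QUOTE_NONE keeps quote characters in the OCR text from confusing the parser.
    reader = csv.DictReader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    for rec in reader:
        text = (rec.get("text") or "").strip()
        # Level 5 rows are individual words; other levels are layout containers.
        if rec.get("level") != "5" or not text:
            continue

        # Use the center of the word box to decide which cell it belongs to.
        cx = int(rec["left"]) + int(rec["width"]) / 2
        cy = int(rec["top"]) + int(rec["height"]) / 2
        col = bisect.bisect_right(col_bounds, cx) - 1
        row = bisect.bisect_right(row_bounds, cy) - 1

        # Words that fall outside the table grid are ignored.
        if 0 <= row < n_rows and 0 <= col < n_cols:
            cells[row][col].append(text)

    # Join the words of each cell in the order Tesseract reported them.
    return [[" ".join(words) for words in row] for row in cells]
```

For example, `tsv_to_table(open("out.tsv", encoding="utf-8").read(), [0, 100, 200], [0, 50, 100])` would return a 2 x 2 list of strings. If you use the pytesseract wrapper instead of the command line, `pytesseract.image_to_data(image)` returns the same kind of TSV text as a string.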



