Tesseract OCR table data not printing correctly


Srikanth Vijayakumar

Mar 1, 2020, 11:26:47 AM
to tesseract-ocr
Hello,

I have a PDF file containing a table of serial numbers (1-50), each with a corresponding dollar value (e.g. $10655.9). When I try to extract the table with Tesseract, it prints the output below:

[| oRoUP | RAYE [ |
| 1 | $106559| |
| 2 | seoatslodPomeroySweetsutes | |
[3 | sies%o5lComandNv.i5 | |
| 4 [| s2146.70[Phonesc0r7Eaeee0 [|
| 5 [| seras:o] | |
6 [| sass ©
7 | stgsansf | |
8 [sms] 1
es [seams] 1

The dollar values and serial numbers are being recognized as garbled text. I have tried various trained data files, but I still cannot extract the exact data. Please help.
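To make the problem concrete, here is a minimal Python sketch of the row format I expect and of one invocation that might be worth trying. It is only a sketch under assumptions: the helper names `parse_row` and `tesseract_args` and the file name `page.png` are hypothetical, the PDF page is assumed to have been rendered to an image first (Tesseract does not read PDF input directly), and I have not confirmed that `--psm 6` or a `tessedit_char_whitelist` setting fixes the output for this table. The whitelist may also be ignored by some Tesseract versions when the LSTM engine is used.

```python
import re
from typing import List, Optional, Tuple

# Expected row shape: "| <serial 1-50> | $<amount> |", e.g. "| 1 | $10655.9 | |".
ROW_RE = re.compile(r"^\s*\|?\s*(\d{1,2})\s*\|\s*\$(\d+(?:\.\d+)?)\s*\|")


def parse_row(line: str) -> Optional[Tuple[int, float]]:
    """Return (serial, amount) if the line looks like a valid table row, else None."""
    match = ROW_RE.match(line)
    if match is None:
        return None
    serial = int(match.group(1))
    if not 1 <= serial <= 50:  # the table only has serial numbers 1-50
        return None
    return serial, float(match.group(2))


def tesseract_args(image_path: str, psm: int = 6,
                   whitelist: str = "0123456789$.") -> List[str]:
    """Build a tesseract command line (hypothetical helper; nothing is executed here).

    Assumptions: page segmentation mode 6 (single uniform block of text)
    and a character whitelist limited to digits, '$' and '.'.
    """
    return [
        "tesseract", image_path, "stdout",
        "--psm", str(psm),
        "-c", f"tessedit_char_whitelist={whitelist}",
    ]


if __name__ == "__main__":
    # Rows that fail parse_row are the ones Tesseract misread.
    for line in ["| 1 | $10655.9 | |", "| 2 | seoatslodPomeroySweetsutes | |"]:
        print(repr(line), "->", parse_row(line))
    print(" ".join(tesseract_args("page.png")))
```

The list returned by `tesseract_args` could be passed to `subprocess.run`, and every output line for which `parse_row` returns `None` is a row that still needs attention.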

Thanks
Sri
