tesseract inconsistent...

R B

Mar 6, 2021, 3:28:44 AM
to tesseract-ocr
Hi,

I could use some help with tesseract v5.0.0-alpha.20200328 (I've also tried 4.1). I'm getting the following unwanted behavior:

  • Asterisks are not recognized
  • Some upper-case characters are recognized as lower case
  • Some characters are recognized incorrectly

Here's the output I get for the attached image:

Total: $5.00
Card Type: JCB
Trn Crd Tp Acct = Amount
RRR KK KERR RK KEK RK RRR RR KERR KKK     <--- this line should be all asterisks
6 Jc SL 7684 $5.00                        <--- "Jc" should be all upper case: "JC"
Records: 1
Total: $5.00
Card Type: Discover
Trn Crd Tp Acct = Amount
RRR KK KERR RK KEK RK RRR RR KERR KKK     <--- asterisks
4 bc sb 1119 $5.00                        <--- "bc sb" should be "DC SL"
Records: 1
Total: $5.00
Card Type: Diners
Trn Crd Tp Acct = Amount
RRR KK KERR RK KEK RK RRR RR KERR KKK     <--- asterisks
5 DI SL 3330 $5.00
Records: 1

My command line is: "tesseract 2.bmp 2.bmp -l eng --psm 6"
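
For anyone who wants to reproduce this from a script, the same invocation could be wrapped roughly like this. It's only a sketch: the helper names build_tesseract_command and run_tesseract are made up for illustration and are not part of tesseract, and it assumes the tesseract binary is on the PATH. Since the second argument is the output base, tesseract writes the text to 2.bmp.txt here.

```python
import shutil
import subprocess


def build_tesseract_command(image, output_base, lang="eng", psm=6):
    """Return the argument list for the command line quoted above.

    Hypothetical helper for illustration; it only assembles arguments.
    """
    return ["tesseract", image, output_base, "-l", lang, "--psm", str(psm)]


def run_tesseract(image, output_base, lang="eng", psm=6):
    """Run tesseract and return the recognized text.

    Returns None if no tesseract binary is found on the PATH.
    """
    if shutil.which("tesseract") is None:
        return None
    subprocess.run(build_tesseract_command(image, output_base, lang, psm), check=True)
    # tesseract appends ".txt" to the output base it is given
    with open(output_base + ".txt", encoding="utf-8") as f:
        return f.read()


if __name__ == "__main__":
    # Same arguments as the command line in this post
    print(build_tesseract_command("2.bmp", "2.bmp"))
```

Keeping the argument list in one place makes it easier to try other --psm values or languages without retyping the whole command.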

I've tried looking at the many configuration parameters and, well, I only have so much time in my life. :)

Thanks!

Rey

2.bmp