Different behaviour on Ubuntu with tesseract cf0b378 and Mac OS X with tesseract 3.05.01

87 views
Skip to first unread message

David Sixela

unread,
Sep 26, 2017, 4:02:33 AM9/26/17
to tesseract-ocr
I recently changed my computer from a PC running Ubuntu 16.04 to a MacBook Pro with Mac Os X 10.12.6.
I'm working on a program using tesseract (pytesseract 0.1.7) and opencv 3.3.0 for automatic text extraction on Id cards.
The problem that i'm facing right now is that my program doesn't work properly, the OCR is completely false on my MacBook and i don't get why. I'd like to know what i should do to make it work on MacBook Pro the same way it works on Ubuntu. Should i downgrade? if yes how. Is there new parameters on the new version 3.05.01 ?

configuration :
-Ubuntu 16.04:
  tesseract was build from source
  tesseract --version :
  tesseract cf0b378
    leptonica -1.74.1
      libjpeg 8d (libjpeg-turbo 1.4.2): libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8

    FOUND AVX
    FOUND SSE

-MacBook os X 10.12.6 :
  tesseract installed via Homebrew
  tesseract --version:
  tesseract 3.05.01
    leptonica-1.74.4

      libjpeg 9b : libpng 1.6.32 : libtiff 4.0.8 : zlib 1.2.8

Example:
on this particular image :

The output of this command 
tesseract image.jpg stdout

on Ubuntu's configuration : 
Gabo / M

MacBook Pro 's configuration: 
GM!"

ShreeDevi Kumar

unread,
Sep 26, 2017, 4:14:14 AM9/26/17
to tesser...@googlegroups.com
you can build 4.o on macos

$ brew update
$ brew install tesseract --HEAD

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/12aba377-d071-4b8e-a1d4-12f63ed43ce6%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

David Sixela

unread,
Sep 26, 2017, 4:18:03 AM9/26/17
to tesseract-ocr
Thanks for your answer, i'm going to try that. But do you think this will solve that issue? what's wrong with 3.05.01?
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.

ShreeDevi Kumar

unread,
Sep 26, 2017, 4:27:33 AM9/26/17
to tesser...@googlegroups.com
I thought you wanted to match the version you had built from source. 

3.05.01 uses legacy tesseract engine.

building from source in master branch will give you 4.00.00alpha - LSTM based OCR. It has not been released yet.


ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

David Sixela

unread,
Sep 26, 2017, 5:34:19 AM9/26/17
to tesseract-ocr
I just ran into errors while trying to install tesseract --HEAD:

Error: Failed to download resource "tesseract--tessdata-head"

Failure while executing: git clone --depth 1 --branch master https://github.com/tesseract-ocr/tessdata.git /Users/drnnokc/Library/Caches/Homebrew/tesseract--tessdata-head--git

ShreeDevi Kumar

unread,
Sep 26, 2017, 5:46:05 AM9/26/17
to tesser...@googlegroups.com
sorry, can't help as I do not have a mac.

Please post on the related issue - maybe @stweil can help.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

David Sixela

unread,
Sep 26, 2017, 6:02:32 AM9/26/17
to tesseract-ocr
Thanks for your help, i'll look deeper and see what i can do.

David Sixela

unread,
Sep 26, 2017, 6:18:56 AM9/26/17
to tesseract-ocr
I solved this error just by doing 
brew update
brew install tesseract
--HEAD

Now everything works perfectly. Thank you @shree
Reply all
Reply to author
Forward
0 new messages