Re: how to get tesseract to run on an entire folder

Quan Nguyen

Mar 31, 2013, 12:15:53 AM
to tesser...@googlegroups.com
No, you cannot run a batch of files like that with Tesseract; it has to be a Tesseract invocation for each file.

Or you can use VietOCR, a GUI frontend for Tesseract that supports batch or bulk OCR.

On Saturday, March 30, 2013 6:30:11 AM UTC-5, rollas...@gmail.com wrote:
I have a folder with a large number of documents that I would like to OCR. It would be too time-consuming to type the command for each one.

I tried 

tesseract *.tif * hocr 

but I get the following error message 

read_params_file: parameter not found: II*

How do I use a wildcard here? I am using Tesseract 3.02 on 64-bit Ubuntu.
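
A likely explanation for the error, offered as a sketch rather than a confirmed diagnosis: the shell expands both wildcards before Tesseract starts, so Tesseract receives the first .tif as the image, the next file name as the output base, and every remaining file name as a config file. Reading a TIFF as a config file would produce a complaint about `II*`, which are the first bytes of a little-endian TIFF. The helper below (`show_args` is a hypothetical name, not part of Tesseract) prints the arguments a command would actually receive.

```shell
# show_args is a hypothetical helper: it prints, one per line, each argument
# the shell passes after wildcard expansion.
show_args() {
    n=0
    for arg in "$@"; do
        n=$((n + 1))
        printf 'arg %d: %s\n' "$n" "$arg"
    done
}
```

Running `show_args *.tif * hocr` in a folder that contains only a.tif and b.tif prints five arguments: a.tif, b.tif, a.tif, b.tif, hocr. Only the first would be treated as the input image.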

Nick White

Mar 31, 2013, 9:33:07 AM
to tesser...@googlegroups.com
Learn Bourne shell ;)

This will work fine for you (so long as files aren't strangely
named):

for i in *.tif; do b=$(basename "$i" .tif); tesseract "$i" "$b" hocr; done
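
For folders where names may contain spaces, or where there may be no .tif files at all, the loop above can be wrapped in a function. This is a minimal sketch under assumptions: `ocr_folder` is an illustrative name, and the `tesseract input outputbase hocr` call is the same form used in the one-liner.

```shell
# ocr_folder is a hypothetical helper: it runs one tesseract invocation per
# .tif file in the given directory (default: the current directory).
ocr_folder() {
    dir=${1:-.}
    for img in "$dir"/*.tif; do
        # When nothing matches, the pattern is left unexpanded; skip it.
        [ -e "$img" ] || continue
        # Strip the .tif suffix to form the output base name;
        # tesseract appends its own extension to it.
        base=${img%.tif}
        # Quoting keeps names with spaces intact; report failures and go on.
        tesseract "$img" "$base" hocr || echo "failed: $img" >&2
    done
}
```

Call it as `ocr_folder /path/to/scans`, or as `ocr_folder` with no argument to process the current directory. Each output file is written next to its source image.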
