(Batch processing) OCR all files in a folder


Alex Santos

Aug 2, 2021, 11:51:33 AM
to tesseract-ocr
Hello

I typically OCR one scanned image at a time, but lately I've been trying to figure out how to process several files one after another.

Folder
|_ file1.jpg
|_ file2.jpg
|_ file3.jpg

In the directory tree above, Folder contains three JPEG files. All of them need to be processed, but doing them one at a time is time-consuming.

I am using zsh on macOS 10.13.6 (High Sierra).

This is what I typically run to process a single file:

ocrmypdf -v --output-type pdfa-3 -l eng --image-dpi 300 --optimize 0 --jpeg-quality 100 --pdfa-image-compression lossless --sidecar sidecar.txt /file1.jpg /file1_output.pdf 

Can I use a for loop or some other technique to batch the example directory?
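
For reference, a minimal sketch of the kind of for loop being asked about. It reuses the ocrmypdf options from the command above unchanged. The function name ocr_folder and the per-file naming (file1.txt for the sidecar, file1_output.pdf for the result, both written next to the source image) are assumptions made for illustration; a per-file sidecar name is used because a fixed sidecar.txt would be overwritten on every pass.

```shell
# Hypothetical helper: run the single-file command once per .jpg in a folder.
# Usage: ocr_folder <directory>
ocr_folder() {
    for img in "$1"/*.jpg; do
        # In sh/bash an unmatched glob stays literal, so skip non-existent paths.
        [ -e "$img" ] || continue

        # Strip the .jpg suffix: Folder/file1.jpg -> Folder/file1
        base="${img%.jpg}"

        # Same options as the single-file command; only the three paths vary.
        ocrmypdf -v --output-type pdfa-3 -l eng --image-dpi 300 --optimize 0 \
            --jpeg-quality 100 --pdfa-image-compression lossless \
            --sidecar "${base}.txt" "$img" "${base}_output.pdf"
    done
}
```

After pasting the function into the shell, it would be called as: ocr_folder Folder. Note that zsh, unlike sh and bash, stops with a "no matches found" error by default when the folder contains no .jpg files, and that the pattern is case-sensitive, so files ending in .JPG or .jpeg would need their own pattern.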

Kind regards
—Alex