Multiple tifs to one file

477 views
Skip to first unread message

Stathis L.

unread,
Apr 21, 2015, 5:58:19 AM4/21/15
to tesser...@googlegroups.com
Is there a way to combine several tiffs into one ocr file? If so, what may be the command for that?

Thanks

Quan Nguyen

unread,
Apr 21, 2015, 9:24:31 PM4/21/15
to tesser...@googlegroups.com
Try Merge TIFF function of VietOCR.

zdenko podobny

unread,
Apr 22, 2015, 3:21:38 AM4/22/15
to tesser...@googlegroups.com
IMO there are 2 easy solutions:

1. You can combine input images with ImageMagick to multipage tif (e.g. convert image1.png image2.bmp image3.tif output.tif)

2. You can create text file with image filename per line. e.g. filelist.lst that has this context:
song.png
tessinput.tif
superscript.png
testp.png

than you can ocr this list by command:

tesseract filelist.lst ocr_output

Zdenko

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/b54c6525-4c5b-419d-a246-c9ffe79183cb%40googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Stathis L.

unread,
Apr 23, 2015, 9:03:02 AM4/23/15
to tesser...@googlegroups.com
Point is it's a 800-900 page tiff that I want to create and these solutions don't really work, as making a list is not a question and VietOCR runs out of memory. Anything else?

Tom Morris

unread,
Apr 23, 2015, 4:12:43 PM4/23/15
to tesser...@googlegroups.com

On Thursday, April 23, 2015 at 9:03:02 AM UTC-4, Stathis L. wrote:
Point is it's a 800-900 page tiff that I want to create and these solutions don't really work, as making a list is not a question and VietOCR runs out of memory. Anything else?

Are you replying to Quan or Zdenko?  It helps if you include a reference to the text that you're replying to or commenting on.

Zdenko offered a couple of different solutions.  If you don't like either of those, google "merge tiff files" and choose one of the other (many) options.

Tom 

Quan Nguyen

unread,
Apr 24, 2015, 9:50:37 AM4/24/15
to tesser...@googlegroups.com
That's a huge file! I'm not sure if any program can handle that size, including Tesseract. My suggestion is to keep the files to several tens-page size, or OCR them individually and merge the outputs.
Reply all
Reply to author
Forward
0 new messages