Yours,
Pedro Lopes de Almeida
It is. tesseract 1.03 is in Gutsy (7.10) and 2.01 is in Hardy (8.04).
sudo apt-get install tesseract-ocr
will do the job.
Regards
Jeff
OK. I've installed Tesseract 1.03. I use XSane for scanning. Under
PREFERENCES»SETUP»OCR»OCR COMMAND I've typed "tesseract 1.03". Is that
right?
INPUTFILE OPTION: -i
OUTPUTFILE OPTION: -o
GUI output-fd option: -x
Progress keyword: (left empty)
But when, after scanning an image, I tell it to save as OCR text or
to perform the OCR task, it saves nothing. Nothing happens, just as if
I hadn't done anything. So it is not working.
Can anyone help me and tell me what is wrong with my configuration?
I attached a screenshot of my OCR setup in XSane.
Yours,
Pedro Almeida
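
A note on the setup above: in the Windows instructions later in this thread, tesseract is called as 'tesseract.exe image outputbase', with plain positional arguments rather than -i/-o switches, and the "1.03" typed after the command name would simply be passed along as an extra argument. One possible workaround is a small wrapper that accepts XSane's switches and translates them. The Python below is only a minimal sketch under assumptions: the function names are made up for illustration, ImageMagick's `convert` is assumed to be installed to produce an uncompressed TIFF, and the exact arguments XSane passes should be checked on your own system.

```python
import argparse
import os
import shutil
import subprocess
import tempfile


def parse_xsane_args(argv):
    """Pick out the -i/-o values and ignore anything else (e.g. -x <fd>)."""
    parser = argparse.ArgumentParser(add_help=False)
    parser.add_argument("-i", dest="infile", required=True)
    parser.add_argument("-o", dest="outfile", required=True)
    args, _unknown = parser.parse_known_args(argv)
    return args.infile, args.outfile


def build_commands(infile, workdir):
    """Return (commands, text_path): the command lines to run, in order."""
    tiff = os.path.join(workdir, "page.tif")
    outbase = os.path.join(workdir, "page")
    commands = [
        # Assumption: ImageMagick is installed; write an uncompressed TIFF.
        ["convert", infile, "-compress", "None", tiff],
        # tesseract appends ".txt" to the output base name by itself.
        ["tesseract", tiff, outbase],
    ]
    return commands, outbase + ".txt"


def main(argv):
    """Hypothetical entry point: translate XSane's call into a tesseract call."""
    infile, outfile = parse_xsane_args(argv)
    workdir = tempfile.mkdtemp(prefix="xsane-ocr-")
    try:
        commands, text_path = build_commands(infile, workdir)
        for command in commands:
            subprocess.check_call(command)
        # Copy the recognised text to the file name XSane asked for.
        shutil.copyfile(text_path, outfile)
    finally:
        shutil.rmtree(workdir, ignore_errors=True)
    return 0
```

To try it, one could save this as an executable script (the name is up to you), call `main(sys.argv[1:])` at the bottom after an `import sys`, and enter the script's path as the OCR COMMAND while keeping -i and -o as the file options.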
Best regards,
Pedro Almeida
On Tue, 2008-01-08 at 07:08 +0100, Jeffrey Ratcliffe wrote:
I suggest you use gscan2pdf - it integrates scanning and tesseract
nicely - and is also in Gutsy.
Regards
Jeff
I don't have a Windows box, but you will have to create your tiffs
with a program like Imaging, and then install tesseract from the
tesseract-2.01.exe.tar.gz file in the downloads section and use it
from the command line.
> I downloaded the following file to my computer (which is running XP):
> tesseract-2[1].01.tar.gz
>
> Okay, what is a "gz" file?
A gz file is a compressed file, similar to a zip. You can decompress
it with 7-Zip, for instance (http://www.7-zip.org/).
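
If Python happens to be installed, its standard tarfile module can unpack the same kind of archive without extra software. This is just a minimal sketch; the function name is made up for illustration, and it should only be used on archives you trust.

```python
import tarfile


def extract_tar_gz(archive_path, dest_dir):
    """Unpack a .tar.gz archive into dest_dir and return the member names."""
    with tarfile.open(archive_path, "r:gz") as archive:
        names = archive.getnames()
        # Only do this with archives from a source you trust.
        archive.extractall(dest_dir)
    return names
```

For example, extract_tar_gz("tesseract-2.01.exe.tar.gz", ".") would unpack the download into the current folder.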
Additional instructions (found in a comment on the wiki):
1) download tesseract-2.01.exe.tar.gz and tesseract-2.00.eng.tar.gz
2) extract these files into the same folder (with 7-Zip or whatever
extraction software you prefer)
3) open a command window in this folder, where the tesseract.exe file
is located.
4) prep a tiff image. In my case I took a digital picture of a book,
tweaked it in Photoshop and saved it as a tiff with no compression. You
could do the same with the Gimp.
5) now I put the tiff image into the same folder and then in the
command window invoke the operation
'tesseract.exe MyImage.tif MyImageConverted -l eng'
6) the process runs in the background for a few seconds and then a new
text file appears with the name 'MyImageConverted.txt'.
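
Steps 5 and 6 can also be scripted. The following is a minimal Python sketch of the same call, assuming tesseract.exe and the tiff sit in the current folder as described above; the function names are made up for illustration, and reading the result as UTF-8 is an assumption.

```python
import subprocess


def tesseract_command(image, outbase, lang="eng", exe="tesseract.exe"):
    """Build the command line from step 5 as an argument list."""
    return [exe, image, outbase, "-l", lang]


def expected_output(outbase):
    """tesseract adds '.txt' to the output base name (step 6)."""
    return outbase + ".txt"


def run_ocr(image, outbase, lang="eng", exe="tesseract.exe"):
    """Run tesseract and return the recognised text."""
    subprocess.check_call(tesseract_command(image, outbase, lang, exe))
    # Assumption: the output text file is UTF-8 encoded.
    with open(expected_output(outbase), encoding="utf-8") as handle:
        return handle.read()
```

Calling run_ocr("MyImage.tif", "MyImageConverted") would then correspond to the command in step 5.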
> I have a scanner and I can convert to tiff using photoshop but that's
> where I'm at. Can someone give me a step by step process of how to
> make this work?
Only tiffs with no compression will work. Alternatively, bmp files work too.
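
To check whether a tiff really is uncompressed before feeding it to tesseract, one can look at its Compression tag. The Python below is a minimal sketch based on the baseline TIFF layout (tag 259, where the value 1 means no compression). It only inspects the first image directory in the file, and the function names are made up for illustration.

```python
import struct

COMPRESSION_TAG = 259   # TIFF "Compression" tag number
NO_COMPRESSION = 1      # tag value meaning "no compression"


def tiff_compression(data):
    """Return the Compression value from the first IFD of a TIFF byte string."""
    if len(data) < 8:
        raise ValueError("too short to be a TIFF file")
    if data[:2] == b"II":
        endian = "<"    # little-endian byte order
    elif data[:2] == b"MM":
        endian = ">"    # big-endian byte order
    else:
        raise ValueError("not a TIFF file")
    magic, ifd_offset = struct.unpack(endian + "HI", data[2:8])
    if magic != 42:
        raise ValueError("not a TIFF file")
    (count,) = struct.unpack_from(endian + "H", data, ifd_offset)
    for n in range(count):
        entry = ifd_offset + 2 + 12 * n   # each directory entry is 12 bytes
        tag, _type, _count = struct.unpack_from(endian + "HHI", data, entry)
        if tag == COMPRESSION_TAG:
            # A SHORT value sits in the first two bytes of the value field.
            (value,) = struct.unpack_from(endian + "H", data, entry + 8)
            return value
    return NO_COMPRESSION   # the tag defaults to 1 when it is absent


def is_uncompressed_tiff(path):
    """True if the file at path is a TIFF whose first image is uncompressed."""
    with open(path, "rb") as handle:
        return tiff_compression(handle.read()) == NO_COMPRESSION
```

If is_uncompressed_tiff("MyImage.tif") returns False, re-saving the image with compression turned off (or as bmp) is the thing to try.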
--
olorin
Just use the Document Imaging program that comes with Microsoft Office.
Avery