--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
Added simultaneous multi-language capability.
zdenko,
Tried in ubuntu 11.10 - failed to install even after following the guidelines in wiki.
Then try the instructions again as you did. You've been very helpful
to people on this list in the past. Sorry Zdenko was a bit rude --
developers often don't like to explain the fundamentals of compiling
software because they have to do it a lot.
Good luck!
--Sven
--
``All that is gold does not glitter,
not all those who wander are lost;
the old that is strong does not wither,
deep roots are not reached by the frost.
From the ashes a fire shall be woken,
a light from the shadows shall spring;
renewed shall be blade that was broken,
the crownless again shall be king.”
myTess->Init(tessDataDir.c_str(), language, OEM_DEFAULT, NULL, , 0, false);
zdenko,
in WinxP, I was able to build viz. = Build: 22 succeeded, 3 failed, 0 up-to-date, 0 skipped ===
when checked in bin.dbg - it contains (1)cntraining.exe,(2) combine_tessdata.exe, (3)mftraining.exe,(4)tesseract.exe,(5) unicharset_extractor.exe and(6) wordlist2dawg.exe and also(7) liblept168d.dll. In the Debug folder contains(8) ambiguous_words.exe,
Thus I was able to locate 7exe files and one Dll file and rest(14 files out of 22 succeeded) could not located. i may kindly intimated where i made mistake, if any?
Also tested as suggested and output was fine vide attached files for persual. i am thankful to you for the valuable guidance rendered to me.
Tested using tesseract-ocr 3.02 in WinXP(with sp3).
Tried to generate .tr file using the following commandline. exp0.box was generated successfully - but failed to generate exp0.tr file exp0.txt - attached herewith for perusal.
M:\rao- files\chilume\test-3.02>tesseract exp0.tif exp0 batch.nochop makebox
Tesseract Open Source OCR Engine v3.02 with Leptonica
Page 0
M:\rao- files\chilume\test-3.02>tesseract exp0.tif exp0 nobatch box.train logfile
Tesseract Open Source OCR Engine v3.02 with Leptonica
M:\rao- files\chilume\test-3.02>
Guidance is requested.
-sriranga(79yrs)
Tesseract 3.02 is now available in svn for preliminary testing, currently Linux-only.
There are now 65 languages and some big improvements in layout analysis and character accuracy.
This version will with luck make it into Ubunto LTS Precise Pangolin, so please test to see if your favorite issue is resolved.
Thanks and enjoy!
Ray.
Tesseract 3.02 is now available in svn for preliminary testing, currently Linux-only.
There are now 65 languages and some big improvements in layout analysis and character accuracy.
This version will with luck make it into Ubunto LTS Precise Pangolin, so please test to see if your favorite issue is resolved.
Thanks and enjoy!
Ray.
Tesseract 3.02 is now available in svn for preliminary testing, currently Linux-only.
There are now 65 languages and some big improvements in layout analysis and character accuracy.
This version will with luck make it into Ubunto LTS Precise Pangolin, so please test to see if your favorite issue is resolved.
Thanks and enjoy!
Ray.
Tesseract 3.02 is now available in svn for preliminary testing, currently Linux-only.
There are now 65 languages and some big improvements in layout analysis and character accuracy.
This version will with luck make it into Ubunto LTS Precise Pangolin, so please test to see if your favorite issue is resolved.
Thanks and enjoy!
Ray.
Tesseract 3.02 is now available in svn for preliminary testing, currently Linux-only.
There are now 65 languages and some big improvements in layout analysis and character accuracy.
This version will with luck make it into Ubunto LTS Precise Pangolin, so please test to see if your favorite issue is resolved.
Thanks and enjoy!
Ray.
Tesseract 3.02 is now available in svn for preliminary testing, currently Linux-only.
There are now 65 languages and some big improvements in layout analysis and character accuracy.
This version will with luck make it into Ubunto LTS Precise Pangolin, so please test to see if your favorite issue is resolved.
Thanks and enjoy!
Ray.
you did give all details, so I need to guess some details:
1. I guess that you run something like this:
$ tesseract binarized.jpg content -l deu
but you created makebox file with command
$ tesseract binarized.jpg binarized makebox
if yes, than difference is in used language file
2. I try to run OCR eng and than with deu language file. With eng url
was ok (see binarized-eng), but some German words were not correct. It
look like "problem" is in German language file (dictionary?) and not in
tesseract library. This is just quick option, so maybe I am wrong. As a
workaround you can combine English and German file in tesseract3.02 (see
result binarized-eng_deu.txt)
$ tesseract binarized.jpg binarized-eng_deu -l eng+deu
Zdenko
Haben Sie noch Fragen?
Unsere Mitarbeiter/-innen helfen lhnen gem weiter:KundenCenter Regiobahn
An der Regiobahn 13
40822 Mettmann
Telefon: 02104 305-400
Telefax: 02104 305-403www.regio-bahn.de
in...@regio-bahn.deSchlaue Nummer 0 180 3/50 40 30
(Festnetzpreis 0,09 €/Minute;
mobil max. 0,42 €/Minute)Gute Fahrtwtmscht lhnen lhre REGIOBAHN
Haben Sie noch Fragen?
Unsere Mitarbeiter/-innen helfen Ihnen gern weiter:KundenCenter Regiobahn
An der Regiobahn 13
40822 Mettmann
Telefon: 02104 305-400
Telefax: 02104 305-403www.regio-bahn.de
in...@regio-bahn.deSchlaue Nummer 0 180 3/50 40 30
(Festnetzpreis 0,09 €/Minute;
mobil max. 0,42 €/Minute)Gute Fahrt wünscht Ihnen Ihre REGIOBAHN
Tesseract 3.02 is now available in svn for preliminary testing, currently Linux-only.
There are now 65 languages and some big improvements in layout analysis and character accuracy.
This version will with luck make it into Ubunto LTS Precise Pangolin, so please test to see if your favorite issue is resolved.
Thanks and enjoy!
Ray.
I just uploaded some fixes to VC2008 build - target was to compile and run tesseract.exe ("tesseract.exe eurotext.tif eurotext" produced output :-) )Please test it. Feel free to improve it.I still continue to support the current "vs2008 structure". When Tom will finalize his contribution[1] I will adapt it to 3.02 version and use it for next tesseract release.Zdenko
Well, this is a really old thread but I'm hoping some of you are still around. What do those Error messages mean? I am using tesseract on some Kannada files and I get these messages. Since I'm processing hundreds of pages, I cannot tell whether or not the OCR is accurate. Error messages are worrisome.