>>see below. Will also post parts to tesseract group.
From: 74yrs old [mailto:withbl...@gmail.com]
Sent: Friday, September 04, 2009 1:34 AM
To: tesser...@googlegroups.com
Cc: Pohorsky, Steve
Subject: Re: A vcproj file for building the traineddata files for 3.0
SteveP,
Appreciated for the detailed instructions how to generate combine.exe. Thanks for the same.
I followed your guidance
> "rick-click on "Solution 'teseract'" in the Solution Explorer pane,
> select Add,
> select Existing Project...
>
> In the dialog box that comes up, navigate to the folder that has
> combine.vcproj in it,
> select this vcproj file and click on Open.
> If this worked, you should see "combine" as a new project in the
> solution."
As a result of the re-compilation (Build batch -select all -Clean -rebuild all) in VC++2008
cntraining - 0 error(s), 11 warning(s)
========== Rebuild All: 35 succeeded, 1 failed, 0 skipped ==========
Note: I could not understand "1 failed" - which one failed?
>>click in the Output pane. do a Find for “error”.
In the bin.dbg = 7exe files generated including combine.exe appeared..
In the Main folder = 6exe(release) generated. Copied combine.exe from bin.dbg and
pasted under Main folder. Thus total 7 exe files[6 exe release +one exe dbg] existed.
Tested tesseract photest.tif phtest logfile = phtest.txt reproduced correctly from tif file.
Regarding generating combine.exe: As per your guidance
">To run this exe, it needs to run with the working
> directory set to the folder that has the tessdata folder in it. The
> easiest way to do this is to copy the exe to that folder"
Whether copy "combine.exe" found in bin.dbg can be pasted into folder "tessdata" ?
>> not into tessdata, but into the folder above it, the one that contains tessdata folder.
Because I don't know which are files of DLLs to copied into bin.dbg.
It is presumed that six files of DLLs are of Lepton like Jpeg62.dll, libimage.dll, librle3.dll, leptonlib.dll,
libpng13.dll, libtiff3.dll. plus "tessdata" folder have to be copied into bin.dbg.
>> I was referring to what Ray S wrote in the README in the wiki site, ‘all DLLs except tessdll”.
Further, It is presumed that to run combine.exe - the command line( example for English datafiles)
should be as follows:
" combine tessdata/eng.freq-dawg, tessdata/eng.user-words, tessdata/eng.word-dawg,
tessdata/eng.inttemp, tessdata/eng.normproto, tessdata/eng.pffmtable, tessdata/eng.unicharset,
tessdata/eng.DangAmbigs (output)eng.traineddata "
>> no, command line is just what is between quotes here: “combine tessdata/eng.”
>>all of the suffixes are in the source code; that is why they are not specified on command line.
>>Note that “DangAmbigs” is the old name. For 3.0 tesseract source code for combine (I did not write it) uses “unicharambigs”.
This communication, including any attachments, may contain information that is proprietary, privileged, confidential or legally exempt from disclosure. If you are not a named addressee, you are hereby notified that you are not authorized to read, print, retain a copy of or disseminate any portion of this communication without the consent of the sender and that doing so may be unlawful. If you have received this communication in error, please immediately notify the sender via return e-mail and delete it from your system.