MSI installer for tesseract-ocr

283 views
Skip to first unread message

Osye

unread,
Oct 9, 2009, 5:37:14 PM10/9/09
to tesseract-ocr
I am sorry this has taken so long but I am finished with the Windows
Installer (MSI) for tesseract-ocr. I will upload the MSI shortly.
Between work and life I have not had much time to work on this but it
is done and ready for user testing I guess.

Please let me know if there are any issues with this for anyone, or if
it works great for you.


Continued from discussion here:
http://groups.google.com/group/tesseract-ocr/browse_thread/thread/4687e13d3a6c209f/5ee88f5a3621851f?lnk=gst&q=installer#5ee88f5a3621851f

Osye

unread,
Oct 11, 2009, 11:51:09 AM10/11/09
to tesseract-ocr
Update: The MSI is too large to upload here (it is 18 MB but there is
a 10 MB limit here).


On Oct 9, 4:37 pm, Osye <oprit...@gmail.com> wrote:
> I am sorry this has taken so long but I am finished with the Windows
> Installer (MSI) for tesseract-ocr.  I will upload the MSI shortly.
> Between work and life I have not had much time to work on this but it
> is done and ready for user testing I guess.
>
> Please let me know if there are any issues with this for anyone, or if
> it works great for you.
>
> Continued from discussion here:http://groups.google.com/group/tesseract-ocr/browse_thread/thread/468...

74yrs old

unread,
Oct 11, 2009, 1:05:37 PM10/11/09
to tesser...@googlegroups.com
Osye,
please visit http://www.mediafire.com/ to find out whether it is suitable for you to upload your  files.
MSI for which version of tesseract created.
-sriranga(77yrs old)

Osye

unread,
Oct 11, 2009, 3:47:42 PM10/11/09
to tesseract-ocr
I also need to know if it needs any changes. I included the language
files in the installer as optional features with English set as the
default. And the install location is set to C:\Program Files
\tesseract-ocr. Any of these things can be changed during the install
though.

Also I tested it my self (several times) but it would be nice if
someone else could test the install as well to see if I missed
anything.

On Oct 11, 12:05 pm, 74yrs old <withblessi...@gmail.com> wrote:
> Osye,
> please visithttp://www.mediafire.com/to find out whether it is suitable
> for you to upload your  files.
> MSI for which version of tesseract created.
> -sriranga(77yrs old)
>

Osye

unread,
Oct 11, 2009, 4:10:13 PM10/11/09
to tesseract-ocr
Here is the Mediafire page it is shared on:
http://www.mediafire.com/?nln0zjzdzyz

Here is a direct link to the file:
http://download631.mediafire.com/jpw4khmerwyg/nln0zjzdzyz/tesseract-ocr-2.04.msi

On Oct 11, 2:47 pm, Osye <oprit...@gmail.com> wrote:
> I also need to know if it needs any changes.  I included the language
> files in the installer as optional features with English set as the
> default.  And the install location is set to C:\Program Files
> \tesseract-ocr.  Any of these things can be changed during the install
> though.
>
> Also I tested it my self (several times) but it would be nice if
> someone else could test the install as well to see if I missed
> anything.
>
> On Oct 11, 12:05 pm, 74yrs old <withblessi...@gmail.com> wrote:
>
> > Osye,
> > please visithttp://www.mediafire.com/tofind out whether it is suitable

76yrsold

unread,
Oct 12, 2009, 11:28:53 AM10/12/09
to tesseract-ocr
Osye,
I am glad you have succeeded in uploading the your program.
Congratulations.
Please consider as a special case to upload MSI installers also for
the
previous versions of the tesseract like 2.03, 2.02 etc for the benefit
of community since some of the previous versions
were built on Visual Studio 2005express which is now not available for
download for re-compiling purpose.
Ray will really appreciate for the same.
With Choicest Best Wishes.
-sriranga(77yrs)

On Oct 12, 1:10 am, Osye <oprit...@gmail.com> wrote:
> Here is the Mediafire page it is shared on:http://www.mediafire.com/?nln0zjzdzyz
>
> Here is a direct link to the file:http://download631.mediafire.com/jpw4khmerwyg/nln0zjzdzyz/tesseract-o...
>
> On Oct 11, 2:47 pm, Osye <oprit...@gmail.com> wrote:
>
> > I also need to know if it needs any changes.  I included the language
> > files in the installer as optional features with English set as the
> > default.  And the install location is set to C:\Program Files
> > \tesseract-ocr.  Any of these things can be changed during the install
> > though.
>
> > Also I tested it my self (several times) but it would be nice if
> > someone else could test the install as well to see if I missed
> > anything.
>
> > On Oct 11, 12:05 pm, 74yrs old <withblessi...@gmail.com> wrote:
>
> > > Osye,
> > > please visithttp://www.mediafire.com/tofindout whether it is suitable

76yrsold

unread,
Oct 12, 2009, 11:33:31 AM10/12/09
to tesseract-ocr
Osye,
Really I am interested to check the optional features whether it works
for Kannda lang (Indic -utf8). Beta testing/feedback will done during
1st week of November, since I am on bed rest due to eye operation.
-sriranga(77yrsold)

On Oct 12, 1:10 am, Osye <oprit...@gmail.com> wrote:
> Here is the Mediafire page it is shared on:http://www.mediafire.com/?nln0zjzdzyz
>
> Here is a direct link to the file:http://download631.mediafire.com/jpw4khmerwyg/nln0zjzdzyz/tesseract-o...
>
> On Oct 11, 2:47 pm, Osye <oprit...@gmail.com> wrote:
>
> > I also need to know if it needs any changes.  I included the language
> > files in the installer as optional features with English set as the
> > default.  And the install location is set to C:\Program Files
> > \tesseract-ocr.  Any of these things can be changed during the install
> > though.
>
> > Also I tested it my self (several times) but it would be nice if
> > someone else could test the install as well to see if I missed
> > anything.
>
> > On Oct 11, 12:05 pm, 74yrs old <withblessi...@gmail.com> wrote:
>
> > > Osye,
> > > please visithttp://www.mediafire.com/tofindout whether it is suitable

Osye

unread,
Oct 12, 2009, 6:00:11 PM10/12/09
to tesseract-ocr
I included the language data from here (http://code.google.com/p/
tesseract-ocr/downloads/list) for:

Bangla
German
Fraktur (Old German)
English
French
Italian
Dutch
Portuguese (Brazilian)
Vietnamese

They are all optional features, with English set to install by
default, but any of them can be selected\deselected. Is there
language data for Kannda that should be included?


On Oct 12, 10:33 am, 76yrsold <withblessi...@gmail.com> wrote:
> Osye,
> Really I am interested to check the optional features whether it works
> for Kannda lang (Indic -utf8). Beta testing/feedback  will done during
> 1st week of November, since I am on bed rest due to eye operation.
> -sriranga(77yrsold)
>
> On Oct 12, 1:10 am, Osye <oprit...@gmail.com> wrote:
>
> > Here is the Mediafire page it is shared on:http://www.mediafire.com/?nln0zjzdzyz
>
> > Here is a direct link to the file:http://download631.mediafire.com/jpw4khmerwyg/nln0zjzdzyz/tesseract-o...
>
> > On Oct 11, 2:47 pm, Osye <oprit...@gmail.com> wrote:
>
> > > I also need to know if it needs any changes.  I included the language
> > > files in the installer as optional features with English set as the
> > > default.  And the install location is set to C:\Program Files
> > > \tesseract-ocr.  Any of these things can be changed during the install
> > > though.
>
> > > Also I tested it my self (several times) but it would be nice if
> > > someone else could test the install as well to see if I missed
> > > anything.
>
> > > On Oct 11, 12:05 pm, 74yrs old <withblessi...@gmail.com> wrote:
>
> > > > Osye,
> > > > please visithttp://www.mediafire.com/tofindoutwhether it is suitable

Osye

unread,
Oct 13, 2009, 1:48:31 AM10/13/09
to tesseract-ocr
I cleaned up the installer some today and re-uploaded it here:
http://www.mediafire.com/?nln0zjzdzyz
> > > > > please visithttp://www.mediafire.com/tofindoutwhetherit is suitable

74yrs old

unread,
Oct 13, 2009, 4:14:22 AM10/13/09
to tesser...@googlegroups.com
Osye,
I could not follow what kind of clean up the installer.
Anyhow I re-downloaded just now.
Regarding language data i.e. Kannada I have to re-generate(<lang>trained data) from the scratch as per guidance of Ray due to
new version 2.04 because old kannada datafiles will not work in the new version 2.04.
Anyhow I hope there is provision to add "kan.trained data" at a later stage by  the user - under tessdata of tesseract 2.04 folder?
With Regards,
-sriranga(77yrsold)

74yrs old

unread,
Oct 13, 2009, 12:58:51 PM10/13/09
to tesser...@googlegroups.com
Osye,
Just now I installed the Msi installation 2.04(1). After installation, I could not locate icon of tesseract-ocr nor in C:\progran Files.When checked in Add/remove there is entry tesseract-ocr - where I clicked to repair and rebooted but still not figured"tesseract-ocr" under C:\ProgramFiles\??.  Further guidance is requested what to do?
With regards,
-sriranga(77yrsold)

Osye

unread,
Oct 13, 2009, 5:56:46 PM10/13/09
to tesseract-ocr
I will look into that this evening. I may have copied the wrong
installer to the file share.

On Oct 13, 11:58 am, 74yrs old <withblessi...@gmail.com> wrote:
> Osye,
> Just now I installed the Msi installation 2.04(1). After installation, I
> could not locate icon of tesseract-ocr nor in C:\progran Files.When checked
> in Add/remove there is entry tesseract-ocr - where I clicked to repair and
> rebooted but still not figured"tesseract-ocr" under C:\ProgramFiles\??.
> Further guidance is requested what to do?
> With regards,
> -sriranga(77yrsold)
>
> On Mon, Oct 12, 2009 at 1:17 AM, Osye <oprit...@gmail.com> wrote:
>
> > I also need to know if it needs any changes.  I included the language
> > files in the installer as optional features with English set as the
> > default.  And the install location is set to C:\Program Files
> > \tesseract-ocr.  Any of these things can be changed during the install
> > though.
>
> > Also I tested it my self (several times) but it would be nice if
> > someone else could test the install as well to see if I missed
> > anything.
>
> > On Oct 11, 12:05 pm, 74yrs old <withblessi...@gmail.com> wrote:
> > > Osye,
> > > please visithttp://www.mediafire.com/tofind out whether it is suitable

Sven Pedersen

unread,
Oct 13, 2009, 7:23:16 PM10/13/09
to tesser...@googlegroups.com
Osye, good job!
I installed the program without trouble. I tried the English and
French, though I used a very simple test file. I also chose the
Spanish option, and it seemed to install just fine, though I have not
yet tested it. I wonder if we could not build a version with
compressed TIFF and PNG support? I guess you're working from the
binaries, and it might require multiple licenses being displayed
during installation. My main concern is that everyday users would
likely want to use multiple file formats, especially compressed TIFF,
which many scanning packages produce.

Anyway, it looks very good. The one small nit which I would pick is
that the red disc icon is a bit weird -- it looks like a warning,
rather than a normal part of installation. Perhaps that is part of the
WiX system?
Thanks so much for your hard work!
--Sven

Osye

unread,
Oct 13, 2009, 8:49:49 PM10/13/09
to tesseract-ocr
The red icon is a default part of the WiX UI, but it can be changed
out for another icon (actually a bitmap) if preferred. I can include
other things in the installer pretty easily if needed or wanted.

There is a problem with the installer that sriranga(77yrsold) pointed
out. When skipping the advanced settings and just installing with the
initial install button it doesn't actually install. I am fixing that
issue now. I should have a new version of the installer up later this
evening or in the morning.


On Oct 13, 6:23 pm, Sven Pedersen <sven.peder...@gmail.com> wrote:
> Osye, good job!
> I installed the program without trouble. I tried the English and
> French, though I used a very simple test file. I also chose the
> Spanish option, and it seemed to install just fine, though I have not
> yet tested it. I wonder if we could not build a version with
> compressed TIFF and PNG support? I guess you're working from the
> binaries, and it might require multiple licenses being displayed
> during installation. My main concern is that everyday users would
> likely want to use multiple file formats, especially compressed TIFF,
> which many scanning packages produce.
>
> Anyway, it looks very good. The one small nit which I would pick is
> that the red disc icon is a bit weird -- it looks like a warning,
> rather than a normal part of installation. Perhaps that is part of the
> WiX system?
> Thanks so much for your hard work!
> --Sven
>
> On Tue, Oct 13, 2009 at 5:56 PM, Osye <oprit...@gmail.com> wrote:
>
> > I will look into that this evening.  I may have copied the wrong
> > installer to the file share.
>
> > On Oct 13, 11:58 am, 74yrs old <withblessi...@gmail.com> wrote:
> >> Osye,
> >> Just now I installed the Msi installation 2.04(1). After installation, I
> >> could not locate icon of tesseract-ocr nor in C:\progran Files.When checked
> >> in Add/remove there is entry tesseract-ocr - where I clicked to repair and
> >> rebooted but still not figured"tesseract-ocr" under C:\ProgramFiles\??.
> >> Further guidance is requested what to do?
> >> With regards,
> >> -sriranga(77yrsold)
>
> >> On Mon, Oct 12, 2009 at 1:17 AM, Osye <oprit...@gmail.com> wrote:
>
> >> > I also need to know if it needs any changes.  I included the language
> >> > files in the installer as optional features with English set as the
> >> > default.  And the install location is set to C:\Program Files
> >> > \tesseract-ocr.  Any of these things can be changed during the install
> >> > though.
>
> >> > Also I tested it my self (several times) but it would be nice if
> >> > someone else could test the install as well to see if I missed
> >> > anything.
>
> >> > On Oct 11, 12:05 pm, 74yrs old <withblessi...@gmail.com> wrote:
> >> > > Osye,
> >> > > please visithttp://www.mediafire.com/tofindout whether it is suitable

Osye

unread,
Oct 13, 2009, 11:45:28 PM10/13/09
to tesseract-ocr
I have reworked the installer, again :)

The new installer can be found here:

http://www.mediafire.com/?ljwyyrwmnxi

It should take care of the issue that sriranga(77yrsold) was having.

Sven I can add other files to the package if people want me to.
Also I will work on changing the images on the installer this week.

What files would people like added to the installer?
> > >> > > please visithttp://www.mediafire.com/tofindoutwhether it is suitable

74yrs old

unread,
Oct 14, 2009, 8:03:09 AM10/14/09
to tesser...@googlegroups.com, Ray Smith
Thanks for the hardwork!
Just now I downloaded the revised installer works well. However it is observed that no "combine.exe" is missing  in the trainer folder? under tessdata folder "eng.traineddata" is missing?
It is presumed that other "<lang>traineddata" files can be added under tessdata folder by the user.
It would be nice to have frontend GUI similar to freeOCR.Net/vietocr.net. It is nothing but automate the functions of different exe

Sample menu created

Open image

Select                 
After selected  exe click on "Run" to generate

 tesseract.exe
   
fontfile.tif junk nobatch box.train
mftraining.exe
 
fontfile_1.tr fontfile_2.tr
cntraining.exe
fontfile_1.tr fontfile_2.tr
 unicharset_extracter.exe
fontfile_1.box fontfile_2.box .

 wordslist2drawg.exe
   a) select freq-words-txt
   b) select words-txt
 combine.exe for <lang> datafiles generated

If succeeded, the purpose of generated MSI installer for tesseractocr will be served..
With Best Wishes,
-sriranga(77yrs)
Reply all
Reply to author
Forward
0 new messages