First version of tesseract4java -- a GUI for training and running Tesseract -- released

781 views
Skip to first unread message

Paul

unread,
Aug 19, 2016, 7:49:23 PM8/19/16
to tesseract-ocr

Some features include:
  • Includes pre-compiled Tesseract (3.04.01) for your platform, just click and run it! Traineddata is not included, though.
  • Image pre-processing using Sauvola or Otsu binarization methods
  • Integrated Box Editor
  • Glyph overview for advanced training including glyph feature visualization
  • Recognition comparison view with helpful visual info like detected word boxes, symbol boxes, line numbers, base lines, x-lines, blocks, paragraphs, and font features
  • Evaluation view that can generate recognition reports using ocrevalUAtion
  • Batch recognition with text, HTML and XML output
You can find screenshots on the project page at GitHub: https://github.com/tesseract4java/tesseract4java

I appreciate your ideas for improvement and bug reports in the issues section.

-- Paul

ShreeDevi Kumar

unread,
Aug 20, 2016, 5:24:17 AM8/20/16
to tesser...@googlegroups.com
Is this similar to Quan's JTessBoxEditor and VietOCR?

I downloaded tesseract4java-0.1.0-windows-x86_64.jar and tried to run it - 

it gives fatal error - no jnilept in java.library.path

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/1042471e-4d36-46db-a24b-342c00e19b7c%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Paul

unread,
Aug 20, 2016, 11:31:03 AM8/20/16
to tesseract-ocr
Do you have a 32-bit or 64-bit Java installed?

You can find out by running `java -version` on a command line.

Paul

Am Samstag, 20. August 2016 11:24:17 UTC+2 schrieb shree:
Is this similar to Quan's JTessBoxEditor and VietOCR?

I downloaded tesseract4java-0.1.0-windows-x86_64.jar and tried to run it - 

it gives fatal error - no jnilept in java.library.path

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Sat, Aug 20, 2016 at 5:19 AM, Paul <pa...@vorb.de> wrote:

Some features include:
  • Includes pre-compiled Tesseract (3.04.01) for your platform, just click and run it! Traineddata is not included, though.
  • Image pre-processing using Sauvola or Otsu binarization methods
  • Integrated Box Editor
  • Glyph overview for advanced training including glyph feature visualization
  • Recognition comparison view with helpful visual info like detected word boxes, symbol boxes, line numbers, base lines, x-lines, blocks, paragraphs, and font features
  • Evaluation view that can generate recognition reports using ocrevalUAtion
  • Batch recognition with text, HTML and XML output
You can find screenshots on the project page at GitHub: https://github.com/tesseract4java/tesseract4java

I appreciate your ideas for improvement and bug reports in the issues section.

-- Paul

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.

universal reseller

unread,
Aug 20, 2016, 11:44:54 AM8/20/16
to tesser...@googlegroups.com
​is this support rtl languages?!​

ShreeDevi Kumar

unread,
Aug 20, 2016, 10:40:30 PM8/20/16
to tesser...@googlegroups.com
C:\Users\User>java -version

java version "1.8.0_45"
Java(TM) SE Runtime Environment (build 1.8.0_45-b15)
Java HotSpot(TM) Client VM (build 25.45-b02, mixed mode)

Probably it is 32 bit. 
I downloaded tesseract4java-0.1.0-windows-x86.jar and it runs, however there is no help with the program so it is difficult to figure out how to use it. 

Inline image 1

Edit preferences asks for paths for Tesseract execuatables and langdata but I could not find where to specify paths for tessdata.

I can provide more feedback offline or under issues on github, if you like.
 

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

David H.

unread,
Aug 21, 2016, 7:54:08 AM8/21/16
to tesseract-ocr
Is there a guide on how to use the software once it's installed? I created a new project, added image files but don't know what to do next.

Paul

unread,
Aug 21, 2016, 6:20:09 PM8/21/16
to tesseract-ocr
No, I have not tested it on RTL languages.

Paul

unread,
Aug 21, 2016, 6:31:37 PM8/21/16
to tesseract-ocr
I see there's a need for documentation, so I started writing documentation in the project's wiki.

If you have something that is unclear, create an issue at GitHub or write me an email, if you have no GitHub account.

Charlie Hayes

unread,
May 4, 2017, 5:25:08 AM5/4/17
to tesseract-ocr
I was finally able to get the tessdata to load but I still couldn't get anything to work.
Reply all
Reply to author
Forward
0 new messages