How to make a .trainedData? (Java)

1,028 views
Skip to first unread message

Tollpatsch GehtEuchNichtsAn

unread,
May 4, 2015, 12:18:37 PM5/4/15
to tesser...@googlegroups.com

Hey Guys,

I already tryed to use JTessBoxEditor because i read with this .jar i can create my needed .traineddata file.

My problem is i have a screenshot like this.











I want to use Tesseract on this image (.png)... So i decided to use Tess4J..
but i need a .traineddata file because i get a problem with the numbers. (5 get output as <, 2 or S)
Now i created with JTessBoxEditor a .box file out of this part of the screenshot (i just want to test if it work, if it work i want to make a whole .box file out of the image)



Now i have the .box file. but how i create a .traineddata?
I already created a .frequent_words_list, .words_list and a .font_properties.
The problem is i dont know the name of the font...

Can you help me?

Sriranga(82yrsold)

unread,
May 4, 2015, 12:42:17 PM5/4/15
to tesser...@googlegroups.com
you can generate traineddata in Jboxeditor. 

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/e80c9e08-94ea-48f8-8240-61e38ee9e94f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Tollpatsch GehtEuchNichtsAn

unread,
May 4, 2015, 3:32:53 PM5/4/15
to tesser...@googlegroups.com
And how? the forum tells me that you uploaded a image. but i dont see any image?
If i try to create a data with the first tab (im on my smartphone, not on the pc). I get an error that i needed 3 Files (named in my first post). the last, one font-properties, i dont know what i should write in it, because i dont know the name of the font
Message has been deleted

Quan Nguyen

unread,
May 4, 2015, 9:29:06 PM5/4/15
to tesser...@googlegroups.com
You need to read Training wiki before continuing on.

The font name could be arbitrary if you don't have an exact name. And the same name should be used across the image file, the box file, and the entry in font_properties file. Look in samples folder of jTessBoxEditor for example.

Tollpatsch GehtEuchNichtsAn

unread,
May 6, 2015, 3:07:05 PM5/6/15
to tesser...@googlegroups.com
maybe im to stupid for that but i realy dont get it work....

Which files i need to get it work?
i have a .tiff with a example sentence. I created a .box every char in this sentence is matched right.
but then? 
how i use the "Trainer" Tap?
Which .exe i need to use in "Tesseract Execute"? i set tesseract.exe.
Training Data? I choose the ascent.
Language?`I use the name i want. "ascent"
Bootstrap Language: i dont know what i need to tipping here.
RTL? Yes/No ? What means RTL?
Which modus? i use "Trained with existing Box"


If i use run i get the error:








but i created a ascent.font_properties
Entry
ascent 0 0 0 0 0 

I realy try verything i can. I tryed to read the wiki but my english is not so good. i dont unterstand that. I dont find any tutorial to creat a .trainedData...




Quan Nguyen

unread,
May 6, 2015, 4:51:56 PM5/6/15
to tesser...@googlegroups.com
You seem to mix up between language code and fontname. Did you look in samples folder for example? List the names of all of your input files so we could assess that they are named correctly.

Tollpatsch GehtEuchNichtsAn

unread,
May 8, 2015, 2:35:41 PM5/8/15
to tesser...@googlegroups.com

Quan Nguyen

unread,
May 8, 2015, 3:26:37 PM5/8/15
to tesser...@googlegroups.com
You did not follow Training Wiki directions and the samples. Where's the language code?

On Friday, May 8, 2015 at 1:35:41 PM UTC-5, Tollpatsch GehtEuchNichtsAn wrote:

Reply all
Reply to author
Forward
0 new messages