fraktur is ready

piggy

Aug 31, 2007, 4:01:27 PM
to tesseract-ocr
The file fraktur.tgz contains training images and the resulting
training files for Fraktur, the old German script.

I'm going to claim this as good enough for two reasons.

1) We're expecting a child any time now, so I expect to be heavily
distracted from this project for a few months.

2) I had someone volunteer to help. After trying these files on a
couple of books, he decided that the results were good enough to use
and that further training wasn't necessary for his books.

I will undoubtedly add more training pages at some point, but this set
is good enough for a fairly large set of books.

La Monte H. P. Yarroll

Aug 31, 2007, 8:38:16 PM
to tesseract-ocr
Issue 62 remains a problem with this training dataset.

Ray Smith

Aug 31, 2007, 9:08:32 PM
to tesser...@googlegroups.com
I think I know what the bug is. I will investigate next week.
Ray.

piggy

Sep 1, 2007, 10:55:41 PM
to tesseract-ocr
Thanks!

Inka Weide of PGDP prepared another training image and box file for me
from a book with some difficult fonts, so I'm uploading a new
fraktur.tgz. The example image for Issue 62 is the new training image.

I'm seeing error messages from unicharset_extractor that I had not
noticed before. As a result, I've fixed about half a dozen errors in
my previous box files. These fixes are in the new fraktur.tgz.

On Aug 31, 9:08 pm, "Ray Smith" <theraysm...@gmail.com> wrote:
> I think I know what the bug is. I will investigate next week.
> Ray.
>
> On 8/31/07, La Monte H. P. Yarroll <piggy.yarr...@gmail.com> wrote:
> > Issue 62 remains a problem with this training dataset.

74yrsold

Sep 2, 2007, 8:16:41 AM
to tesseract-ocr
Hi Piggy,
Regarding "testpageBefreiung.tif": it appears to be lang:<deu>. If so,
please upload a few lines of typed text in lang:<deu> so that I can
test it on MS Windows. If it works on MS Windows, it should also work
on Ubuntu.
-74yrsold