Re: Persian Tesseract?

444 views

Skip to first unread message

Sven Pedersen

unread,

Aug 9, 2012, 12:43:26 PM8/9/12

to tesser...@googlegroups.com

The release notes say that .NET library was updated for Tesseract OCR
3.01. Cube (multiple language) support is not enabled for RTL
languages (Arabic/Farsi/Hebrew/etc) or complex scripts, as far as I
know. I have heard they're working on that, such as Hindi and Chinese
with English, but not sure. The current version in SVN is 3.02 --
you'll get a lot of backlash calling it 3.2. :-)

Someone named Arsalan Ghasrsaz was working on a chopper for Farsi, but
I have not seen anything recently...
--Sven

On Thu, Aug 9, 2012 at 9:10 AM, Alex C <coug...@gmail.com> wrote:
> Hi. Is there a Tesseract language pack for Persian? The Arabic pack works
> well.
>
> Also, I'm using the Tesseract imlementation from here:
> https://github.com/charlesw/tesseract-ocr-dotnet -- Anyone know which
> version of the Tesseract this is using? I'd like to apply multiple languages
> to an OCR processing ("ara+eng"), but it doesn't seem to support it. I think
> version 3.2 has that capability.
>
> Thank you :)
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to tesser...@googlegroups.com
> To unsubscribe from this group, send email to
> tesseract-oc...@googlegroups.com
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en

--
``All that is gold does not glitter,
not all those who wander are lost;
the old that is strong does not wither,
deep roots are not reached by the frost.
From the ashes a fire shall be woken,
a light from the shadows shall spring;
renewed shall be blade that was broken,
the crownless again shall be king.”

Quan Nguyen

unread,

Aug 9, 2012, 6:31:19 PM8/9/12

to tesser...@googlegroups.com

When Tesseract 3.02 is officially released, the author of tessdotnet will update to it. Then we'll have multiple language support.

https://github.com/charlesw/tesseract-ocr-dotnet/issues/4

Reply all

Reply to author

Forward

0 new messages