Does Tesseract 4.0 performance differ depending on the OS?


이경준

Mar 15, 2018, 3:13:25 AM
to tesseract-ocr
Hi Shree, I'm using Tesseract 4.0 alpha on Ubuntu 16.04.03 LTS.

You gave me two scripts for Tesseract 4.0, but I think they are meant to run on Windows.

So I am changing my Tesseract 4.0 environment from Ubuntu to Windows 10.

Does Tesseract 4.0 performance differ depending on the OS?

Also, I still have not solved the "can't encode transcription" error after replacing the top layer.

I concluded that I have to change my OS (Ubuntu -> Windows 10).

Thank you.

Can you check my training_text for fine-tuning?

이경준

Mar 15, 2018, 3:16:54 AM
to tesseract-ocr
P.S. Could you give me some advice on running Tesseract 4.0 on Windows, for example with bash scripts?

Thank you.

ShreeDevi Kumar

Mar 15, 2018, 3:56:59 AM
to tesser...@googlegroups.com
> Tesseract 4.0 alpha on Ubuntu 16.04.03 LTS

Please use the latest version (beta.1) or build from source on GitHub.

> I think they are meant to run on Windows.

No, they are not run by Windows itself. They run on 'bash under Windows', which provides Ubuntu 14.04. It can use fonts installed under Windows.
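
A minimal sketch of how a script could tell whether it is running in bash under Windows and guess where the fonts live. Everything here is an assumption rather than part of the scripts discussed in this thread: the helper names `is_wsl_text` and `fonts_dir` are hypothetical, the check for "microsoft" in `/proc/version` is only a heuristic, and `/mnt/c/Windows/Fonts` assumes the usual mount point for the Windows C: drive.

```shell
#!/usr/bin/env bash
# Heuristic sketch, not taken from the thread's scripts.

# is_wsl_text (hypothetical helper) succeeds if the text on stdin looks like
# the kernel banner of bash under Windows, which mentions "Microsoft".
is_wsl_text() {
  grep -qi 'microsoft'
}

# fonts_dir (hypothetical helper) prints a guess for the font directory.
# Assumption: the Windows C: drive is mounted at /mnt/c.
fonts_dir() {
  if [ -r /proc/version ] && is_wsl_text < /proc/version; then
    echo "/mnt/c/Windows/Fonts"
  else
    echo "/usr/share/fonts"
  fi
}

echo "fonts directory guess: $(fonts_dir)"
```

The printed directory is only a starting point; it should be checked by hand before it is passed to any training script.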

> Does Tesseract 4.0 performance differ depending on the OS?

Quite possible. It will also depend on how many of the changes from GitHub are included in each build.
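
To see which build each system actually has, one option is to print the first line of `tesseract --version` on both machines and compare them. This is a sketch only: `first_line` and `report_tesseract_version` are hypothetical helper names, and the script assumes nothing about which version is installed.

```shell
#!/usr/bin/env bash
# Sketch: report the installed Tesseract version so two systems can be compared.

# first_line (hypothetical helper) keeps only the first line of stdin.
first_line() {
  head -n 1
}

# report_tesseract_version prints the version line, or a note if the
# tesseract binary is not on PATH.
report_tesseract_version() {
  if command -v tesseract >/dev/null 2>&1; then
    # Merge stderr into stdout in case the version text is written there.
    tesseract --version 2>&1 | first_line
  else
    echo "tesseract not found on PATH"
  fi
}

report_tesseract_version
```

If the two machines print different version lines, any difference in results may come from the builds rather than from the operating systems.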

> I still have not solved the "can't encode transcription" error after replacing the top layer

I cannot reproduce the problem. Please send your training_text and the font that is giving you the error so that I can check with them.

Bash scripts will not run directly on Windows.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/e6b57b7a-5ba6-4ca6-a7eb-864a2abf94a2%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

이경준

Mar 15, 2018, 4:05:43 AM
to tesseract-ocr
Thank you so much.

1) How do I replace Tesseract 4.00 alpha with Tesseract 4.00 beta?

Thank you.

이경준

Mar 15, 2018, 4:12:51 AM
to tesseract-ocr

The attached file is my training_text.

For fine-tuning, I use the following fonts.

Training fonts: Baekmuk Dotum, Baekmuk Gulim, Baekmuk Headline

Evaluation font: Baekmuk Batang

You can install the Baekmuk fonts with:

$ sudo apt-get install fonts-baekmuk

I don't know which font is giving me the error, so I am describing my training environment and settings instead.
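
Before training, it may help to confirm that all four fonts are visible on the system. The sketch below assumes fontconfig's `fc-list` is available; `has_font` and `check_fonts` are hypothetical helper names, and the match is a plain case-insensitive substring search on the `fc-list` output.

```shell
#!/usr/bin/env bash
# Sketch: check that each font named in this post is visible to fontconfig.

# has_font (hypothetical helper) reads a font listing on stdin and succeeds
# if the given name appears in it (case-insensitive, fixed-string match).
has_font() {
  grep -qiF -- "$1"
}

# check_fonts prints one status line per font name passed as an argument.
check_fonts() {
  local listing name
  if ! command -v fc-list >/dev/null 2>&1; then
    echo "fc-list not found; install fontconfig to run this check"
    return 0
  fi
  listing="$(fc-list)"
  for name in "$@"; do
    if printf '%s\n' "$listing" | has_font "$name"; then
      echo "found:   $name"
    else
      echo "missing: $name"
    fi
  done
}

check_fonts "Baekmuk Dotum" "Baekmuk Gulim" "Baekmuk Headline" "Baekmuk Batang"
```

A font reported as missing would be the first thing to fix before looking further into the training error.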

Attachment: kor.plus.training_text

ShreeDevi Kumar

Mar 15, 2018, 5:22:47 AM
to tesser...@googlegroups.com
> 1) How do I replace Tesseract 4.00 alpha with Tesseract 4.00 beta?

How did you install Tesseract 4.00 alpha?

이경준

Mar 15, 2018, 6:06:15 AM
to tesseract-ocr
Hi,
First, I set up the PPA.
Second, I ran sudo apt-get remove.
Third, I ran sudo apt-get install tesseract-ocr.

이경준

Mar 15, 2018, 6:08:09 AM
to tesseract-ocr
I was wondering whether it is possible to install Tesseract 4.0 beta the way one installs a patch file...
Sorry ㅠㅠ

이경준

Mar 15, 2018, 9:26:04 AM
to tesseract-ocr
Is this the way to completely remove Tesseract 4.0 (alpha)?

$ sudo apt-get remove tesseract-ocr

$ sudo apt autoremove

Is that right?

ShreeDevi Kumar

Mar 15, 2018, 9:35:40 AM
to tesser...@googlegroups.com
sudo apt-get purge packagename, or 
sudo apt-get remove --purge packagename
will remove about everything regarding the package packagename, [...]
Particularly useful when you want to 'start all over' with an application


sudo apt-get autoremove
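
Applied to this thread, the steps above could be sketched as follows. The script is a dry run by default and only prints what it would do. `DRY_RUN`, `run`, and `purge_package` are hypothetical names introduced for this example, and `tesseract-ocr` is the package name taken from the earlier message.

```shell
#!/usr/bin/env bash
# Sketch: print (and optionally run) the removal steps described above.
# DRY_RUN is a hypothetical switch for this example. It defaults to 1, so
# nothing is removed unless it is explicitly set to 0.

# run prints the command in dry-run mode, otherwise executes it.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "would run: $*"
  else
    "$@"
  fi
}

# purge_package removes the package with its configuration files, then
# removes dependencies that are no longer needed.
purge_package() {
  run sudo apt-get purge "$1"
  run sudo apt-get autoremove
}

purge_package tesseract-ocr
```

Running it with `DRY_RUN=0` would execute the commands for real, so the printed list is worth reading first.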


ShreeDevi

이경준

Mar 15, 2018, 9:54:11 AM
to tesseract-ocr
Thank you so much.

이경준

Mar 15, 2018, 11:46:33 AM
to tesseract-ocr
I am now installing Ubuntu 18.04 for Tesseract 4.00 beta.1.

Is that right?

ShreeDevi Kumar

Mar 15, 2018, 12:09:11 PM
to tesser...@googlegroups.com
No.

You can use Alex's PPA and install it for your version of Ubuntu.
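
A sketch of what installing from a PPA usually looks like. The thread does not give the PPA's identifier, so `ppa:alex-p/tesseract-ocr` below is an assumption and should be checked before use. `install_steps` is a hypothetical helper that only prints the commands, leaving it to the reader to run them.

```shell
#!/usr/bin/env bash
# Sketch: list the usual commands for installing a package from a PPA.
# Assumption: the PPA identifier below is not stated in the thread.
PPA="${PPA:-ppa:alex-p/tesseract-ocr}"

# install_steps (hypothetical helper) prints the commands, one per line,
# for the PPA given as its first argument. It does not execute anything.
install_steps() {
  printf '%s\n' \
    "sudo add-apt-repository $1" \
    "sudo apt-get update" \
    "sudo apt-get install tesseract-ocr"
}

install_steps "$PPA"
```

After the commands have been run by hand, `tesseract --version` shows whether the newer build is the one now installed.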


