Only use the Line-Detection of Tesseract

91 views
Skip to first unread message

dũng lê

unread,
Sep 10, 2021, 4:04:29 AM9/10/21
to tesseract-ocr
Hello everyone,
I only want to use the Line-Detection Module of Tesseract, not the Word Detection Module or Character Classification Module.
How can I do it ?
Thanks,

Zdenko Podobny

unread,
Sep 10, 2021, 8:07:52 AM9/10/21
to tesser...@googlegroups.com
Can you please elaborate what do you mean by Line-Detection Module of Tesseract/Word Detection Module/Character Classification Module?
What did you do, and what do you want to achieve?

Zdenko

Dňa pi 10. 9. 2021, 10:04 dũng lê <duydung...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3d814ffe-f36d-44b5-b0e4-2dd92c2d53c4n%40googlegroups.com.

dũng lê

unread,
Sep 11, 2021, 4:08:57 AM9/11/21
to tesseract-ocr
I read from this paper of Google about Tesseract:
https://research.google.com/pubs/archive/33418.pdf
z2750605265252_9b5504641228d9450935b8d1a80ee916.jpg
So I only want to use this module, then I will use my module to recognize, not the module of Tesseract 
Thanks
Vào lúc 19:07:52 UTC+7 ngày Thứ Sáu, 10 tháng 9, 2021, zdenop đã viết:

Zdenko Podobny

unread,
Sep 11, 2021, 4:50:32 AM9/11/21
to tesser...@googlegroups.com
I am still not sure if I understand you correctly but:
1. This is a very old document.
2. IMO you would like to use the document to analyze part of the tesseract only.

So have a look at API examples[1] for getting Result iterator and GetComponentImages examples. PageIteratorLevel helps you get the level (Block, Paragraph, Line, Word, Symbol) you are interested in.
And of course have a look at other API functions that can be useful for your project (e.g. GetTextlines, GetRegions...) 


Zdenko


so 11. 9. 2021 o 10:08 dũng lê <duydung...@gmail.com> napísal(a):
Reply all
Reply to author
Forward
0 new messages