Increase the quality of an image so that the text can be extracted from it properly


Mahima Goyal

Aug 1, 2018, 2:33:41 AM
to tesseract-ocr
I want to increase the quality of the image so that the text is extracted properly. Right now I am using Tesseract, but I am not able to extract a few things from the image.

In another image, I am not able to extract any data at all. Please guide me.


May

Aug 7, 2018, 1:29:09 PM
to tesseract-ocr
Could you share the image that you processed?

hebiya...@gmail.com

Sep 27, 2018, 6:10:53 AM
to tesseract-ocr

32935.jpg

I have the same problem

On Wednesday, August 8, 2018 at 1:29:09 AM UTC+8, May wrote:

Mark Phillips

Sep 27, 2018, 11:42:56 AM
to tesser...@googlegroups.com
Try --psm 11 or 12
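
For reference, in Tesseract's page segmentation modes, 11 is "sparse text" and 12 is "sparse text with OSD". A minimal Python sketch for trying both modes follows. It assumes the `tesseract` command-line tool is installed and on the PATH, and that the version in use accepts the `--psm` spelling of the flag. The helper names `tesseract_cmd` and `ocr_with_psm` are illustrative, not part of Tesseract.

```python
import os
import shutil
import subprocess


def tesseract_cmd(image_path, psm, lang="eng"):
    """Build a tesseract command line that prints recognized text to stdout."""
    # Passing "stdout" as the output base makes tesseract write to standard output.
    return ["tesseract", image_path, "stdout", "--psm", str(psm), "-l", lang]


def ocr_with_psm(image_path, psm):
    """Run tesseract with the given page segmentation mode and return the text."""
    result = subprocess.run(
        tesseract_cmd(image_path, psm),
        capture_output=True, text=True, check=True,
    )
    return result.stdout


if __name__ == "__main__":
    image = "32935.jpg"  # the attachment from this thread; adjust the path as needed
    # Only attempt OCR when both the CLI and the image are actually available.
    if shutil.which("tesseract") and os.path.exists(image):
        for psm in (11, 12):
            print(f"--- psm {psm} ---")
            print(ocr_with_psm(image, psm))
```

Comparing the two outputs side by side is usually the quickest way to see which mode suits a given layout.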

Mark

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/5663e43e-bd44-4115-b97e-9e0a659f77cb%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
<32935.jpg>

pbw

Sep 27, 2018, 6:15:26 PM
to tesseract-ocr
This is a difficult image. Get textcleaner from Fred's ImageMagick scripts here.

I tried the following parameters and got a reasonable image. You may do better by experimenting, but the text in the original image is poor.

textcleaner -g -e stretch -f 15 -o 10 -s 1 -c 125,250,100,200 <inputfile> <outputfile>
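
Since the right values depend on the image, one way to experiment is to generate the same command with a few different settings and compare the OCR results. The Python sketch below only builds the command lines. It assumes the `textcleaner` script is executable and on the PATH, reads `-f` as the filter size, `-o` as the offset, `-s` as the sharpening amount and `-c` as the crop offsets (per the script's usage text), and uses hypothetical helper names and candidate values.

```python
import shutil
import subprocess
from itertools import product


def textcleaner_cmd(infile, outfile, filtersize=15, offset=10,
                    sharpen=1, cropoff="125,250,100,200"):
    """Build the textcleaner command from this post with adjustable parameters."""
    return ["textcleaner", "-g", "-e", "stretch",
            "-f", str(filtersize), "-o", str(offset),
            "-s", str(sharpen), "-c", cropoff, infile, outfile]


def parameter_grid(filtersizes=(10, 15, 25), offsets=(5, 10, 20)):
    """Yield (filtersize, offset) pairs to try; the candidate values are guesses."""
    yield from product(filtersizes, offsets)


if __name__ == "__main__":
    # Only run the script when it is actually installed.
    if shutil.which("textcleaner"):
        for f, o in parameter_grid():
            out = f"cleaned_f{f}_o{o}.jpg"  # hypothetical output naming scheme
            subprocess.run(textcleaner_cmd("32935.jpg", out, f, o), check=True)
```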

sixt.jpg

hebiya...@gmail.com

Oct 10, 2018, 4:32:47 AM
to tesseract-ocr
Thank you. I tried textcleaner before, but the result was not good.

On Friday, September 28, 2018 at 6:15:26 AM UTC+8, pbw wrote:

Soumik Ranjan Dasgupta

Oct 11, 2018, 12:13:46 PM
to tesser...@googlegroups.com
I would suggest cropping the image part by part and running OCR on each part after some preprocessing, then concatenating the results at the end. It would take a bit more time, but it should work in theory.
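
A rough Python sketch of this crop-and-concatenate idea is shown below. It splits the page into horizontal strips, which is only one possible way to cut it up. The OCR step assumes that Pillow and pytesseract are installed, and the function names and the default strip height are illustrative choices. Any preprocessing would go between the crop and the OCR call.

```python
def strip_boxes(width, height, strip_height, overlap=0):
    """Return (left, top, right, bottom) boxes covering the image in horizontal strips."""
    if strip_height <= overlap:
        raise ValueError("strip_height must be larger than overlap")
    boxes = []
    top = 0
    step = strip_height - overlap
    while top < height:
        bottom = min(top + strip_height, height)
        boxes.append((0, top, width, bottom))
        if bottom == height:
            break
        top += step
    return boxes


def ocr_in_strips(image_path, strip_height=400, overlap=0):
    """Crop the image strip by strip, OCR each strip, and join the text."""
    # Assumption: Pillow and pytesseract are installed. They are imported here
    # so that strip_boxes can be used without them.
    from PIL import Image
    import pytesseract

    image = Image.open(image_path)
    width, height = image.size
    texts = []
    for box in strip_boxes(width, height, strip_height, overlap):
        # Preprocessing of the cropped strip would go here.
        texts.append(pytesseract.image_to_string(image.crop(box)))
    return "\n".join(texts)
```

A non-zero `overlap` reduces the chance of cutting a text line in half, at the cost of some duplicated lines in the joined output.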

--
Regards,
Soumik Ranjan Dasgupta

hebiya...@gmail.com

Oct 12, 2018, 2:49:48 AM
to tesseract-ocr
Thank you for your answer. This approach has introduced two new problems:
1. It is time-consuming.
2. Regions that are too small in pixels may not be recognized.
But the overall result is better than before.
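
Both problems can probably be reduced. The Python sketch below shows one possible direction: running the per-region OCR calls in parallel, and choosing an integer upscale factor for regions whose text is too small. The 30-pixel target height is an assumed rule of thumb that should be tuned, and `ocr_many` and `upscale_factor` are hypothetical helper names.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def upscale_factor(text_height_px, target_px=30):
    """Integer factor needed to bring text of the given height up to target_px."""
    # target_px is an assumption; the best value depends on the font and the engine.
    if text_height_px <= 0:
        raise ValueError("text_height_px must be positive")
    return max(1, math.ceil(target_px / text_height_px))


def ocr_many(items, ocr_func, max_workers=4):
    """Apply ocr_func to every item in parallel and return results in input order."""
    # Threads are enough when ocr_func spends its time in an external tesseract process.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(ocr_func, items))
```

Here `ocr_func` is any callable that takes one cropped region (or its file path) and returns the recognized text, so the results can still be concatenated in the original order.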

get-text-ROI-from-image-(96, 802, 1006, 314).png

result:
 Ee I
Loss Damage Waiver
Third Party Insurance
Premium location fea
—E% VAL CDAED
Est. Total: 0,00 AED
The charges above are based on the stated rental period ANY Chagas fo Ihe fetal pariod colid fesultin a re-caiculaliorn of charjes:
| acknowledge receipt of the vehicle identified in this agreement and that its condition is 2s indicated in the Vehicle Check Report. | agree fo pay
all rental charges, any charges for fines or offences plus AED 170,00 as service charge per fine associated with this vehicle while in my
possession, any damage not covered by a police report, as well as any missing items or accessories,
Wy signature below shall constitute my authority lo debit my nominated credit card with (he amounts due under this agreement. | have thoroughly
read, understood and accept all lems and conditions in this agreement For monthly agreements the above excess KM Rate (Per KM) wil apply:
If the vehicle is impounded by the police due to traffic offences, | wil be liable to pay the fine amount plus AED 770,00 as service charges.
All Sixt cars ara non-smoking cars, If a car needs additional cleaning/smel-remayal caused by smoking, 3 fee of AED 500,00 wil be added to the
invoice. (Fee For Smell Removal = OT)




On Friday, October 12, 2018 at 12:13:46 AM UTC+8, Soumik Ranjan Dasgupta wrote:

hebiya...@gmail.com

Oct 12, 2018, 2:51:40 AM
to tesseract-ocr

get-text-ROI-from-image.png



On Friday, October 12, 2018 at 2:49:48 PM UTC+8, hebiya...@gmail.com wrote: