Ban some characters on tessseract ( '/' , '|' , ',' , ...)

83 views
Skip to first unread message

Guillaume de Rybel

unread,
Mar 6, 2020, 7:49:41 AM3/6/20
to tesseract-ocr
Hi, my work is to recognize license plates, and sometimes, tesseract recognize some special characters. I need to 'ban' those characters : '/' , '|' , ',' .
May I have some help ?

Sorry if my english is bad,
 Thank you,

Guillaume

Shree Devi Kumar

unread,
Mar 6, 2020, 9:03:04 AM3/6/20
to tesseract-ocr
Search for whitelist / blacklist in forum for ways to restrict the characters.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/c02ecca9-2157-49db-8149-0745a3bfab19%40googlegroups.com.

Guillaume de Rybel

unread,
Mar 6, 2020, 3:16:30 PM3/6/20
to tesseract-ocr
Thanks a lot ! I found what I was looking for ! :
txt = pytesseract.image_to_string(image, config="-c tessedit_char_whitelist=0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ-")

Le vendredi 6 mars 2020 15:03:04 UTC+1, shree a écrit :
Search for whitelist / blacklist in forum for ways to restrict the characters.

On Fri, Mar 6, 2020, 18:19 Guillaume de Rybel <guillaum...@gmail.com> wrote:
Hi, my work is to recognize license plates, and sometimes, tesseract recognize some special characters. I need to 'ban' those characters : '/' , '|' , ',' .
May I have some help ?

Sorry if my english is bad,
 Thank you,

Guillaume

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesser...@googlegroups.com.

Shubhranshu Panda

unread,
Mar 9, 2020, 3:17:47 AM3/9/20
to tesseract-ocr
you can also take help of regex.
Reply all
Reply to author
Forward
0 new messages