Issue 1482 in tesseract-ocr: Whitelist not working on 3.02.02 for Python


tesser...@googlecode.com

Jun 5, 2015, 12:58:50 PM

to tesserac...@googlegroups.com

Status: New
Owner: ----

New issue 1482 by darke...@yahoo.com: Whitelist not working on 3.02.02 for
Python
https://code.google.com/p/tesseract-ocr/issues/detail?id=1482

What steps will reproduce the problem?
1. Enter the following code at the beginning of a Python script:

import tesseract
ocr = tesseract.TessBaseAPI()
ocr.SetVariable("tessedit_char_whitelist", "0123456789;")
ocr.Init("C:\\Program Files (x86)\\Tesseract-OCR\\", "eng", tesseract.OEM_DEFAULT)

What is the expected output? What do you see instead?

The whitelist should restrict recognition to those characters, but
Tesseract ignores it and still returns other characters. I've tried calling
SetVariable both before and after Init, but that made no difference.
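
As a possible stopgap while the whitelist is being ignored, the recognized
text could be filtered after the fact. This is only a sketch:
filter_to_whitelist is a hypothetical helper, not part of Tesseract or of
any wrapper, and it merely discards unwanted characters from the output. It
does not make the engine prefer digits during recognition the way a working
tessedit_char_whitelist would.

```python
# Hypothetical post-processing helper (not a Tesseract API).
# It drops every character that is not in the whitelist, optionally
# keeping whitespace so that line structure survives.
WHITELIST = "0123456789;"


def filter_to_whitelist(text, whitelist=WHITELIST, keep_whitespace=True):
    allowed = set(whitelist)
    return "".join(
        ch for ch in text
        if ch in allowed or (keep_whitespace and ch.isspace())
    )


if __name__ == "__main__":
    # Stand-in for whatever string the OCR call returned.
    sample = "12a;3\n4O5"
    print(repr(filter_to_whitelist(sample)))
```

The result of the wrapper's text-retrieval call would be passed through the
helper in place of the sample string used here.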

What version of the product are you using? On what operating system?
3.02.02 on Windows 8.1

Please provide any additional information below.
I do have pytesseract/python-tesseract installed as well, if that changes
anything.

Thanks in advance for any help on this. I've been trying to fix this
problem all week without much progress.


tesser...@googlecode.com

Jun 7, 2015, 9:58:21 AM

to tesserac...@googlegroups.com

Updates:
Status: Invalid

Comment #1 on issue 1482 by zde...@gmail.com: Whitelist not working on
3.02.02 for Python
https://code.google.com/p/tesseract-ocr/issues/detail?id=1482

We do not provide support for Tesseract wrappers; see the FAQ [1] for instructions.
[1] https://code.google.com/p/tesseract-ocr/wiki/FAQ#Rules_and_advices
