Issue 832 in tesseract-ocr: "Too many unichars in ambiguity on line *" error when i try to have a test on it firstly.

tesser...@googlecode.com

Jan 24, 2013, 8:21:56 AM
to tesserac...@googlegroups.com
Status: New
Owner: ----

New issue 832 by wushuang...@gmail.com: "Too many unichars in ambiguity on
line *" error when i try to have a test on it firstly.
http://code.google.com/p/tesseract-ocr/issues/detail?id=832

What steps will reproduce the problem?
1. Download and install tesseract-ocr-setup-3.02.02.exe, download
chi_sim.traineddata, and put it in the "tessdata" folder.
2. Run the command "tesseract.exe pic.jpg ling -l chi_sim".

What is the expected output? What do you see instead?
It is supposed to generate ling.txt with some recognized characters in it,
but I only see errors like this:
"C:\Users\Administrator>F:\Tesseract-OCR-FromEXE\tesseract
romEXE\test1.jpg ling -l chi_sim
Too many unichars in ambiguity on line 18558088
Too many unichars in ambiguity on line 18558088
Too many unichars in ambiguity on line 18578624
Tesseract Open Source OCR Engine v3.02 with Leptonica"

What version of the product are you using? On what operating system?
tesseract-ocr 3.02 on the English version of Windows 7.

Please provide any additional information below.
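
For anyone trying to reproduce this, the steps above can be sketched as a small Python script. This is only a sketch under assumptions: the helper names build_command and run_and_report are hypothetical, "tesseract" is assumed to be on PATH, and the report does not say whether ling.txt is still written when the messages appear, so the script checks for the file instead of assuming either way.

```python
import subprocess
from pathlib import Path


def build_command(image, output_base, lang, exe="tesseract"):
    """Return the argv list for: tesseract <image> <output_base> -l <lang>."""
    return [exe, str(image), str(output_base), "-l", lang]


def run_and_report(image, output_base, lang, exe="tesseract"):
    """Run Tesseract; return (exit code, stderr text, whether the .txt exists)."""
    proc = subprocess.run(
        build_command(image, output_base, lang, exe),
        capture_output=True,  # keep console messages separate from the output file
        text=True,
    )
    # Tesseract appends ".txt" to the output base name given on the command line.
    txt_path = Path(str(output_base) + ".txt")
    return proc.returncode, proc.stderr, txt_path.exists()
```

Calling run_and_report("pic.jpg", "ling", "chi_sim") and posting the three returned values alongside the image would show whether the "Too many unichars in ambiguity" lines come with a missing output file or only accompany a normal run.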

tesser...@googlecode.com

Jan 24, 2013, 1:20:31 PM
to tesserac...@googlegroups.com

Comment #1 on issue 832 by zde...@gmail.com: "Too many unichars in
ambiguity on line *" error when i try to have a test on it firstly.
http://code.google.com/p/tesseract-ocr/issues/detail?id=832

Well, you see the message. So what is your problem?

tesser...@googlecode.com

Feb 2, 2013, 9:08:14 AM
to tesserac...@googlegroups.com
Updates:
Status: Invalid

Comment #2 on issue 832 by zde...@gmail.com: "Too many unichars in
ambiguity on line *" error when i try to have a test on it firstly.
http://code.google.com/p/tesseract-ocr/issues/detail?id=832

No real bug details provided.

tesser...@googlecode.com

Mar 13, 2013, 4:42:46 AM
to tesserac...@googlegroups.com

Comment #3 on issue 832 by landisk...@qq.com: "Too many unichars in
ambiguity on line *" error when i try to have a test on it firstly.
http://code.google.com/p/tesseract-ocr/issues/detail?id=832

I have encountered the same error too.
This is a very simple test, but it failed to produce a simple answer.
Can anybody tell me how to do this simple job?
Please email me, thanks.
The attached files are the simple picture and the complicated OCR result.

Windows XP SP3
Tesseract Open Source OCR Engine v3.02
Simplified Chinese Language Traindata

Attachments:
me.TIF 22.7 KB
me.txt 133 bytes

--
You received this message because this project is configured to send all
issue notifications to this address.
You may adjust your notification preferences at:
https://code.google.com/hosting/settings

tesser...@googlecode.com

Mar 13, 2013, 4:52:00 AM
to tesserac...@googlegroups.com

Comment #4 on issue 832 by landisk...@qq.com: "Too many unichars in
ambiguity on line *" error when i try to have a test on it firstly.
http://code.google.com/p/tesseract-ocr/issues/detail?id=832

I have encountered the same error too.
This is a very simple test, but it failed to produce a simple result.
Can anybody tell me how to do this simple job?
Please email me, thanks.
The attached files are the simple picture and the complicated OCR result.

tesser...@googlecode.com

Mar 24, 2013, 3:38:15 AM
to tesserac...@googlegroups.com

Comment #5 on issue 832 by KunZhan...@gmail.com: "Too many unichars in
ambiguity on line *" error when i try to have a test on it firstly.
http://code.google.com/p/tesseract-ocr/issues/detail?id=832

I also hit this issue when recognizing Simplified Chinese with
Tesseract-OCR 3.02.

The only difference is that I used the iOS library. Can anyone help?

tesser...@googlecode.com

Aug 19, 2015, 10:46:27 AM
to tesserac...@googlegroups.com

Comment #6 on issue 832 by paulwang...@gmail.com: "Too many unichars in
ambiguity on line *" error when i try to have a test on it firstly.
https://code.google.com/p/tesseract-ocr/issues/detail?id=832

I also got this issue with Simplified Chinese in Tesseract-OCR 3.02.

My Env:
Win7