What does special character "|" mean?

26 views
Skip to first unread message

易鑫

unread,
Mar 28, 2019, 11:21:18 PM3/28/19
to tesseract-ocr
Hello,everyone:

I now use tesseract 4.0.0 to recognize the content of table image. The sample image is in the attach files(5-a.jpg)

When  I use the command:

tesseract 5-a.jpg command  -l chi_sim_layer

In the command.txt  there will be some characters "|" , I think they are delimiters.

But when I extract the cell from the image and binarization.(in the attach file 11.png)

tesseract 11.png stdout -l chi_sim_layer --psm 7

The result is "- |"


I want to know why the 11.png contains "|" ,this character does not in my unicharset,what does "|" mean?  thank you in advance.

Sorry for my poor English.









 
5-a.jpg
command.txt
11.png

Shree Devi Kumar

unread,
Mar 29, 2019, 12:45:34 AM3/29/19
to tesser...@googlegroups.com
The training text that I used for replace layer has the | character. 

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/f0556f69-16f6-4233-a186-a3a6f251a91d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

易鑫

unread,
Mar 29, 2019, 1:34:27 AM3/29/19
to tesseract-ocr
I' m sorry I don't quite understand what you mean.Shall I ignore the "|" character ?

Shree Devi Kumar <shree...@gmail.com> 于2019年3月29日周五 下午12:45写道:

易鑫

unread,
Mar 29, 2019, 2:13:25 AM3/29/19
to tesseract-ocr
my training text do not have the "|" character, does this character is reserved?

易鑫 <yixinl...@gmail.com> 于2019年3月29日周五 下午1:34写道:
Reply all
Reply to author
Forward
0 new messages