Hello,everyone:
I now use tesseract 4.0.0 to recognize the content of table image. The sample image is in the attach files(5-a.jpg)
When I use the command:
tesseract 5-a.jpg command -l chi_sim_layer
In the command.txt there will be some characters "|" , I think they are delimiters.
But when I extract the cell from the image and binarization.(in the attach file 11.png)
tesseract 11.png stdout -l chi_sim_layer --psm 7
The result is "- |"
I want to know why the 11.png contains "|" ,this character does not in my unicharset,what does "|" mean? thank you in advance.
Sorry for my poor English.