OCR chess scoresheet

147 views
Skip to first unread message

Arvind M

unread,
Jun 29, 2015, 2:40:11 AM6/29/15
to tesser...@googlegroups.com
Hi,

I am looking for a way to OCR my son's handwritten chess scoresheet after tournaments. I have attached a sample scoresheet snippet (full seems to be too big), but you can see sample game in pgn format at https://en.wikipedia.org/wiki/Portable_Game_Notation. I just stumbled upon tesseract and thought this might save me time. 

System:

Darwin 13.4.0 Darwin Kernel Version 13.4.0: Wed Mar 18 16:20:14 PDT 2015

tesseract 3.02.02

leptonica-1.71

libgif 4.2.3 : libjpeg 9a : libpng 1.6.16 : libtiff 4.0.3 : zlib 1.2.8 : libwebp 0.4.2 : libopenjp


I am not having much success following instructions on http://tesseract-ocr.googlecode.com/svn/trunk/doc/tesseract.1.html

In eng.user-words, I have:
WHITE
BLACK
SIGNATURE
RESULT
WON
DRAW
O-O
O-O-O

\d*

\A*\a*X*\a\d+*#*


I can't use eng.user-patters since I have to provide at least 4 concrete characters at the beginning.

Command:

tesseract CCI06282015.png output bazaar


I have attached output.txt - its pretty bad. Any suggestions on how I can improve?

Thanks,
- Arvind

output.txt
chess.png
Reply all
Reply to author
Forward
0 new messages