could not open file eng.user-words


Glen Rubin

Jun 2, 2014, 4:03:31 PM
to tesser...@googlegroups.com
I am running Tesseract on Windows. When I run it from the command line and specify a config file, Tesseract gives me an error message saying that it cannot open the user-words file. Weirder still, it looks like the eng.user-words file has somehow been renamed to just eng?

zdenko podobny

Jun 2, 2014, 4:54:09 PM
to tesser...@googlegroups.com
Can you please provide the exact command you used and the exact error message?

Zdenko



Glen Rubin

Jun 3, 2014, 1:13:05 AM
to tesser...@googlegroups.com
I corrected this by just adding .txt to the filename and specifying user-words.txt in my config file. Cheers.

Glen Rubin

Jun 3, 2014, 1:25:30 AM
to tesser...@googlegroups.com
Actually, I was mistaken: it is still not working.

My command is:

tesseract image23.png image23g prov.txt

The result is:

Could not open file, C:\Program Files\Tesseract-OCR\tessdata/eng.user-words.txt
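
The path in the error message has the shape <tessdata directory>/<language>.<suffix>. Below is a minimal Python sketch for checking whether a file of that shape exists and can be opened. It assumes the shape seen in the message above holds in general; the helper names are hypothetical and are not part of Tesseract.

```python
from pathlib import Path


def expected_user_words_path(tessdata_dir, lang, suffix):
    """Build <tessdata_dir>/<lang>.<suffix>, the shape seen in the error message.

    Assumption: the file name is the language code, a dot, then the suffix.
    """
    return Path(tessdata_dir) / f"{lang}.{suffix}"


def describe(path):
    """Return a short status string for the candidate user-words file."""
    if not path.exists():
        return "missing"
    if not path.is_file():
        return "not a regular file"
    try:
        # Read one byte to confirm the file can actually be opened.
        with path.open("rb") as handle:
            handle.read(1)
    except OSError as exc:
        return f"unreadable: {exc}"
    return "ok"


if __name__ == "__main__":
    # Directory, language and suffix taken from the error message above.
    candidate = expected_user_words_path(
        r"C:\Program Files\Tesseract-OCR\tessdata", "eng", "user-words.txt"
    )
    print(candidate, "->", describe(candidate))
```

A status of "missing" here would point to a file name or location mismatch (for example a file named just eng) rather than a problem with the file's contents.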



zdenko podobny

Jun 3, 2014, 2:44:55 AM
to tesser...@googlegroups.com

Glen Rubin

Jun 3, 2014, 11:01:55 AM
to tesser...@googlegroups.com
When I check my end-of-line marker with Emacs, it says that it is Ctrl-J (line feed), as follows:
 
             position: 80 of 82 (96%), column: 6
            character: C-j (displayed as C-j) (codepoint 10, #o12, #xa)
    preferred charset: ascii (ASCII (ISO646 IRV))
code point in charset: 0x0A
               syntax:    which means: whitespace
             to input: type "C-x 8 RET HEX-CODEPOINT" or "C-x 8 RET NAME"
          buffer code: #x0A
            file code: #x0A (encoded by coding system undecided-dos)
              display: by this font (glyph code)
    uniscribe:-outline-Courier New-normal-normal-normal-mono-13-*-*-*-c-*-iso8859-1 (#x03)
Character code properties: customize what to show
  name: <control>
  old-name: LINE FEED (LF)
  general-category: Cc (Other, Control)
  decomposition: (10) ('
')
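
The Emacs output shows the character in the buffer (LF, code point 10), but the coding system name undecided-dos suggests the file on disk uses CRLF line endings, which Emacs hides when it decodes the file. As a cross-check, here is a minimal Python sketch that counts the line endings in the raw bytes. The function name and the example file name are hypothetical, and whether the line-ending style is what causes the error is not established in this thread.

```python
def count_line_endings(data: bytes) -> dict:
    """Count CRLF, bare LF and bare CR sequences in raw file bytes."""
    crlf = data.count(b"\r\n")
    # Every CRLF pair also contains one LF and one CR, so subtract them.
    lf = data.count(b"\n") - crlf
    cr = data.count(b"\r") - crlf
    return {
        "crlf": crlf,
        "lf": lf,
        "cr": cr,
        # True when the last line is terminated by a newline character.
        "ends_with_newline": data.endswith(b"\n"),
    }


if __name__ == "__main__":
    import sys

    # Usage: python check_eol.py eng.user-words.txt  (file names are examples)
    for name in sys.argv[1:]:
        with open(name, "rb") as handle:  # binary mode: no newline translation
            print(name, count_line_endings(handle.read()))
```

Opening the file in binary mode matters here, because text mode would translate CRLF to LF before the count.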