Simple example


Richard Barraclough

Nov 2, 2024, 11:13:57 AM
to tesseract-ocr
Hi,
I'm trying to get a simple example to work but can't.

With the attached file I run
```
$ tesseract tesseract.jpg tesseract
```
and was expecting to get a file `tesseract.txt` containing the text `12.3`.
However, although `tesseract.txt` is created, it is empty.

The only output from the `tesseract` command is
```
Estimating resolution as 1281
```

I don't understand why.

tesseract.jpg

Zdenko Podobny

Nov 2, 2024, 11:17:08 AM
to tesser...@googlegroups.com
Maybe because you did not read the documentation?

Zdenko


On Sat, Nov 2, 2024 at 16:13, Richard Barraclough <goo...@rwb.me.uk> wrote:

Richard Barraclough

Nov 2, 2024, 11:35:32 AM
to tesseract-ocr
```
TESSERACT(1)                                                        TESSERACT(1)

NAME
       tesseract - command-line OCR engine

SYNOPSIS
       tesseract FILE OUTPUTBASE [OPTIONS]... [CONFIGFILE]...

DESCRIPTION
       tesseract(1) is a commercial quality OCR engine originally developed at HP between 1985 and 1995. In 1995, this engine was among the top 3 evaluated by UNLV. It was open-sourced by HP and UNLV in 2005, and has been developed
       at Google since then.

IN/OUT ARGUMENTS
       FILE
           The name of the input file. This can either be an image file or a text file.

           Most image file formats (anything readable by Leptonica) are supported.

           A text file lists the names of all input images (one image name per line). The results will be combined in a single file for each output file format (txt, pdf, hocr, xml).

           If FILE is stdin or - then the standard input is used.

       OUTPUTBASE
           The basename of the output file (to which the appropriate extension will be appended). By default the output will be a text file with .txt added to the basename unless there are one or more parameters set which
           explicitly specify the desired output.

           If OUTPUTBASE is stdout or - then the standard output is used.
```
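
For anyone following along, here is a rough Python sketch of how the SYNOPSIS and the OUTPUTBASE rules quoted above fit together. The helper names `build_command`, `expected_text_path` and `run_ocr` are made up for illustration and are not part of Tesseract; the sketch only assumes that a `tesseract` executable is on the PATH and that no option changes the default text output.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, Optional, Sequence


def build_command(image: str, outputbase: str, options: Sequence[str] = ()) -> List[str]:
    """Assemble argv following the SYNOPSIS: tesseract FILE OUTPUTBASE [OPTIONS]..."""
    return ["tesseract", image, outputbase, *options]


def expected_text_path(outputbase: str) -> Optional[Path]:
    """Return where the default text output should land, or None for stdout.

    Per the man page, ".txt" is appended to OUTPUTBASE, and "-" or "stdout"
    sends the result to standard output instead of a file.
    """
    if outputbase in ("-", "stdout"):
        return None
    return Path(outputbase + ".txt")


def run_ocr(image: str, outputbase: str, options: Sequence[str] = ()) -> str:
    """Hypothetical wrapper: run tesseract and return the recognised text."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    result = subprocess.run(
        build_command(image, outputbase, options),
        capture_output=True,
        text=True,
        check=True,  # raise if tesseract exits with a non-zero status
    )
    path = expected_text_path(outputbase)
    # With "-" or "stdout" the text is on stdout; otherwise read OUTPUTBASE.txt.
    return result.stdout if path is None else path.read_text(encoding="utf-8")
```

With this, `run_ocr("tesseract.jpg", "tesseract")` would run the same command as in the first message and read back `tesseract.txt`. An empty string coming back would mean the command itself ran but no text was recognised, which matches what is described in this thread.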

Zdenko Podobny

Nov 2, 2024, 11:37:03 AM
to tesser...@googlegroups.com

On Sat, Nov 2, 2024 at 16:35, Richard Barraclough <goo...@rwb.me.uk> wrote:

Richard Barraclough

Nov 2, 2024, 12:27:56 PM
to tesseract-ocr
```
tesseract tesseract.jpg output --oem 1 -l eng
```
Still nothing.

Do you even know how to make it work?
If you do then why not just tell me instead of being a cunt?