repository changes


Tom

Oct 9, 2011, 10:03:32 AM
to ocr...@googlegroups.com
The code refactoring is finally nearing an end and the next release is getting closer and closer.

Part of that is some repository changes. The top-level "ocropus" repository is now a repository containing subrepositories. On a recent Ubuntu system, you should be able to type:

cd ocropus
sudo sh install-ubuntu-packages
sh compile-ocropus

If you still need access to the old ocropus repository, you can pull from https://code.google.com/p/ocropus.ocroold

Tom

Raj Julha

Oct 10, 2011, 2:52:46 AM
to ocropus
Thanks for this.
I used version 0.4 for my final-year dissertation to recognise handwritten
historical documents (George Washington's handwriting) but wasn't
successful at creating a good character model, so the recognition rate was
close to nil. I had split each word into a separate image and had the
corresponding ground truth, but didn't have enough knowledge of the
ocropus libraries to create the cseg files needed to extract the
characters. I plan to give it another go with the new version.

Cheers

Raj

Bill Janssen

Nov 7, 2011, 9:03:42 PM
to ocropus
Apparently not quite functional yet, though. I just did the above on
a 10.04 system, and tried running a page through:

% ocropus-pages /tmp/cleaned.png
/usr/lib/pymodules/python2.6/matplotlib/backends/backend_gtk.py:621: DeprecationWarning: Use the new widget gtk.Tooltip
  self.tooltips = gtk.Tooltips()
Traceback (most recent call last):
  File "/usr/local/bin/ocropus-pages", line 108, in <module>
    preproc = ocrolib.make_IBinarize(options.clean)
  File "/usr/local/lib/python2.6/dist-packages/ocrolib/common.py", line 556, in make_IBinarize
    return mkpython(name) or Binarize().make(name)
NameError: global name 'Binarize' is not defined
%

Bill Janssen

Nov 7, 2011, 9:23:11 PM
to ocropus
Whoops! That's from my old install. The latest fails like this:

% ocropus-lattices cleaned.png
# gcc -g -O4 -lm -shared -fopenmp -std=c99 -fPIC .pynative/73fa8ca7cf53b3733eb77b5f1798ddb2.c -o .pynative/73fa8ca7cf53b3733eb77b5f1798ddb2.so
Traceback (most recent call last):
  File "/usr/local/bin/ocropus-lattices", line 84, in extract
    lattice,rseg = linerec.recognizeLineSeg(image)
  File "/usr/local/lib/python2.6/dist-packages/ocrolib/segrec.py", line 273, in recognizeLineSeg
    raise RecognitionError("line image too high (maybe rescale?)",image=image)
NameError: global name 'RecognitionError' is not defined
/usr/local/bin/ocropus-lattices:90 (extract) FATAL internal error
%

What's up here? Clearly segrec doesn't import RecognitionError from
common.
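
For anyone hitting the same thing, here is a minimal sketch of why a missing import like this only shows up at run time, when the raise statement is actually reached. Everything in it is a hypothetical stand-in, not the actual ocrolib code: the exception class, the recognize_line function, and the max_height value are made up for illustration, and the exec-based loader just simulates a module namespace with and without the imported name.

```python
# Hypothetical stand-in for an exception class defined in a "common" module.
class RecognitionError(Exception):
    def __init__(self, message, **extra):
        Exception.__init__(self, message)
        self.extra = extra  # e.g. the offending image


# Hypothetical stand-in for a "segrec"-style module that uses the exception.
# The threshold of 300 is an arbitrary illustrative value.
SEGREC_SOURCE = '''
def recognize_line(height, max_height=300):
    if height > max_height:
        raise RecognitionError("line image too high (maybe rescale?)")
    return "ok"
'''


def load_recognizer(namespace):
    """Execute the module source in the given namespace and return its function."""
    exec(SEGREC_SOURCE, namespace)
    return namespace["recognize_line"]


# Namespace without the name: simulates the missing import.
broken = load_recognizer({})

# Namespace with the name: simulates importing the class from common.
fixed = load_recognizer({"RecognitionError": RecognitionError})
```

With these stand-ins, broken(100) returns normally because the raise line is never executed, broken(1000) fails with a NameError instead of the intended exception, and fixed(1000) raises RecognitionError as intended. If the real code follows the same pattern, the fix would presumably be importing RecognitionError from common in segrec.py; that is an inference from the traceback, not something checked against the source.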