using OCRopus3

120 views
Skip to first unread message

hounaida Moalla

unread,
Jul 21, 2018, 5:33:32 AM7/21/18
to ocropus
Hello everyone

I used ocropus (in its original version), then ocropy (improved version), now I would like to use ocropus3 (in its new version). I wanted to start learning on my database with the following command:

ocroline-train -d myDataBase.gz -t myDataBase.gz -T 100 -o testmodel --epochs 1

but I got the following error:

/usr/local/lib/python2.7/dist-packages/matplotlib/__init__.py:962: UserWarning: Duplicate key in file "/home/ubuntu/.config/matplotlib/matplotlibrc", line # 2
  
(fname, cnt))
/usr/local/lib/python2.7/dist-packages/matplotlib/__init__.py:962: UserWarning: Duplicate key in file "/home/ubuntu/.config/matplotlib/matplotlibrc", line # 3
  
(fname, cnt))
['CenterNormalizer', 'LineRecognizer', '__builtins__', '__doc__', '__file__', '__name__', '__package__', '__path__', 'lineest', 'recognizer']
Traceback (most recent call last):
  
File "/ usr / local / bin / ocroline-train", line 214, in <module>
    
sample = training_data.next ()
  
File "/usr/local/lib/python2.7/dist-packages/dlinputs/filters.py", line 266, in map
    
for sample in data:
  
File "/usr/local/lib/python2.7/dist-packages/dlinputs/filters.py", line 266, in map
    
for sample in data:
  
File "/usr/local/lib/python2.7/dist-packages/dlinputs/filters.py", line 508, in batchedbuckets
    
for sample in data:
  
File "/usr/local/lib/python2.7/dist-packages/dlinputs/filters.py", line 235, in ren
    
raise ValueError ("missing keys, got {}, want {}". format (sample.keys (), kw)
ValueError: missing keys, got ['__source__', 'txt', '__key__', 'index'], want {'image': 'png', 'transcript': 'txt'}

knowing that my database respects the file format of the uw3 database used by the author of ocropus.
can someone help me please?

thank you
Reply all
Reply to author
Forward
0 new messages