Caching in TrainLineRecognizer?

Visto 38 veces
Saltar al primer mensaje no leído

Jens Weibler

no leída,
5 mar 2017, 1:32:365/3/17
a tesseract-ocr
Hi,

I'm new to tesseract and wondered why the lstm dataset creation for the training process has to write the file again and again in TrainLineRecognizer. I've seen 200MB/s IO on the disk while creating the training data set.
As far I can see for the training case it would be sufficient to just load it once and write it at the end. The same applies to the box and tif file - but these are only read and not written...


Thanks,
Jens Weibler
Responder a todos
Responder al autor
Reenviar
Se ha eliminado el mensaje
0 mensajes nuevos