carolin@carolin-VirtualBox:~/ocrinput$ ocropus-recognize-book testbild2.png -o output.htmlWe attached the sample file which we used. Is there any obvious thing that we are doing wrong?
book directory ./_book-003266
testbild2.png -> ./_book-003266/0001.png
=== preprocess
# ocropus-nlbin ./_book-003266/????.png
=== ./_book-003266/0001.png 1
flattening
estimating skew angle
estimating thresholds
rescaling
./_book-003266/0001.png lo-hi (0.05 1.00) angle 0.0
writing
=== page segmentation
# ocropus-gpageseg ./_book-003266/????.bin.png
./_book-003266/0001.bin.png
./_book-003266/0001.bin.png: scale (6.9282) less than --minscale; skipping
=== line recognition
# ocropus-lattices ./_book-003266/????/??????.bin.png
Traceback (most recent call last):
File "/usr/local/bin/ocropus-lattices", line 56, in <module>
args.files = ocrolib.glob_all(args.files)
File "/usr/local/lib/python2.7/dist-packages/ocrolib/toplevel.py", line 204, in argument_checks
result = f(*args,**kw)
File "/usr/local/lib/python2.7/dist-packages/ocrolib/common.py", line 509, in glob_all
raise Exception("%s: expansion did not yield any files"%arg)
Exception: ./_book-003266/????/??????.bin.png: expansion did not yield any files
Traceback (most recent call last):
File "/usr/local/bin/ocropus-recognize-book", line 79, in <module>
run(args.linerec,book+"/????/??????.bin.png",m=args.model)
File "/usr/local/bin/ocropus-recognize-book", line 57, in run
assert subprocess.call(args)==0
AssertionError
Tom,
As suggested by you that training requires a lot of memory, I am thinking whether it is feasible to create swap area to 20GB in addition to existing 4GB RAM - which is maximum for my Dell machine. However I find program does not utilize the swap area effectively - for which suitable coding is required I guess - ultimately solve the problem of memory error. At present swap area is 16GB.
Alternatively whether external HDD or internal one HDD can be utilized as combined RAM+swap?
With Warmest Regards,
-sriranga(79yrs)On Fri, Oct 5, 2012 at 2:48 PM, Tom <tmb...@gmail.com> wrote:
wed, I'm sorry I couldn't help you more. , am not aware Of any memoryleaks, but as I indicated, training requires a lot of memory, We Usually use 8-16GB of memory for training.
Tom
On Oct 4, 2012 9:25 AM, "Sriranga(78yrsold)" <withbl...@gmail.com> wrote:
My computer Dell optiplex 330(System Type X86-based PC)
which has Processor Intel(R)core(TM)DuoCPU E7...@2.53GHz 2.53GHz OS win XPwith Sp3. RAM 4GB(recently added more ram) Ubuntu 12.04(32bit) and also 12.10.(both has swap of 16GB) It appears that my machine does not support Ocropus project which requires higher capacity computer..
After several experiments with python 0.6 it is observed very often memory error will displayed. I feel there is leakage of memory in relevant python source codes of 0.6 more over some of py programs does not support kannada script except during run-test and run-box-training. Since i am not python programmer nor developer. i find difficult to pursue the kannada OCR project under Ocropus Project due to support not received from any one except from you but you are too busy in release of next stage of version.
As such I am frustrated in kannada OCR project of Ocropus - decided to discontinue.
I am always ready to furnish Kannada text, tif or png with its box file generated in tesseract-ocr. to any python programmer or developer for research purpose at any time.and also undertake beta-testing/feedback of py programs for kannada at any stage, if any, received from you.
Thanks for the help rendered to me from time to time.
With warmest Regards,
-sriranga(79yrs)
To view this discussion on the web visit https://groups.google.com/d/msg/ocropus/-/c_q4jBZOA8UJ.--
You received this message because you are subscribed to the Google Groups "ocropus" group.
To post to this group, send email to ocr...@googlegroups.com.
To unsubscribe from this group, send email to ocropus+u...@googlegroups.com.
Is it not possible if data stored in RAM exceeds 1GB moved or transfer to Swap pragmatically OR data stored in RAM moved every seconds to the Swap and cleared up RAM for next processing the data. - I presumed that this can be done by suitable coding in the relevant source code in python. [...]
I am of view majority of users may not have higher end of computer having more than 4GB - resulting discourage them for usage of ocropus project.
Is it not possible if data stored in RAM exceeds 1GB moved or transfer to Swap pragmatically OR data stored in RAM moved every seconds to the Swap and cleared up RAM for next processing the data. - I presumed that this can be done by suitable coding in the relevant source code in python. [...]I am of view majority of users may not have higher end of computer having more than 4GB - resulting discourage them for usage of ocropus project.Users will generally only use OCRopus for recognizing text, and that works fine in 4GB of memory.
It is training that requires large amounts of memory, but users don't usually do that.
The motivation for sharing open source projects is so that people contribute source code to the project. The box training code that you are having problems with is indeed very memory inefficient. Since that seems to be a problem for you, I encourage you to improve the Python code and contribute the improvements.
Tom
--
You received this message because you are subscribed to the Google Groups "ocropus" group.
To post to this group, send email to ocr...@googlegroups.com.
To unsubscribe from this group, send email to ocropus+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msg/ocropus/-/udPhMuQpfRUJ.