Hi all,
I want to announce OpenOCR:
It has everything you need to run Tesseract in the cloud behind a REST API, and it was designed with horizontal scalability in mind.
I created this because I wanted to integrate OCR functionality in a mobile app, but I wanted the image processing to happen in the cloud rather than on the device, and I couldn't find any free/cheap OCR-as-a-service providers. So I figured I'd build my own.
I decided to open source everything, including all the glue code that wraps Tesseract and handles the RabbitMQ job queuing. I'm hoping that other people will find bugs and contribute fixes, which will help improve the codebase for everyone using it.
It uses docker for the containerization / virtualization, so it can run on docker-aware PaaS's, or on Amazon EC2 machines running docker.
If you play around w/ it, please get in touch or file a github issue.