A few days ago there was a post regarding OCR software using Drive API for Windows.
I have been using a python script for this purpose on Linux for the past few years. I've turned that into a pip package.
$ pip install google_drive_ocr
It will install an executable `google-ocr`
Some use cases,
$ google-ocr -i image.png
$ google-ocr -b image1.png image2.png image3.png
$ google-ocr --image-dir images/ --extension .jpg --workers 4
Core workflow is,
* Upload an image "as a google document" to Google Drive
* Upon doing this, google adds the OCR layer
* Download google document as plain text
A primary difficulty in using this might be setting up a project on Google Cloud platform and downloading a client_secret.json file.
Once done, client_secret.json file needs to be provided
in every call. (To avoid this, one can do
$ google-ocr --client-secret /full/path/to/client-secret.json --write-config ~/.gdo.cfg
Once that is done, next time onwards no --client-secret option is required.
Regards,