Hi Tanner,
You’re right, it is in fact advised to inspect and pre-process any images submitted to the Cloud Vision API in order to improve efficiency, accuracy and response time. A good set of recommendations are provided on this GCV Best Practices page.
Here’s a list of the main guidelines that would apply to using the OCR feature of this service:
It is recommended that your submitted image be of any non lossy file types among the supported GCV image types.
In case the image is submitted via a JSON request, it would be necessary to encode the image file into base64 prior sending it to the service.
The image sizing section advises the use of images with at least 1024 x 768 pixels.
In this last section, it is also recommended that images be of a sufficient size so that important features within the request can be easily distinguished
Sorry for the delay, image preprocessing is a good practice on the client side with regards to the large set of possible image settings that can be submitted to the API. In fact, it would be particularly important to remove as much noise as possible. In the meantime, I would recommend testing them in order to establish the best set of preprocessing steps and so, make your receipts images return uniformly the most optimal results with the API’s OCR feature.
I received feedback from the backline team and no recommendations for image pre-processing were evoked, as these steps (including deskewing) were already performed internally. Still a recommendation was given regarding making sure that the characters to be detected figure among the dominant objects on the image.
Hope this information helps and feel free to share your observations in case you would want to test any pre-processing steps.
Regards,
Alex