Updated Google Vision OCR API

169 views
Skip to first unread message

Christian Scheier

unread,
May 30, 2022, 4:25:24 AM5/30/22
to cloud-vision-discuss
Hi,

we notice very different (and for us: pipeline breaking) changes when extracting paragraph bounding boxes in "stable" vs "legacy".

"legacy" works as expected but "stable" returns rather strange bounding boxes. "Stable" appears to maximize (vertical) bounding box size, while "legacy" perfectly matches with expected results (bounding box per paragraph). Example included below.

Word-level bounding boxes still work as expected.

It would be extremely helpful to learn whether this behavior is indeed intended, or what we would do to recover "legacy-like"-behavior for paragraphs.
Thank you
Christian Scheier



Screenshot 2022-05-30 102321.jpg
Message has been deleted
Message has been deleted
Message has been deleted

Big Data Mining

unread,
Aug 8, 2022, 11:16:39 AM8/8/22
to cloud-vision-discuss
We are facing the same problems!!!! The legacy model will not be avaiable after August 20 and this huge big problems are not solved! I cannot believe that Google is oferring a product ready for production with this kind of versioning issues, bad communcation with the community and not supporting background compatibility.
Reply all
Reply to author
Forward
0 new messages