text detection in vison api is not respecting line segmentation

504 views

Skip to first unread message

surbhi suman

unread,

Mar 30, 2021, 8:27:59 AM3/30/21

to cloud-vision-discuss

Hi ,

i m new to to cloud vision api , i am trying to processing receipt having inter word space is greater than normal , and after text detection what i found is vison api is not respecting line segmentation ,

can some one guide how does vision api work in backend to detect words in a line .

Monica (Google Cloud Platform)

unread,

Apr 22, 2021, 5:48:44 PM4/22/21

to cloud-vision-discuss

Vision API first performs a layout analysis on the image to segment the location of the text. After the general location is detected, the OCR module then performs a text recognition analysis on the specified location to generate the text. Finally, errors are corrected at a post-processing step by feeding it through a language model or dictionary. You can find more details here.

If you are using document_text_detection you might try text_detection and see if you get better performances. There might be some factors contributing for this issue (like poor image quality or slight angle orientation of the text) and it would be advisable to open a case with support or create a public issue providing your use setup and a sample document (non-confidential).

Since you are handling receipts you might try to test Document AI or Procurement DocAI that are specialized in analyzing invoices.

Reply all

Reply to author

Forward

0 new messages