Firebase ML KIT missing characters


Patrick Questembert

Jun 3, 2018, 11:58:49 AM
to Firebase Google Group
I have noticed fairly frequent cases where ML KIT seems to completely miss characters, usually isolated ones. For example, take a set of prices aligned in a column with an extra-large space left of the decimal point, e.g. "3  .75" with "4  .12" below it. In that case ML KIT may return the ".75" and the ".12" below it in one block, but it won't include the "3" and "4" anywhere in the results. This happens with the Google Cloud OCR as well.

I attached an example: ML KIT misses the "5" of the first price, the "2" of the 2nd price, and the "2" in "2 .12" a few lines below.
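
As a rough way to spot these cases in the recognized text, here is a minimal sketch that flags tokens consisting only of a decimal fraction such as ".75". It assumes the OCR results have already been flattened to a plain list of strings; the `tokens` list and the `find_truncated_prices` name are hypothetical and not part of the ML KIT API.

```python
import re

# Matches a token that is only a two-digit decimal fraction, e.g. ".75",
# with no digit before the dot. The two-digit assumption fits prices only.
FRACTION_ONLY = re.compile(r"^\.\d{2}$")


def find_truncated_prices(tokens):
    """Return the tokens that look like prices missing their integer part."""
    stripped = (t.strip() for t in tokens)
    return [s for s in stripped if FRACTION_ONLY.match(s)]


# Hypothetical OCR output, flattened to a list of strings.
tokens = ["GST", ".75", "2.00", ".12", "TOTAL"]
print(find_truncated_prices(tokens))
```

This only detects likely drops; it cannot recover the missing digit, which is simply absent from the results.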

Does this sound familiar? Any plans to fix this?

Thanks!
GST 3.jpg

Pannag Sanketi

Jun 4, 2018, 12:58:58 PM
to fireba...@googlegroups.com
Thanks Patrick.
Our eng teams will take a look at this and get back to you!

Best,
Pannag


Patrick Questembert

Jun 12, 2018, 12:39:34 PM
to Firebase Google Group
Hey Pannag,

Just wanted to clarify that this is not an isolated case - I have seen this relatively often. Also, I should point out that I am talking about cases where the characters are sharp and have good contrast. There appears to be some confusion in how these characters are grouped into words within blocks, which sometimes causes them to be dropped.

Do let me know if you need other examples.

Thanks!

Pannag Sanketi

Jun 12, 2018, 12:44:00 PM
to fireba...@googlegroups.com
Thanks a lot, Patrick!
Likely this is an issue with our OCR model. I have forwarded the issue to the team and they are investigating it.
Do you have an example image that I can give to them? 

Thanks,
Pannag


Patrick Questembert

Jun 12, 2018, 2:59:14 PM
to Firebase Google Group
Hi Pannag,

See the attached image - it is the same one I attached originally in this thread, just cropped (smaller). ML KIT misses the first digit of the top two prices (".75" instead of "5.75" and ".00" instead of "2.00"). I will forward additional cases soon.

Patrick
B9C77CCB-965D-4D0A-A23E-88BD663A8E55.jpeg

Pannag Sanketi

Jun 12, 2018, 8:06:25 PM
to fireba...@googlegroups.com
Oh, sorry, I missed that. Thanks a lot, Patrick.

