Martin Blais <bl...@furius.ca> writes:
> These days for OCR I think you can just download a free vision model
> from Hugging Face and run it locally and it would work.
> I remember doing that in the recent past.
I imagine something like that just returns a big blob of plain text,
amirite? Or can you have it return the data to you in a more structured
format, with specific fields (key-value pairs) more relevant to a
receipt?
With this Veryfi API, they are returning JSON with very specific fields
(e.g., card number, date, payee, tax, tip, total, and even individual
receipt lines, including UPCs when available), and it seems to be
very accurate so far for me.
When I played with other general OCR tools in the past, I remember the
OCR itself was only half the battle. Even if you got that accurate, you
then had to write regexes to pull all this other specific info out of
the resulting plain text.
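
To illustrate the kind of regex work I mean, here is a minimal sketch in
Python. Everything in it is an assumption for illustration: the sample
OCR text is made up, the field names are my own, and the patterns only
fit this one layout. It is not what any particular OCR tool or API does.

```python
import re
from decimal import Decimal

# Made-up OCR output for illustration; real receipts vary widely in layout.
OCR_TEXT = """\
ACME GROCERY
2024-03-15 14:22
BANANAS        1.29
MILK 2%        3.49
TAX            0.38
TOTAL          5.16
VISA ************1234
"""

# A money amount such as 5.16 (hypothetical helper pattern).
AMOUNT = r"(\d+\.\d{2})"


def extract_fields(text):
    """Pull a few receipt fields out of plain OCR text; missing ones are None."""
    patterns = {
        "date": r"(\d{4}-\d{2}-\d{2})",
        "tax": rf"^TAX\s+{AMOUNT}\s*$",
        "total": rf"^TOTAL\s+{AMOUNT}\s*$",
        "card_last4": r"\*+(\d{4})\b",
    }
    fields = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, text, flags=re.MULTILINE)
        fields[name] = match.group(1) if match else None
    # Convert money fields to Decimal so later arithmetic avoids float rounding.
    for name in ("tax", "total"):
        if fields[name] is not None:
            fields[name] = Decimal(fields[name])
    return fields


print(extract_fields(OCR_TEXT))
```

Each of those patterns breaks as soon as a store prints "Sales Tax"
instead of "TAX" or formats the date differently, which is why getting
structured fields back directly is so much less work.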
--
Cheers,
TRS-80