Very sorry if this has been asked and answered numerous times. I did try to search on various keywords.
We are wanting to OCR some PDFs and then use
https://tabula.technology/ to grab the tables. No other OCR capability comes close to Google Vision, in our opinion.
If we can't OCR the PDFs (in place) with Google Vision, does anyone have any ideas about how to convert the JSON back into a table? We get the x,y is in JSON, but that is for every single word.
We were kind of hoping there was some software out there that will take JSON outputs and recreate the input image, with structure that we can then scrape using a different technology.
Again, sorry if this has been asked and answered a bunch. You might have a link to the exact post that will answer my question(s).
Thanks! And congrats on Google Vision. The OCR output is really stunning!