Can Google's OCR convert a PDF to HTML code?

carmen

Aug 17, 2013, 12:03:26 PM
to mak...@googlegroups.com
Does anyone know if it is possible for Google's OCR to take a PDF of a mocked up web page and convert it to basic HTML with divs, spans, etc? Someone is telling me it can be done but I have not found anywhere that tells me how.

John Farrell Kuhns

Aug 17, 2013, 12:24:49 PM
to mak...@googlegroups.com

Carmen:

I don't know, but by the very definition of OCR I doubt it could handle anything other than text-based content; graphics would not translate. I have all the newest Adobe products, so let me see whether there are any commands within Acrobat that would work.

 

This electronic message transmission contains information that may be proprietary, confidential and/or privileged. The information is intended only for the use of the individual(s) or entity named above. If you are not the intended recipient, be aware that any disclosure, copying or distribution or use of the contents of this information is prohibited. If you have received this electronic transmission in error, please immediately return it to the sender and delete it from your system. Thank you.

John Farrell Kuhns

H.M.S. Beagle

j...@hms-beagle.com

816-587-9998

180 English Landing Drive - Parkville, Missouri 64152

www.hms-beagle.com


John Farrell Kuhns

Aug 17, 2013, 12:30:28 PM
to mak...@googlegroups.com

Carmen:

The newest Acrobat can save a PDF as an HTML document. If you have a PDF you'd like me to try converting, send it as an attachment.

 
