How to read the data in doc. and pdf

57 views
Skip to first unread message

Suresh Kumar

unread,
Apr 21, 2014, 1:45:22 AM4/21/14
to seleniu...@googlegroups.com
How to read the data present in the word document and PDF file using selenium web driver with fire-fox.

Krishnan Mahadevan

unread,
Apr 21, 2014, 1:55:40 AM4/21/14
to Selenium Users
Short answer : You can't. WebDriver interacts with web pages that are loaded within a browser and NOT pdfs.

You may have to figure out ways of downloading your pdf to your local file system and then use libraries such as pdfbox to parse it.

I would recommend that you search through this forum as well. This has been answered in the past.

Thanks & Regards
Krishnan Mahadevan

"All the desirable things in life are either illegal, expensive, fattening or in love with someone else!"
My Scribblings @ http://wakened-cognition.blogspot.com/
My Technical Scribbings @ http://rationaleemotions.wordpress.com/


On Mon, Apr 21, 2014 at 11:15 AM, Suresh Kumar <suri...@gmail.com> wrote:
How to read the data present in the word document and PDF file using selenium web driver with fire-fox.

--
You received this message because you are subscribed to the Google Groups "Selenium Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to selenium-user...@googlegroups.com.
To post to this group, send email to seleniu...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/selenium-users/a1d7c3aa-9d6a-4c77-9e3d-445226ab9304%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Jordan Trilles

unread,
Apr 22, 2014, 2:24:48 AM4/22/14
to seleniu...@googlegroups.com
Hi,

Since Firefox now integrate pdf.js and can read PDF without  a plugin it should be possible to retrieve some data. But the DOM should be horrible to navigate.

Cristallix
Reply all
Reply to author
Forward
0 new messages