PDF File parsing


Saidinesh Narisetti

Aug 4, 2016, 5:38:56 AM
to Gauteng Python User Group
Hi All,

Is there any way to parse a PDF file? Are there any packages that help with parsing PDF files? Any suggestions are appreciated.

Thanks,
Dinesh.  

Adam Piskorski

Aug 4, 2016, 5:53:26 AM
to Gauteng Python User Group
Hello

I know that Willem Hoek knows, as he gave a lightning talk on that very subject two months ago. Let me know if you need me to contact him for you.

Ludolph Neethling

Aug 10, 2016, 9:08:53 AM
to gpu...@googlegroups.com
> Is there any way to parse a PDF file? Are there any packages that help with parsing PDF files? Any suggestions are appreciated.

I recently used pdfquery[0] with success to parse a dentist bill. It takes a bit of hacking to get used to the pdf.pq selectors, and a lot of XML interpretation to understand how to extract info from the PDF.
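
For anyone who wants a starting point, here is a rough sketch of the selector approach. It follows the pattern shown in the pdfquery README: find a label line, read its coordinates, then query a bounding box next to it. The helper names (bbox_selector, text_below_label), the box size, and the file name and label in the usage note are assumptions for illustration, not taken from the actual bill, so expect to adjust them for your own PDF.

```python
def bbox_selector(x0, y0, x1, y1, tag="LTTextLineHorizontal"):
    """Build a pdfquery selector for `tag` elements inside a bounding box.

    Hypothetical helper: it only formats the selector string.
    """
    return '%s:in_bbox("%s, %s, %s, %s")' % (tag, x0, y0, x1, y1)


def text_below_label(pdf_path, label, width=150, height=30):
    """Return the text just below the line containing `label`, or None.

    `width` and `height` are guessed box sizes in PDF points; tune them
    for the layout of your document.
    """
    import pdfquery  # third-party package: pip install pdfquery

    pdf = pdfquery.PDFQuery(pdf_path)
    pdf.load()  # parse the PDF into an XML-like element tree

    # Find the text line that contains the label.
    found = pdf.pq('LTTextLineHorizontal:contains("%s")' % label)
    if not found:
        return None

    # Coordinates are stored as string attributes on the element.
    left = float(found.attr("x0"))
    bottom = float(found.attr("y0"))

    # Query the area directly under the label.
    selector = bbox_selector(left, bottom - height, left + width, bottom)
    return pdf.pq(selector).text()
```

Usage would look something like text_below_label("bill.pdf", "Total due"). To work out which selectors to write, it can help to dump the tree with pdf.tree.write("bill.xml", pretty_print=True) after pdf.load() and read through the XML.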

Regards,
