generates a MalformedPDFError when I try to access the 'text' attribute of any page, even though it seems perfectly viewable in Acrobat Reader and in Preview (Mac).
In fact, if I open it with Preview and then export it under a different filename (without making any changes), the exported file then seems to be readable.
FWIW, this is not a major problem for me; I expect to get the information I need some other way, but I wanted to provide the example. The file is generated by FINRA (www.finra.org), which creates plenty of PDFs with valuable data, so if there is something to set them straight on, it might be useful.
Thanks for a great project!
James Healy
Dec 30, 2020, 7:33:50 AM
to pdf-r...@googlegroups.com
Thanks for sending this through.
I had a look at the sample file, and Ruby's zlib bindings were
refusing to inflate some of the compressed data.
I'm not familiar enough with the zlib format to know the root cause
here. Maybe the program that wrote that file added a garbage 0x01 byte
by accident?
In any case, my general rule is that if Adobe can parse a file and
pdf-reader can't, it's a pdf-reader bug. I've merged the above PR so
your files can be read.
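
For anyone following along, the failure mode described above can be sketched roughly as follows. This is only an illustration, assuming the problem really is a single stray trailing byte; `tolerant_inflate` is a hypothetical helper name, not part of pdf-reader's API, and it is not necessarily what the merged PR does. It tries to inflate the data as-is and, if zlib refuses, retries once with the last byte dropped.

```ruby
require "zlib"

# Hypothetical helper (not pdf-reader's API): inflate zlib-compressed
# data, tolerating one trailing garbage byte.
# Returns the inflated string, or nil if neither attempt succeeds.
def tolerant_inflate(data)
  return nil if data.nil? || data.empty?

  # First candidate: the data exactly as found in the file.
  # Second candidate: the same data with the final byte removed.
  candidates = [data, data.byteslice(0, data.bytesize - 1)]

  candidates.each do |candidate|
    next if candidate.empty?

    begin
      return Zlib::Inflate.inflate(candidate)
    rescue Zlib::Error
      # zlib rejected this candidate; fall through to the next one.
      next
    end
  end

  nil
end
```

Returning nil rather than raising leaves it to the caller to decide whether an unreadable stream should be reported as a malformed PDF or skipped.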
to PDF::Reader
Thanks so much for the quick turnaround! I will need to find some time to build from source and test, but it sounds like you've identified a straightforward issue.
Thanks again for your work on this project. Super useful!
Alba Hoxha
Jul 9, 2024, 5:08:05 AM