Uncompressing PDF

51 views
Skip to first unread message

hoffmannick

unread,
Apr 5, 2010, 12:45:12 PM4/5/10
to origami-pdf
Hello all,
Great work on the project so far! Is there a module (or plan to build
one) that can uncompress a pdf in the same manner as pdftk or
pdfparser?

Fred Raynal

unread,
Apr 6, 2010, 9:31:42 AM4/6/10
to origa...@googlegroups.com


Hello,

What do you mean exactly?

For instance, we have done a script called pdfscan.rb. It analyzes the structure of a pdf file, once it has been sanitized. We should probably improve this script.

Otherwise, in order to make your own scripts, you can simply "read" a file:

pdf = PDF.read(ARGV[0])

This function will remove all filters put in the objects, thus the complete file will be sanitized. By "sanitize", I mean strange encoding is removed, and filters like /ASCIIHex, LZW compression or even encryption with an empty password.

Is this the information you were looking for?

Regards,

fred

Fred Raynal

unread,
Apr 6, 2010, 9:33:54 AM4/6/10
to origa...@googlegroups.com
Reply all
Reply to author
Forward
0 new messages