I'm not super familiar with PDF/UA, but I suspect the answer is: yes
it's possible to read and extract that data you're after, but it might
not be neat.
I'm reasonably confident pdf-reader can parse all content in PDFs.
However, there's no helper methods for many features.
If you loop over the document and print some of the data on each page,
you might find what you're after. Possibly deeply nested in hashes and
arrays.
require 'pdf/reader'
pdf = PDF::Reader.new(ARGV[0])
pdf.pages.each do |page|
puts page.attributes.inspect
puts page.xobjects.inspect
puts page.raw_content
end
James
> --
> You received this message because you are subscribed to the Google Groups "PDF::Reader" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to
pdf-reader+...@googlegroups.com.
> To view this discussion on the web visit
https://groups.google.com/d/msgid/pdf-reader/61d72ba1-226e-40d9-9f5f-78b0676c9cd5n%40googlegroups.com.