Hi all,
I want to use
http://godoc.org/rsc.io/pdf, but I'm not very familiar with the data structures of pdf files, and I'm primarily looking to write something quickly (and don't really want to learn those low-level pdf details... I just want to use the mentioned go API. However, just from reading the API doc it is not trivial to figure out how things actually work.
Here is what I want: I want to check a whole bunch of pdf files
- that they are within a specified page limit. This is easy.
- that all fonts are embedded in the pdf file. (so far I've been able to find some fonts, but not all, still experimenting...).
- extract certain text elements, e.g. first author last name and paper title. (These should be in the same place in all files.)
I'll be playing around with the API, but I would appreciate it if someone could point me to a few examples using the API. @Russ: Maybe it would be worth adding a few Examples to the API doc, just to demonstrate what can be done with it.
Thanks,
:) Hein