Thanks. So I had a look at this, it's certainly problematic. There's no
way to read the data in a column like that. James touches on this here:
In this case, the delimiter is a space, which you can't really have.
The problem really is there's nothing in the PDF spec that says, here's a
column, instead everything is done based on coordinates. James does an
outstanding job in keeping this stuff together and presenting you with
The closest thing I could get with that test PDF was to convert it to an
HTML page, then use Nokogiri with xpath to put the columns into
different arrays. Once that's done, you could do what you want with the
data. However, there's no automatic export to HTML out of pdf-reader (I
manually did that for my test).