I noticed this gem has problems parsing some PDFs where the text is not necessarily clean.
For instance, this file:
https://www.jstor.org/stable/3684663 (I can send it to you separately if you can't download it online)
Some parts of it get output like: "a b o u t a r e g r e s s i o n t o o r i g i n a l c h a o s"
However, it doesn't seem like it's inherently a problem with the file, because Python's PyPDF2 reads it correctly as "about a regression to original chaos"
Do you think there is some step that this reader is missing? Or alternatively is there some option I should set when using the PDF::Reader to get it to read the pdfs better?