Edgar Madrigal <zai...@gmail.com>: Jan 22 01:44PM -0800
The function extract
https://pkg.go.dev/github.com/heussd/pdftotext-go#Extract actually says: Extract
PDF text content in simplified format
That might mean it will return text only and not
...more
|
Mike Schinkel <mi...@newclarity.net>: Jan 22 10:25PM -0500
Hi Hugh,
I have been planning to do some Go work with PDF files, so your email triggered me to do some research.
Not sure it using heussd/pdftotext-go is critical to you, or if you are just
...more
|
Hugh Myrie <hugh....@gmail.com>: Jan 23 07:29AM -0500
Hi Mike,
Thanks for the suggestion! I'm interested in checking out your forked code.
It seems like a good alternative to what I'm currently using.
Hugh
...more
|
Michael Bright <mjbri...@gmail.com>: Jan 23 09:17AM -0800
Hi Mike,
Not wanting to suggest that you take the Python route, but just sharing my
experience.
I've tried Acrobat Reader's "Save as Text" functionality, and also one or
two Python libraries
...more
|
robert engels <ren...@ix.netcom.com>: Jan 23 11:55AM -0600
You typically can’t convert a PDF to text and do what you are trying to do.
Look for PDF to XML converters - you need the “blocks” and the hierarchy in order to interpret most PDFs with any
...more
|
Sharon Mafgaoker <sha...@cloud5.co.il>: Jan 23 08:56PM +0200
Hey,
I’m using
https://cloud.google.com/document-ai
I’m sending my pdf and getting back extracted text json object.
Work fast and not expensive 🙏
I hope this will help you .
...more
|