Re: [go-nuts] Abridged summary of golang-nuts@googlegroups.com - 6 updates in 1 topic

52 views
Skip to first unread message

Hugh Myrie

unread,
Jan 24, 2025, 12:48:08 AMJan 24
to golang-nuts
Hi there,

Thank you for the suggestions. I'll explore the Document AI as a potential solution and will look into PDF to XML converters as well.

Best regards,
Hugh




On Thu, Jan 23, 2025, 4:35 PM <golan...@googlegroups.com> wrote:
Edgar Madrigal <zai...@gmail.com>: Jan 22 01:44PM -0800

The function extract
https://pkg.go.dev/github.com/heussd/pdftotext-go#Extract actually says: Extract
PDF text content in simplified format
That might mean it will return text only and not ...more
Mike Schinkel <mi...@newclarity.net>: Jan 22 10:25PM -0500

Hi Hugh,
 
I have been planning to do some Go work with PDF files, so your email triggered me to do some research.
 
Not sure it using heussd/pdftotext-go is critical to you, or if you are just ...more
Hugh Myrie <hugh....@gmail.com>: Jan 23 07:29AM -0500

Hi Mike,
 
Thanks for the suggestion! I'm interested in checking out your forked code.
It seems like a good alternative to what I'm currently using.
 
Hugh
 
...more
Michael Bright <mjbri...@gmail.com>: Jan 23 09:17AM -0800

Hi Mike,
 
Not wanting to suggest that you take the Python route, but just sharing my
experience.
 
I've tried Acrobat Reader's "Save as Text" functionality, and also one or
two Python libraries ...more
robert engels <ren...@ix.netcom.com>: Jan 23 11:55AM -0600

You typically can’t convert a PDF to text and do what you are trying to do.
 
Look for PDF to XML converters - you need the “blocks” and the hierarchy in order to interpret most PDFs with any ...more
Sharon Mafgaoker <sha...@cloud5.co.il>: Jan 23 08:56PM +0200

Hey,
 
I’m using
https://cloud.google.com/document-ai
 
I’m sending my pdf and getting back extracted text json object.
 
Work fast and not expensive 🙏
 
I hope this will help you .
...more
You received this digest because you're subscribed to updates for this group. You can change your settings on the group membership page.
To unsubscribe from this group and stop receiving emails from it send an email to golang-nuts...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages