Converting PDF To Text From Node.Js In Memory

931 views
Skip to first unread message

Joseph Koziatek

unread,
Apr 3, 2014, 10:02:54 AM4/3/14
to nod...@googlegroups.com
Hello all,

Is there a way to convert an in memory binary string (pdf content) to text from within node.js?
I receive pdf content through a web request and have the pdf in memory as a string object.
I see there are many tools available that operate from system calls but I was wondering
if there is anything to convert the pdf totally in memory without spawning a system call..

Thanks In Advance

Joe

Matt

unread,
Apr 8, 2014, 12:02:13 PM4/8/14
to nod...@googlegroups.com
The short answer is no, there isn't. But you can do it streaming through child processes (i.e. without writing anything to disk).

The longer answer is you can probably do something with a lot of work using node-ffi, but it probably won't be worth the effort.


--
--
Job Board: http://jobs.nodejs.org/
Posting guidelines: https://github.com/joyent/node/wiki/Mailing-List-Posting-Guidelines
You received this message because you are subscribed to the Google
Groups "nodejs" group.
To post to this group, send email to nod...@googlegroups.com
To unsubscribe from this group, send email to
nodejs+un...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/nodejs?hl=en?hl=en

---
You received this message because you are subscribed to the Google Groups "nodejs" group.
To unsubscribe from this group and stop receiving emails from it, send an email to nodejs+un...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Floby

unread,
Apr 9, 2014, 3:05:57 AM4/9/14
to nod...@googlegroups.com
There's this article about using node.js to parse PDFs

You may also be interested in https://github.com/mozilla/pdf.js/ even though it is meant to run in the browser
Reply all
Reply to author
Forward
0 new messages