Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
Ruby PDF text extractor
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  5 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Kevin Olbrich  
View profile  
 More options Aug 13 2005, 1:01 pm
Newsgroups: comp.lang.ruby
From: "Kevin Olbrich" <kevin.olbr...@duke.edu>
Date: Sun, 14 Aug 2005 02:01:06 +0900
Local: Sat, Aug 13 2005 1:01 pm
Subject: Ruby PDF text extractor
I notice that Ruby has lots of tools for creating PDF files, are there any
that let you extract text from a PDF file?

_Kevin


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Austin Ziegler  
View profile  
 More options Aug 13 2005, 1:45 pm
Newsgroups: comp.lang.ruby
From: Austin Ziegler <halosta...@gmail.com>
Date: Sun, 14 Aug 2005 02:45:10 +0900
Local: Sat, Aug 13 2005 1:45 pm
Subject: Re: Ruby PDF text extractor
On 8/13/05, Kevin Olbrich <kevin.olbr...@duke.edu> wrote:

> I notice that Ruby has lots of tools for creating PDF files, are there any
> that let you extract text from a PDF file?

Not yet. PDF::Writer will be refactored a little bit for version 2.0
(coming out later this year) so that it will be three separate
components: PDF::Core (the core objects representing a PDF object in
memory, as well as rendering), PDF::Writer (the writer/layout code),
and PDF::Reader (read a PDF object into an in-memory representation).
Much of the code to do PDF::Core is already in place (it's currently
called PDF::Writer::Object or PDF::Writer::Objects), but there's
nothing explicitly present to represent this.

PDF::Reader will probably be released in early 2006, depending on how
long it takes to refactor the code that already exists, properly
extend it, and get the necessary PDF::Writer code finished.

-austin
--
Austin Ziegler * halosta...@gmail.com
               * Alternate: aus...@halostatue.ca


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Kevin Olbrich  
View profile  
 More options Aug 13 2005, 1:59 pm
Newsgroups: comp.lang.ruby
From: "Kevin Olbrich" <kevin.olbr...@duke.edu>
Date: Sun, 14 Aug 2005 02:59:13 +0900
Local: Sat, Aug 13 2005 1:59 pm
Subject: Re: Ruby PDF text extractor
Thanks, I'll keep my eyes open for it.

_Kevin


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Andreas Schrafl  
View profile  
 More options Aug 16 2005, 7:53 pm
Newsgroups: comp.lang.ruby
From: Andreas Schrafl <aschr...@ywesee.com>
Date: Wed, 17 Aug 2005 08:53:12 +0900
Local: Tues, Aug 16 2005 7:53 pm
Subject: Re: Ruby PDF text extractor
I once wrote a Ruby PDF Text extractor while workin at ywesee.

I tought they released it on rubyforge but I can't find it anymore.
perhaps if you contact them they can help you.
www.ywesee.com

Greetings
Andy


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Martin DeMello  
View profile  
 More options Aug 17 2005, 6:18 am
Newsgroups: comp.lang.ruby
From: Martin DeMello <martindeme...@yahoo.com>
Date: Wed, 17 Aug 2005 10:18:54 GMT
Local: Wed, Aug 17 2005 6:18 am
Subject: Re: Ruby PDF text extractor

Austin Ziegler <halosta...@gmail.com> wrote:

> PDF::Reader will probably be released in early 2006, depending on how
> long it takes to refactor the code that already exists, properly
> extend it, and get the necessary PDF::Writer code finished.

I'd be interested in helping with this.

martin


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »