wade
Jun 23, 2010, 5:58:26 PM
  to pnwcode4lib
I'm looking for a kick-start idea or a simple solution for a particular
task, and thought of this pnwcode4lib group as a good resource.
I want to search our local web site for a particular text string, and
then scrape the content of the pages referenced by the search results
into a single, or multiple, text file(s).
More specifically, I want to search our local site for "faculty cv"
and/or "faculty resume", etc., grab any CV data posted (of which
there appears to be a lot), and massage that data into a rough starting
point for a bibliography of faculty publications.
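To make the task concrete, here is a rough sketch of the scraping half, using only the Python standard library. It assumes the list of result URLs has already been obtained from some site search, which is not shown here. The names PHRASES, html_to_text, mentions_phrase, fetch, and scrape_to_file are made up for illustration, and the phrase list just repeats the two examples above. It is a starting point under those assumptions, not a finished tool.

```python
import urllib.request
from html.parser import HTMLParser

# Hypothetical phrase list, copied from the examples in this post.
PHRASES = ("faculty cv", "faculty resume")


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def html_to_text(html):
    """Return the visible text of an HTML page, one chunk per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


def mentions_phrase(text, phrases=PHRASES):
    """Case-insensitive phrase check with whitespace collapsed."""
    lowered = " ".join(text.lower().split())
    return any(p in lowered for p in phrases)


def fetch(url):
    """Download a page and decode it using the declared charset if any."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def scrape_to_file(urls, out_path, fetch_page=fetch):
    """Write the text of every matching page to one file; return the count."""
    kept = 0
    with open(out_path, "w", encoding="utf-8") as out:
        for url in urls:
            text = html_to_text(fetch_page(url))
            if mentions_phrase(text):
                out.write("=== " + url + " ===\n" + text + "\n\n")
                kept += 1
    return kept
```

Calling scrape_to_file(list_of_urls, "cv_dump.txt") would produce a single text file with one labeled section per matching page. Writing one file per page instead would only mean opening the output file inside the loop. Turning the dump into bibliography entries is a separate step that this sketch does not attempt.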
I've done this kind of thing on individual pages via PHP.
But it almost seems like there would be a ready-made, Google API-based
tool for doing something like this.
Looking at the Google API now.
Meanwhile, if you have any suggestions, I'd love to hear them.
Wade Guidry
University of Puget Sound