Jun 23, 2010, 5:58:26 PM6/23/10
Looking for a kick start idea or a simple solution for a particular
task, and thought of this pnwcode4lib group as a good resource.
I want to search our local web site for a particular text string, and
then scrape the content of the pages referenced by the search results
into a single, or multiple, text file(s).
More specifically, I want to search our local site for "faculty cv"
and / or "faculty resume", etc, grab any CV data posted (of which
there appears to be lots), and massage that data into a rough starting
point for a bibliography of faculty publications.
I've done this kind of thing on individual pages via PHP.
But almost seems like there would be a ready-made Google API-based
tool for doing something like this.
Looking at the Google API now.
Meanwhile, if you have any suggestions, I'd love to hear them.
University of Puget Sound