Spidr 0.2.1 released!

15 views
Skip to first unread message

postmodern

unread,
Nov 30, 2009, 12:41:27 PM11/30/09
to sp...@googlegroups.com
I'm a little late on sending this announcement to the list, but here it
is anyways.

Spidr 0.2.1 has been released. This release contains a couple important
changes and a lot of new convenience methods.

* Fixed a bug where Spidr::Agent#delay was not being used to delay
requesting pages.
* Spider link and script tags in HTML pages (thanks Nick Plante).
* Added Spidr::Events#every_ok_page.
* Added Spidr::Events#every_redirect_page.
* Added Spidr::Events#every_timedout_page.
* Added Spidr::Events#every_bad_request_page.
* Added Spidr::Events#every_unauthorized_page.
* Added Spidr::Events#every_forbidden_page.
* Added Spidr::Events#every_missing_page.
* Added Spidr::Events#every_internal_server_error_page.
* Added Spidr::Events#every_txt_page.
* Added Spidr::Events#every_html_page.
* Added Spidr::Events#every_xml_page.
* Added Spidr::Events#every_xsl_page.
* Added Spidr::Events#every_doc.
* Added Spidr::Events#every_html_doc.
* Added Spidr::Events#every_xml_doc.
* Added Spidr::Events#every_xsl_doc.
* Added Spidr::Events#every_rss_doc.
* Added Spidr::Events#every_atom_doc.
* Added Spidr::Events#every_javascript_page.
* Added Spidr::Events#every_css_page.
* Added Spidr::Events#every_rss_page.
* Added Spidr::Events#every_atom_page.
* Added Spidr::Events#every_ms_word_page.
* Added Spidr::Events#every_pdf_page.
* Added Spidr::Events#every_zip_page.

The new every_* methods will only pass specific types of Spidr::Page
objects to the given block. These methods should help group your page
parsing logic into separate blocks.

Also, as of 0.2.1 documentation for Spidr is now available on the brand
new yardoc.com.
http://yardoc.com/docs/postmodern-spidr

signature.asc
Reply all
Reply to author
Forward
0 new messages