Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

categorizing items in RSS feeds

4 views
Skip to first unread message

sam...@gmail.com

unread,
Dec 15, 2005, 3:10:36 PM12/15/05
to
I want to use AI::Categorizer to analyze items within RSS feeds.
Briefly, the data looks like this (with additional elements that I'm
not concerned about):

<rss><channel>
<item><title>this</title><description>more about
this</description></item>
<item><title>that</title><description>more about
that</description></item>
</channel></rss>

Each item is a separate "document", and at any given time a single file
contains a collection of items. Over time, I'll accumulate many
instances of a given feed, which will have overlapping items with
previous instances.

At first glance, I think I need to use AI::Categorizer::Document::XML
and maybe AI::Categorizer::Collection::SingleFile. Or do I have to
roll my own AI::Categorizer::Collection::XML class and maybe a special
document class to work with it?

Any advice before I start code-diving would be appreciated.

0 new messages