Thanks!
> Hmm, so.. If one wanted to consume _all_ of blogger alerts, any
> mechanics in pshb to make that 'easy' as a subscription? Is it
> possible to issue a "give me everything you've got" type of request?
The "firehose": There's nothing like this for blogger yet. Easiest way
to get something close to this is to crawl the site for feed links and
subscribe to them all. In the future it may make sense to have an
aggregated combined feed, similar to the "Atom Stream" that Six Apart
runs, with an Atom-aggregated view of all updates.
-Brett
Doesn't blogger still publish the changes.xml updates pages? Not a flow, but a regularly changing document
On Aug 14, 2009 11:04 AM, "Brett Slatkin" <bsla...@gmail.com> wrote:On Fri, Aug 14, 2009 at 6:27 AM, igrigorik<il...@aiderss.com> wrote: > > Now that is awesome! Great ...
Thanks!
> Hmm, so.. If one wanted to consume _all_ of blogger alerts, any > mechanics in pshb to make that ...
I'm not sure if there's a blogger-specific one, but Google Blog Search
has one here:
http://www.google.com/help/blogsearch/pinging_API.html#xml-format
--David
I talked briefly with Tatsuhiko about this last week, and I think this
part of the spec makes it possible:
http://pubsubhubbub.googlecode.com/svn/trunk/pubsubhubbub-core-0.1.html#aggregatedistribution
Should be very close to how Atom Stream works, but in reverse. The
hard part is figuring out how to properly throttle pushing to clients
who fall behind. The nice thing about separate HTTP posts is you can
very easily get a success/response status from the subscriber and then
know if you should drop the packet, retry, etc. I'd assume Atom Stream
does this right now, but with TCP throttling indicators.
--David
Adding hub-managed queries to the topic model in PSHB would more or
less force the introduction of a way to cache the results of queries
... i.e. to introduce queues. This could break the lightweight
'pubsub only' model that IMHO helps to make PSHB interesting.
A better way is for hubs to remain silent about how they manage state,
enabling people to choose between:
a) endpoint based approaches, eg Brett's below
b) middleman approaches, eg sticking MQ under PSHB like RabbitHub does
HTH
alexis
The blogger one is still there too at:
http://www.blogger.com/changes.xml
not sure of its update frequency.
A query-based service could also generate a feed itself, that could
send into a hub too. I agree that applying queries to hubs seems
problematic, but if you think of an AtomPub-like query feed - eg
Google Contacts - in principle it could ping a hub whenever that query
adds a new result that matches, so addressbook updates could flow over
PSHB rapidly too, rather than rely on polling sync.
On Sat, Aug 15, 2009 at 9:17 PM, Bob Wyman<bobw...@gmail.com> wrote:
>
> The Google Contacts example would only be reasonable if, in fact, the query
> syntax was limited in expressiveness. If, for instance, the only thing I
> could say was: "Update whenever <username's> details change." then the
> maximum number of duplicate entries is small. (You might have one entry on
> the "all entries" fire-hose feed and one on a username specific query feed.)
> However, if you increase the expressiveness of the query language you will
> rapidly get to the point where you have a large number of duplicates again.
> (i.e. queries for each of fullname, firstname, lastname, street, city, and
> an infinite number of geographical proximity queries...) Thus, the problem
> is related to the expressiveness of the query language supported.
Just one side question: do you mean 'duplicates' in the sense that any
given subscriber may see more than one copy of a given message,
because more than one query generates the same result? Or, are you
referring to some other issue - eg. duplication across two users.
alexis