I'd be interested in hearing what kinds of use cases people have for
something like this.The first prototype I developed on top of it was a
"More like this" feature to pull in similar items. Maybe it's
something as simple as seeing what topics different correspondents
cover, comparing bylines over time, etc..
Hope someone finds this interesting.
Chris
Thanks so much for doing this. I think one of the great strengths of
your using Solr is its ability to get beyond the 20 item per request
limitation.
I have two ideas that I would like to work on. I think the NPR API
needs a "editorial layer" where you can generate a large set of
results, then "curate" them according to the relevance to the story or
topic that you are hoping to show. I have a use case with the NPR
Timeline widget that I'd be glad to talk about with you further.
I also have a second idea that I was intending to work on here:
http://code.google.com/p/nprmusicmegaphone/
I call it the NPR Music Megaphone:
NPR Music Megaphone aims to create a seamless listening experience for
NPR Music content. The goal is to leverage the NPR API and web
standards like xspf (and eventually HTML5) to provide a continuous
music/interview stream based on user preferences.
Starting with the NPR API Reference:
http://www.npr.org/api/inputReference.php
I am thinking of creating a web-based music station with:
Choice of Genres:
http://api.npr.org/list?id=3018
Choice of Artists:
http://api.npr.org/list?id=3009
Then, to create xml parsers (and an xml to xspf parser).
Specifically, I will want to look for links to m3u files (among other
resoureces) in a feed similar to this:
http://api.npr.org/query?id=15404041&fields=title,teaser,storyDate,audio,album,artist&output=NPRML&apiKey=MDAxNzgwMDQ5MDEyMTQ4NzYyMjU4YmY1Yw004
Then, in the browser, use the Wax Mp3 player to play the audio:
http://wiki.github.com/waxmp3/player
I'd like to host this on google app engine.
On Mar 30, 6:13 pm, Chris Beer <chris_b...@wgbh.org> wrote:
> I've been testing with ingesting items from the NPR API into Solr (a
> Lucene-based search engine) to see what kinds of interesting analysis
> and functionality can be quickly prototyped. The demonstrator, with
> about 4 years of content, is athttp://cbeer.info/~chris/npr-solr/npr.html
> . I've blogged about it athttp://authoritativeopinion.com/blog/2010/03/19/npr-api-solr/
> and pushed the code out to github athttp://github.com/cbeer/npr-solr
I'd like to allow people to add comments (via status updates to
facebook or twitter).
Also, in addition to posting to facebook, have a live chat around each
song/story.
So every time you enter a song/story you have the option to enter the
chat with people who are currently listening to this same piece.
We could use the xmmp? (jabber protocol on google app engine). These
real-time chat sessions could feature group and private modes.
Since the xml parser would use media rss or atom over nprml, we could
open this service up to jamendo or other music services.
Could we also make use of google wave as the realtime chat protocol?
The idea of playing chats back in realtime... if these were timed to a
piece of music, that would be interesting.
On Mar 31, 7:59 am, johntynan <jgty...@gmail.com> wrote:
> Chris,
>
> Thanks so much for doing this. I think one of the great strengths of
> your using Solr is its ability to get beyond the 20 item per request
> limitation.
>
> I have two ideas that I would like to work on. I think the NPR API
> needs a "editorial layer" where you can generate a large set of
> results, then "curate" them according to the relevance to the story or
> topic that you are hoping to show. I have a use case with the NPR
> Timeline widget that I'd be glad to talk about with you further.
>
> I also have a second idea that I was intending to work on here:http://code.google.com/p/nprmusicmegaphone/
>
> I call it the NPR Music Megaphone:
>
> NPR Music Megaphone aims to create a seamless listening experience for
> NPR Music content. The goal is to leverage the NPR API and web
> standards like xspf (and eventually HTML5) to provide a continuous
> music/interview stream based on user preferences.
>
> Starting with the NPR API Reference:http://www.npr.org/api/inputReference.php
>
> I am thinking of creating a web-based music station with:
>
> Choice of Genres:http://api.npr.org/list?id=3018
>
> Choice of Artists:http://api.npr.org/list?id=3009
>
> Then, to create xml parsers (and an xml to xspf parser).
>
> Specifically, I will want to look for links to m3u files (among other
> resoureces) in a feed similar to this:http://api.npr.org/query?id=15404041&fields=title,teaser,storyDate,au...
I have two ideas that I would like to work on. I think the NPR API
needs a "editorial layer" where you can generate a large set of
results, then "curate" them according to the relevance to the story or
topic that you are hoping to show. I have a use case with the NPR
Timeline widget that I'd be glad to talk about with you further.
http://twitter.com/pubmedia/status/11670649082
From the backlog last week: What can #pubmedia learn from chat
roulette (and now NY Times roulette -- http://bit.ly/dxaiiZ).
http://twitter.com/pubmedia/status/11670658750
Specifically, how can #pubmedia incorporate serendipity and planned
randomness in content discovery (using APIs, deep archives, etc.)
I proposed the idea of
* a mobile app that used geographic context to present #pubmedia
content relating to physical location
* in talking with http://twitter.com/rgutel. She suggested a public
media "roadtrip app". Listen to stories relevant to travel route.
As I recall, Daniel Jacobson created a google map / NPR/NYT map mashup
for OSCON 2009:
http://www.danieljacobson.com/NewsMap/
As Daniel describes it, NewsMap "sends content from both NPR's API and
feeds from NYTimes through Yahoo!'s PlaceMaker API
http://developer.yahoo.com/geo/placemaker/ to identify geo-information
about the stories. The stories are then placed on a Google Map.
Code is available at" http://github.com/danieljacobson/NewsMap/tree/master
On Mar 30, 6:13 pm, Chris Beer <chris_b...@wgbh.org> wrote:
> I've been testing with ingesting items from the NPR API into Solr (a
> Lucene-based search engine) to see what kinds of interesting analysis
> and functionality can be quickly prototyped. The demonstrator, with
> about 4 years of content, is athttp://cbeer.info/~chris/npr-solr/npr.html
> . I've blogged about it athttp://authoritativeopinion.com/blog/2010/03/19/npr-api-solr/
> and pushed the code out to github athttp://github.com/cbeer/npr-solr
--
To unsubscribe, reply using "remove me" as the subject.
So, what would it take to get local stations to expose their content
in a scrapable way? If I were an optimist, I'd say the new NPR ingest
API could solve all sorts of problems and provide a nice dataset. As a
pessimist though, it looks like we'd have to combine a number of
sources:
- it looks like PI provides RSS feeds, which should capture some swath
of content.. Is there a deep archive behind all that?
- some stations (including my own...) don't provide (discoverable?)
RSS feeds for their content; is there a way to incorporate all this
content?
- what can we do about local arts + culture content, which probably
isn't captured in an existing feed. Are there metadata hints page
creators could offer? would we need a manual ingest/tagging interface
(which is one approach PBS is taking for content metadata).
- can we tap into the COVE API and get useful information?
The more I think about this, the more excited I get -- there's so much
potential in aggregating and exposing local material (and then using
that to justify the importance of local public media)
Chris
On Apr 6, 11:52 am, Amanda Hirsch <ahirsc...@gmail.com> wrote:
> I'd love to see a roadtrip app and/or online guide that includes some human
> curation, in addition to whatever could be automated through the
> API...someone to pick out a selection of great cultural stories that are
> "recommended public media content" for whatever place you're traveling
> through...
>
>
>
> On Tue, Apr 6, 2010 at 11:43 AM, johntynan <jgty...@gmail.com> wrote:
> > Following up on the question:
>
> >http://twitter.com/pubmedia/status/11670649082
> > From the backlog last week: What can #pubmedia learn from chat
> > roulette (and now NY Times roulette --http://bit.ly/dxaiiZ).
>
> >http://twitter.com/pubmedia/status/11670658750
> > Specifically, how can #pubmedia incorporate serendipity and planned
> > randomness in content discovery (using APIs, deep archives, etc.)
>
> > I proposed the idea of
>
> > * a mobile app that used geographic context to present #pubmedia
> > content relating to physical location
>
> > * in talking withhttp://twitter.com/rgutel. She suggested a public
> > media "roadtrip app". Listen to stories relevant to travel route.
>
> > As I recall, Daniel Jacobson created a google map / NPR/NYT map mashup
> > for OSCON 2009:
>
> >http://www.danieljacobson.com/NewsMap/
>
> > As Daniel describes it, NewsMap "sends content from both NPR's API and
> > feeds from NYTimes through Yahoo!'s PlaceMaker API
> >http://developer.yahoo.com/geo/placemaker/to identify geo-information
> > about the stories. The stories are then placed on a Google Map.
> > Code is available at"http://github.com/danieljacobson/NewsMap/tree/master
>
> > On Mar 30, 6:13 pm, Chris Beer <chris_b...@wgbh.org> wrote:
> > > I've been testing with ingesting items from the NPR API into Solr (a
> > > Lucene-based search engine) to see what kinds of interesting analysis
> > > and functionality can be quickly prototyped. The demonstrator, with
> > > about 4 years of content, is athttp://
> > cbeer.info/~chris/npr-solr/npr.html<http://cbeer.info/%7Echris/npr-solr/npr.html>
Can you plug other feeds besides the NPR API into your Solr app Chris?
I had a dream about a search engine that indexed every story in public
media together.
--
To unsubscribe, reply using "remove me" as the subject.
I think what both PBS and NPR are doing to start capturing local
content is great. I'm really curious to see what kind of information
stations contribute back, although I fear it will be a subset of
what's actually out there (either because they self-curate, rights
restrictions, whatever) and both NPR and PBS have an interest in
collecting media assets, not just metadata. Maybe the CPB American
Archive could pick up that end of things..
Anyway, it is all very interesting and makes me wish I had more time
to throw at playing around with aggregating materials.
On a separate note, I am not sure that building systems around
scrapable content is the way to go in the long run. RSS-like content
is really just reference material that drives people back to a web
page, which is great for the web. But to make the content really
portable, to have it appear on mobile devices for example, it is much
better to get the full content centralized.
By the way, let me know if you want to collaborate on mapping the NPR
API to Dublin Core, by the way. Jack, Dave and I spent time creating
a mapping from NPRML to PBCore, so that could be a decent starting
point.