Podcast Database

19 views
Skip to first unread message

bherrmann7

unread,
Nov 1, 2010, 9:44:13 AM11/1/10
to SpokenWord.org APIs

Is there an easy way for me to access the entire podcast database?

I'd like to use the podcast database with my application, but I dont
want to couple it to the spokenword apis. I might only want a fresh
copy (or the diffs) once a month. Perhaps if the database was
mirrored in github as a tree, that would be ideal - and open - and
available to all.

Doug Kaye

unread,
Nov 3, 2010, 5:58:41 PM11/3/10
to spokenw...@googlegroups.com
The database schema is fairly complex, which is exactly why we publish the APIs. We want to provide an abstraction of the objects that is independent of the complex internal relationships. For example, there are mapping tables between programs and feeds, and programs and collections. There are also "instances" separate from programs because a given program can have more than one recording (eg, audio and video).  We want to hide that from the outside world.

If there's enough demand and a good argument for doing so, we could export a normalized version of the database to somewhere like github, but we'd need more info as to why this is more helpful than the APIs.

What is it you're trying to do?

    ...doug

Doug Kaye, Executive Director
The Conversations Network
A 501(c)(3) Non-Profit
do...@rds.com
v: 415.868.5461
twitter: dougkaye
facebook.com/doug.kaye


--
You received this message because you are subscribed to the Google Groups "SpokenWord.org APIs" group.
To post to this group, send email to spokenw...@googlegroups.com.
To unsubscribe from this group, send email to spokenword-ap...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/spokenword-api?hl=en.


Bob Herrmann

unread,
Nov 3, 2010, 6:14:19 PM11/3/10
to spokenw...@googlegroups.com
On 11/03/2010 05:58 PM, Doug Kaye wrote:
> The database schema is fairly complex, which is exactly why we publish
> the APIs. We want to provide an abstraction of the objects that is
> independent of the complex internal relationships. For example, there
> are mapping tables between programs and feeds, and programs and
> collections. There are also "instances" separate from programs because a
> given program can have more than one recording (eg, audio and video).
> We want to hide that from the outside world.
>
> If there's enough demand and a good argument for doing so, we could
> export a normalized version of the database to somewhere like github,
> but we'd need more info as to why this is more helpful than the APIs.
>
> What is it you're trying to do?

ok, so more specifically, I have a android phone based podcast download/player, http://jadn.com/carcast/ , the podcast database that I search (a
simple mysql database) is small (around three thousand entries) - so I'd like to mine your podcast database (when CarCast users search for new
podcasts to add.)

-bob

>
> ...doug
>
> Doug Kaye, Executive Director
> The Conversations Network
> A 501(c)(3) Non-Profit

> do...@rds.com <mailto:do...@rds.com>
> v: 415.868.5461
> twitter: dougkaye
> facebook.com/doug.kaye <http://facebook.com/doug.kaye>


>
> On Mon, Nov 1, 2010 at 6:44 AM, bherrmann7 <bherr...@gmail.com
> <mailto:bherr...@gmail.com>> wrote:
>
>
> Is there an easy way for me to access the entire podcast database?
>
> I'd like to use the podcast database with my application, but I dont
> want to couple it to the spokenword apis. I might only want a fresh
> copy (or the diffs) once a month. Perhaps if the database was
> mirrored in github as a tree, that would be ideal - and open - and
> available to all.
>
> --
> You received this message because you are subscribed to the Google
> Groups "SpokenWord.org APIs" group.
> To post to this group, send email to spokenw...@googlegroups.com

> <mailto:spokenw...@googlegroups.com>.


> To unsubscribe from this group, send email to
> spokenword-ap...@googlegroups.com

> <mailto:spokenword-api%2Bunsu...@googlegroups.com>.

Doug Kaye

unread,
Nov 3, 2010, 6:24:42 PM11/3/10
to spokenw...@googlegroups.com
Sounds like a good app, Bob. How do you imagine the search working? Using what search terms? We've tried using our own search and spent many, many days on it. We finally gave up and went back to using Google, but that doesn't help for API-based search. The problem is that titles, descriptions, categories, keywords, etc., are all in separate tables. The joins aren't feasible. Maybe you have some idea of how your search would work and we could do it at our end.  Also, do you want to search for feeds, episodes or both?

   ...doug

Bob Herrmann

unread,
Nov 3, 2010, 6:37:23 PM11/3/10
to spokenw...@googlegroups.com
On 11/03/2010 06:24 PM, Doug Kaye wrote:
> Sounds like a good app, Bob. How do you imagine the search working?
> Using what search terms? We've tried using our own search and spent
> many, many days on it. We finally gave up and went back to using Google,
> but that doesn't help for API-based search. The problem is that titles,
> descriptions, categories, keywords, etc., are all in separate tables.
> The joins aren't feasible. Maybe you have some idea of how your search
> would work and we could do it at our end. Also, do you want to search
> for feeds, episodes or both?

CarCast only lets users subscribe to feeds (called subscriptions) we dont let users pick and choose individual episodes (called podcasts in CarCast.)
You either subscribe or you don't. Thus I would only really be searching on feed titles.

For example searching on "NPR" returns 43 hits, as jason,

{["NPR: Intelligence Squared Podcast", "http://www.npr.org/rss/podcast.php?id=510184"],
["NPR: StoryCorps Podcast", "http://www.npr.org/rss/podcast.php?id=510200"],
...
}

If there is a simple reliable way for Car Cast to search feed titles, then that would fit the bill.

-bob

> do...@rds.com <mailto:do...@rds.com> <mailto:do...@rds.com


> <mailto:do...@rds.com>>
>
> v: 415.868.5461
> twitter: dougkaye
> facebook.com/doug.kaye <http://facebook.com/doug.kaye>
> <http://facebook.com/doug.kaye>
>
>
> On Mon, Nov 1, 2010 at 6:44 AM, bherrmann7 <bherr...@gmail.com
> <mailto:bherr...@gmail.com>

> <mailto:bherr...@gmail.com <mailto:bherr...@gmail.com>>> wrote:
>
>
> Is there an easy way for me to access the entire podcast
> database?
>
> I'd like to use the podcast database with my application,
> but I dont
> want to couple it to the spokenword apis. I might only want
> a fresh
> copy (or the diffs) once a month. Perhaps if the database was
> mirrored in github as a tree, that would be ideal - and open
> - and
> available to all.
>
> --
> You received this message because you are subscribed to the
> Google
> Groups "SpokenWord.org APIs" group.
> To post to this group, send email to
> spokenw...@googlegroups.com
> <mailto:spokenw...@googlegroups.com>

> <mailto:spokenw...@googlegroups.com


> <mailto:spokenw...@googlegroups.com>>.
>
> To unsubscribe from this group, send email to
> spokenword-ap...@googlegroups.com
> <mailto:spokenword-api%2Bunsu...@googlegroups.com>

> <mailto:spokenword-api%2Bunsu...@googlegroups.com
> <mailto:spokenword-api%252Buns...@googlegroups.com>>.

Thilo Planz

unread,
Nov 3, 2010, 11:47:13 PM11/3/10
to spokenw...@googlegroups.com
>> Sounds like a good app, Bob.

Maybe you can strike a deal with Doug.
He gives you a copy of the database or makes an API that you can use
more easily, and you integrate sending ratings to Spokenword into your
app :-)


> For example searching on "NPR" returns 43 hits, as jason,
>
> {["NPR: Intelligence Squared Podcast",
> "http://www.npr.org/rss/podcast.php?id=510184"],
> ["NPR: StoryCorps Podcast",
> "http://www.npr.org/rss/podcast.php?id=510200"],
> ...
> }
>

You should be able to get that data by calling the Spokenword API.
I am not sure if an appropriate API currently exists, but there
should be one...

Doug turned off the search functions for generic queries, but a
well-defined search like this one (substring of feed title) seems
reasonable to want.

In your case, a CSV dump of feedId, feedUrl, feedName would also be an
alternative. How big would that be? We have 8.000 feeds, maybe 8MB ? It
could be updated daily and served as a static file from somewhere.

Thilo

Bob Herrmann

unread,
Nov 4, 2010, 8:14:22 AM11/4/10
to spokenw...@googlegroups.com
On 11/03/2010 11:47 PM, Thilo Planz wrote:
>>> Sounds like a good app, Bob.
>
> Maybe you can strike a deal with Doug.
> He gives you a copy of the database or makes an API that you can use
> more easily, and you integrate sending ratings to Spokenword into your
> app :-)


I think that makes sense long term. I would need to provide a way for my users to optionally create spoken word accounts. Perhaps the first time
the go to rate something.

>
>
>> For example searching on "NPR" returns 43 hits, as jason,
>>
>> {["NPR: Intelligence Squared Podcast",
>> "http://www.npr.org/rss/podcast.php?id=510184"],
>> ["NPR: StoryCorps Podcast",
>> "http://www.npr.org/rss/podcast.php?id=510200"],
>> ...
>> }
>>
>
> You should be able to get that data by calling the Spokenword API.
> I am not sure if an appropriate API currently exists, but there
> should be one...
>
> Doug turned off the search functions for generic queries, but a
> well-defined search like this one (substring of feed title) seems
> reasonable to want.
>
> In your case, a CSV dump of feedId, feedUrl, feedName would also be an
> alternative. How big would that be? We have 8.000 feeds, maybe 8MB ? It
> could be updated daily and served as a static file from somewhere.

That would be mighty good!
-bob

Reply all
Reply to author
Forward
0 new messages