Billboard Top 100 Hits


Ed W

Mar 15, 2012, 12:55:56 PM
to ScraperWiki
Hello. I am interested in analyzing song lyrics. I've found a list of
the top 100 songs from each year since 1950:
http://www.jamrockentertainment.com/billboard-music-top-100-songs-listed-by-year.html

What I would like to do is scrape these pages to get the year, rank
within year, artist, and title for each song.
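
A minimal sketch of that first step, using only the Python standard library. It assumes each song appears as a table row with three cells (rank, artist, title); the real markup of the jamrockentertainment pages has not been checked, so `SAMPLE_HTML` and the three-cell layout are hypothetical and the row filter would need adjusting to the actual page.

```python
from html.parser import HTMLParser


class RowParser(HTMLParser):
    """Collect the text of every <td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # cells of the row being read, or None
        self._cell = None   # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that sits inside a <td> cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            if self._row:
                self.rows.append(self._row)
            self._row = None


def parse_chart(html, year):
    """Return one dict per song found in the HTML of a single year's page."""
    parser = RowParser()
    parser.feed(html)
    parser.close()
    songs = []
    for cells in parser.rows:
        # Assumed layout: rank / artist / title. Other rows are skipped.
        if len(cells) == 3 and cells[0].rstrip(".").isdigit():
            songs.append({
                "year": year,
                "rank": int(cells[0].rstrip(".")),
                "artist": cells[1],
                "title": cells[2],
            })
    return songs


# Made-up sample standing in for one downloaded page.
SAMPLE_HTML = """
<table>
  <tr><td>1.</td><td>Example Artist</td><td>Example Song</td></tr>
  <tr><td>2.</td><td>Another Artist</td><td>Another Song</td></tr>
</table>
"""

songs = parse_chart(SAMPLE_HTML, 1950)
```

Running `parse_chart` once per year's page and concatenating the results would give the full year/rank/artist/title list.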

I would then like to scrape the lyrics from these songs via a site
like http://www.songlyrics.com/

Ultimately, I would like to have these songs saved as separate text
files with the filename containing the identifying information scraped
from the jamrockentertainment site.
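
One possible way to build such filenames is sketched below. The naming pattern (`year_rank_artist_title.txt`) and the helper name `song_filename` are assumptions made for illustration, not something prescribed by either site.

```python
import re


def song_filename(year, rank, artist, title):
    """Build a filesystem-safe name such as 1950_001_Artist_Title.txt."""

    def slug(text):
        # Collapse every run of characters other than ASCII letters and
        # digits into one underscore, then trim underscores at the ends.
        return re.sub(r"[^A-Za-z0-9]+", "_", text).strip("_")

    # Zero-pad the rank so that files sort in chart order within a year.
    return f"{year}_{rank:03d}_{slug(artist)}_{slug(title)}.txt"


example = song_filename(1950, 1, "Example Artist", "Don't Stop!")
# example == "1950_001_Example_Artist_Don_t_Stop.txt"
```

Note that this slug drops accented and other non-ASCII characters, so two different songs could in principle collide on the same name.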

I'm new to this, and I realize this is a huge and difficult project
(or at least I think it is), but I think it would save my sanity if I
didn't have to copy and paste all of these things.

Any help you can provide would be greatly appreciated.

Thad Guidry

Mar 15, 2012, 2:56:37 PM
to scrap...@googlegroups.com
To save you time...

3/4 time mostly.
# 1 word will be LOVE.
and the genre will probably squeeze into POP or DANCE.

:)

Ed W

Mar 15, 2012, 3:08:47 PM
to ScraperWiki
Hah. Probably. Still, having the data would be nice ;)

Ed

Páll Hilmarsson

Mar 15, 2012, 3:29:56 PM
to scrap...@googlegroups.com
Here you have the songs (it's currently running - should finish in a few minutes)


It's a start.

All the best,

pallih

Ed W

Mar 15, 2012, 3:33:20 PM
to ScraperWiki
Thanks so much pallih.

Ed

Páll Hilmarsson

Mar 15, 2012, 5:55:59 PM
to scrap...@googlegroups.com
If anyone wants to take on the task of getting the lyrics for these songs, then here is a webservice:


pallih

Ed W

Mar 15, 2012, 10:47:03 PM
to ScraperWiki
Thanks for your help pallih. I'm pretty new to this, but I pick things
up quickly. Do you think I could do this myself? If so, what
programming language would I need to use and could you point me to
some examples/tutorials? Thanks again.

Ed

Ed W

Mar 16, 2012, 3:17:19 PM
to scrap...@googlegroups.com
Okay. I have compiled a list of possible databases from which to draw the lyrics (including the one suggested above):


I also found this program that pulls out lyrics from all the songs in your iTunes library:
This page has a lot of information on the development of this program as well as the source code. I think it may be possible to modify the source code so it searches my file with song names, artists, and year and uses that information to pull lyrics from the dbs above.

Attached is a .csv with the identifying information for every song whose lyrics are needed. Ideally, the scraper/program would use this file as its reference: it would take each song's information, use it to locate the lyrics, and scrape them. The text analysis program that we use needs each piece to be analyzed to be in a separate text file. In the best of all possible scenarios, we would end up with the lyrics for each song in its own text file, named with a unique id that matches a unique id (not yet created) in the .csv file. This would allow us to run the text files through the analysis program and then link the output of that program to the song information.
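
A rough sketch of that id-and-file workflow, under several assumptions: the column names (`year`, `rank`, `artist`, `title`) are guesses at the header of the attached .csv, the `song_id` format is invented here, and `lyrics_for` is a hypothetical placeholder for whatever lookup against a lyrics source ends up being used.

```python
import csv
import io
from pathlib import Path


def add_song_ids(rows):
    """Return copies of the rows with a new unique 'song_id' field."""
    tagged_rows = []
    for n, row in enumerate(rows, start=1):
        tagged = dict(row)
        tagged["song_id"] = f"S{n:05d}"   # hypothetical id format
        tagged_rows.append(tagged)
    return tagged_rows


def write_lyrics(rows, lyrics_for, out_dir):
    """Write one <song_id>.txt file per row for which lyrics were found.

    lyrics_for is a caller-supplied function (row -> text or None); it is
    a placeholder, not the API of any real lyrics service.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for row in rows:
        text = lyrics_for(row)
        if text is None:
            continue  # no lyrics found; the row keeps its id for later
        path = out_dir / f"{row['song_id']}.txt"
        path.write_text(text, encoding="utf-8")
        written.append(path)
    return written


# Made-up sample standing in for the attached file.
SAMPLE_CSV = (
    "year,rank,artist,title\n"
    "1950,1,Example Artist,Example Song\n"
    "1950,2,Another Artist,Another Song\n"
)

rows = add_song_ids(csv.DictReader(io.StringIO(SAMPLE_CSV)))
```

Calling `write_lyrics(rows, some_lookup, "lyrics")` would then produce `lyrics/S00001.txt` and so on. Because the same `song_id` is stored in each row, the analysis output for a file can be joined back to the song information.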

It is beyond my skillset to do this, so I am at the mercy of the knowledge of this group. Any and all help provided would be greatly appreciated.

Thanks very much,
Ed
top_100_songs_fixed.csv