liblyric -- first ideas.

6 views
Skip to first unread message

Dhruv Matani

unread,
Nov 8, 2006, 6:15:00 AM11/8/06
to The Distibuted DataBase
Hello everyone!!!!
This is a new project I want to get my hands dirty with. It's called
liblyric. The aim is to get a library(liblyric) which has an
interface:

string get_song_lyrics(string search_string);

Given the string to search for, eg: "Pink Floyd - Comfortably Numb"
will return the lyrics of the song as a string.

The way it will do this is by using google and the various search
results returned by google.
Steps involved:
[1] Query google for the search string.
[2] Get the first say 6 pages(results) that you get.
[3] Apply custom algo. to get text-intersection of these pages.
[4] Result is the lyrics, which can be used by calling application!!!!

Simple enough eh ;-)


I'm in the process of trying out these libraries for use:
[1] http://homepage.mac.com/pauljlucas/software/html_tree/ for HTML parsing.
[2] http://www.demailly.com/~dl/wwwtools.html#http-tiny for HTTP
request response.

If you have any better alternatives, please let me know, since this is
all still open to discussion. Preferred language of implementation:
C++. Library interface will be in C. Open to other languages of
implementation like Python, Perl, etc.... but only if you are willing
to do the work in that language.


Regards,
-Dhruv.

--
-Dhruv Matani.
http://www.geocities.com/dhruvbird/

"Be sure brain is in gear before engaging mouth"
-- Anonymous

Reply all
Reply to author
Forward
0 new messages