Re: [get.theinfo] Digest for get-theinfo@googlegroups.com - 1 Message in 1 Topic

12 views
Skip to first unread message

Dr. Jochen L. Leidner

unread,
Jul 21, 2010, 5:58:55 PM7/21/10
to get-t...@googlegroups.com
The University of Glasgow are selling a corpus called BLOG08, which is a crawls of a large number of blogs for research. While it not only includes medical topics, it should be straight forward to filter those out once you have an operational definition of what you mean by "disease blog" (eg a blog whose majority of posts mention more than one medical term).

Jochen

--
Dr. Jochen L. Leidner <lei...@acm.org>

f: +1 (651) 280-5106
w: http://www.jochenleidner.com
t: @jochenleidner

Sent from my Verizon Wireless BlackBerry


Date: Wed, 21 Jul 2010 20:57:46 +0000
To: Digest Recipients<get-thein...@googlegroups.com>
Subject: [get.theinfo] Digest for get-t...@googlegroups.com - 1 Message in 1 Topic

Group: http://groups.google.com/group/get-theinfo/topics

    MAYO <mayo...@gmail.com> Jul 21 04:38AM -0700 ^
     
    Does anyone know of a dataset which contains only medical/disease
    blogs(blogs related to medicine, disease, treatment, symptoms etc) or
    do you know of a site which contains only Medical/Disease blogs?

     

--
[from the http://groups.google.com/group/get-theinfo mailing list]
Reply all
Reply to author
Forward
0 new messages