Learning From Crowds.
http://www.umiacs.umd.edu/~vikas/publications/raykar_JMLR_2010_crowds.pdf
The reason I think it might be useful is that it follows the same
themes as papers we've discussed recently, but addresses regression
problems as well as classification. Its applications are a bit
outside of what we've dealt with before: most of the case studies
relate to biomedical data and the annotators are still trained, except
for one classification task that builds on the RTE experiment in Snow,
et al.'s paper. It is a journal article and a bit longer and more
ML-oriented than previous papers we've read. If we choose this paper,
I'm willing to lead the discussion on it.
I'm sure that some people are tired of this theme and are interested
in other types of work, and if that's the case, we might want to look
at other types of things to read.
Another paper that has caught my interest lately and might be
applicable to others in the group is Google's 2009 paper describing
Pregel, a system for large-scale distributed graph processing that
mirrors MapReduce in some ways. The citation is:
Malewicz, G., Austern, M. H., Bik, A. J., Dehnert, J. C., Horn, I.,
Leiser, N., and Czajkowski, G. 2009. Pregel: a system for large-scale
graph processing. In Proceedings of the 28th ACM Symposium on
Principles of Distributed Computing (Calgary, AB, Canada, August 10 -
12, 2009). PODC '09. ACM, New York, NY, 6-6. DOI=
http://doi.acm.org/10.1145/1582716.1582723
This paper may or may not be outside of the scope of interest for
other in the group, especially because it might be too
systems-oriented for this crowd.
Any other ideas?
-Yinon
Yinon Bentor wrote:
> The paper I wouldn't mind reading and discussing is already on the list:
>
> Learning From Crowds.
> http://www.umiacs.umd.edu/~vikas/publications/raykar_JMLR_2010_crowds.pdf
>
> The reason I think it might be useful is that it follows the same
> themes as papers we've discussed recently, but addresses regression
> problems as well as classification. Its applications are a bit
> outside of what we've dealt with before: most of the case studies
> relate to biomedical data and the annotators are still trained, except
> for one classification task that builds on the RTE experiment in Snow,
> et al.'s paper. It is a journal article and a bit longer and more
> ML-oriented than previous papers we've read. If we choose this paper,
> I'm willing to lead the discussion on it.
>