Are reCAPTCHA datasets public?

Ben Sobel

Jun 6, 2017, 2:12:53 PM
to reCAPTCHA
Hi everyone,

I'm doing research in machine learning and I was wondering: are the datasets created by reCAPTCHA responses published anywhere for public use, or are they for internal use at Google only? Google's intro website mentions how completed reCAPTCHAs create labeled image data for use in machine learning training: "High quality human labelled images are compiled into datasets that can be used to train Machine Learning systems. Research communities benefit from such efforts that help build the next generation of groundbreaking Artificial Intelligence solutions." Can I make use of that data myself, and if so, how?

Any resources on this would be much appreciated. So far all I've found is this unanswered question on GitHub.

Thanks,
Ben