Seeking Developer with Common Crawl Experience for Quick Hacky Project


evs...@gmail.com

Mar 4, 2014, 7:55:59 PM
to common...@googlegroups.com
I have a bunch of hand-categorized domains (over 4k).

For each domain, I need to visit every page in the Common Crawl archive and grab its metadata.

I already have the project spec'd out by a developer who has worked with Common Crawl. This will help me get training data for my app, and if you need language classified by interest / news category, the result will be useful to you as well.

Please contact me if interested: evs...@gmail.com