You cannot post messages because only members can post, and you are not currently a member.
Description:
Do you try to get your hands on big data collections? Maybe you write scrapers or crawlers. Maybe you call people on the phone and beg. Or maybe you spend cash. However you do it, join this list to meet others doing the same things as you. And visit our website: theinfo.org.
|
|
|
Query logs and associated contents published
|
| |
Dear all, I am a student researcher in search and IR. I wanted to study the occasional increase in popularity of queries and associated increase in content generated which are relevant to the query. For example, when Egypt protests occurred, the search engine queries having "Egypt" keyword increased and the... more »
|
|
ProxyMesh rotating proxy service
|
| |
Hi, For those that want to do anonymous web crawling, I've recently launched [link]. With a one-time configuration on your end, it provides 10 rotating proxy servers to mask your IP address, and no bandwidth limits. Check it out if you're interested, and let me know if you have any questions.... more »
|
|
e-commerce data set
|
| |
Hi, all. Is there a data set which has some shopping online records including user,item and temporal information? thx very much and looking forward your kindly reply. yours sincerely, George
|
|
delicious crawl revisited
|
| |
So there was a thread here a couple years back on crawling delicious, the social bookmarking / tagging site. [link] It seems very likely Delicious is being shut down by Yahoo. [link]... more »
|
|
Bulk delicious export?
|
| |
Since delicious is being shut down ([link]) a lot of users are like: WHERE SHOULD WE PUT OUR DATA? (e.g. [link]) I don't have the web skills, but this seems like a great opportunity to aggregate delicious export data, and release it as a social tagging... more »
|
|
CKAN v1.2 (and datapkg v0.7) Released
|
| |
People may be interested to know that CKAN v1.2 and datapkg v0.7 have been released: <[link]> In addition to covering the main features of this release there's some discussion of the wider progress of the community over the last few... more »
|
|
Traffic Alerts Data Source
|
| |
FARS = fatal accident reporting system - US Feds, much of it on line And Investigative Reporters and Editors have a cleaned up copy, but see Google data clean up procs Also, most universiities that have a traiffiic studies or transportation studies dept willl have local and regional data -- University of North... more »
|
|
|