Discussions
Topics 1 - 10 of 194

Description: Do you try to get your hands on big data collections? Maybe you write scrapers or crawlers. Maybe you call people on the phone and beg. Or maybe you spend cash. However you do it, join this list to meet others doing the same things as you. And visit our website: theinfo.org.
 

Revealing "Smoking guns" as "Prosecutor's Colt 45", and more, for violations of State FOI Laws 
  Per our discussion last week about federal jurisdiction being appropriate in some situations for state FOI violations, attached is a petition to The Honorable William J. Schneider, Attorney General of Maine, to impanel and charge a grand jury with limited purposes for a limited time period to investigate and report on violations of a...
By Dwight Hines  - Nov 11 2011 - 1 new of 1 message    

Old scraping site like scraperwiki with the prefix "octo-"? 
  Hey all, does anyone remember one of the old scraper web apps that was called Octo-something? I've recently learned about scraperwiki.com, and thought there was something else before it. - Bryan
By Bryan Bishop  - Jul 24 2011 - 2 new of 2 messages    

Query logs and associated contents published 
  Dear all, I am a student researcher in search and IR. I want to study occasional spikes in the popularity of queries and the associated increase in generated content relevant to those queries. For example, when the Egypt protests occurred, search engine queries containing the keyword "Egypt" increased and the...
By hari sankar  - Mar 8 2011 - 1 new of 1 message    

ProxyMesh rotating proxy service 
  Hi, For those who want to do anonymous web crawling, I've recently launched [link]. With a one-time configuration on your end, it provides 10 rotating proxy servers to mask your IP address, and no bandwidth limits. Check it out if you're interested, and let me know if you have any questions. ...
By Jacob Perkins  - Mar 6 2011 - 1 new of 1 message    

Introducing http://GetTheData.org: Ask and Answer Data Related Questions 
  Hi All, People may be interested in a new project that was stealth-released a couple of weeks ago: <[link]> It's a site for asking and answering data-related questions. There's an introduction in this new post by Tony Hirst ([link]) who came up with the idea: <[link]>...
By Rufus Pollock  - Feb 8 2011 - 1 new of 1 message    

e-commerce data set 
  Hi all. Is there a data set of online shopping records that includes user, item, and temporal information? Thanks very much; I look forward to your reply. Yours sincerely, George
By George Zhao  - Dec 21 2010 - 1 new of 1 message    

delicious crawl revisited 
  So there was a thread here a couple of years back on crawling delicious, the social bookmarking / tagging site. [link] It seems very likely Delicious is being shut down by Yahoo. [link]...
By Dan Brickley  - Dec 17 2010 - 1 new of 1 message    

Bulk delicious export? 
  Since delicious is being shut down ([link]) a lot of users are like: WHERE SHOULD WE PUT OUR DATA? (e.g. [link]) I don't have the web skills, but this seems like a great opportunity to aggregate delicious export data, and release it as a social tagging...
By Joseph Turian  - Dec 16 2010 - 5 new of 5 messages    

CKAN v1.2 (and datapkg v0.7) Released 
  People may be interested to know that CKAN v1.2 and datapkg v0.7 have been released: <[link]> In addition to covering the main features of this release there's some discussion of the wider progress of the community over the last few...
By Rufus Pollock  - Dec 6 2010 - 1 new of 1 message    

Traffic Alerts Data Source 
  FARS = fatal accident reporting system - US Feds, much of it online. And Investigative Reporters and Editors have a cleaned-up copy, but see Google data clean-up procs. Also, most universities that have a traffic studies or transportation studies dept will have local and regional data -- University of North...
By Dwight Hines  - Nov 30 2010 - 1 new of 1 message    


Send email to this group: get-theinfo@googlegroups.com