Scraper Tool

79 views
Skip to first unread message

flu...@gmail.com

unread,
Nov 4, 2011, 7:52:56 PM11/4/11
to stolencam...@googlegroups.com
Hi everyone,

like so many others, my camera has been stolen about 2 days ago, today I used the Flickr-Scraper for a few hours an was able to scan nearly 7500 Pictures.
I'd now like to run the scraper tool for a longer period,thus I'd like to ask if there is a commandline version of the tool available?
(my server runs linux 64bit)

regards,
Florian

Matt Burns

unread,
Nov 4, 2011, 8:08:56 PM11/4/11
to stolencam...@googlegroups.com
I'm currently in the final stages of developing such a tool. I'll let you know when it's ready for general release.

Thanks for your interest and I hope it helps find your camera!

Matt

fluetke

unread,
Jan 4, 2012, 7:42:39 AM1/4/12
to stolencam...@googlegroups.com
hi, 

its been a while since the last post.
I'm currently using an ugly hack to get flickr scrapr to work on my server:
 * Installed Xrdp, started flickr scraper (and a script that takes screenshots every 15minutes) in an rdp session, then firewalled it so nobody gets through.
works fine so far.

Florian

Matt Burns

unread,
Jan 4, 2012, 9:36:34 AM1/4/12
to stolencam...@googlegroups.com
Hi Florian,

Thanks for helping. I'll email you an early-release command line version that I'm sure you'll find much easier (and hopefully faster).

Matt

fluetke

unread,
Jan 4, 2012, 6:16:14 PM1/4/12
to stolencam...@googlegroups.com
Hi Matt, 

thanks for emailing the early version, looks good so far. 
yodaTime is spitting out parseExceptions, but this doesn't look like a real problem.
I'm currently working on my bachelors degree in computer science, so if you need help testing and developing the scraper, 
just drop me a line and i'll try to help as best as I can.

Florian

fluetke

unread,
Jan 4, 2012, 6:20:08 PM1/4/12
to stolencam...@googlegroups.com
I think I'll take a look at the source on google code and see what i can do.

Matt Burns

unread,
Jan 5, 2012, 5:26:09 AM1/5/12
to stolencam...@googlegroups.com
Thanks,
You're right, the exceptions are nothing to worry about. Basically, if an image is found with a date in an illegal format, then it is aborted and processing continues. It just doesn't look that pretty!

To help you get started, the source is separated into several components. That jar is built from scf-flickrtools. Simply type "ant jar" to build it (assuming ant is installed / on your path).

It in turn uses another component, the scf-api, which is a java library for communicating with my web services / live database. It basically wraps the public api so that it can be invoked in nice java :) The other key advantage is that it pre-validates the request so as to save time and bandwidth. For example, better validation could be applied to the date in the RequestValidator. There is a corresponding test class that gives you an idea of what is validated.

I think all of the components are stand alone eclipse projects. Once checked out, just import them into eclipse. Most have ant build scripts too.

I should probably write this sort of stuff up properly on some wiki documentation pages...

I did my Software Engineering degree ~10 years ago (eek, when did I get old?) but am happy to help you out however I can. If you're looking for final year project, a distributed, de-centralised image-crawling web spider would be a great (and useful) project...

Thanks again,
Matt

Håkan Lindgren

unread,
Oct 13, 2016, 5:18:51 PM10/13/16
to stolen camera finder
ant jar

Tariq Mageed

unread,
Nov 15, 2019, 11:17:40 AM11/15/19
to stolen camera finder
Reply all
Reply to author
Forward
0 new messages