Wondering about download numbers...

5 views
Skip to first unread message

Yannick Pouliot

unread,
Jan 30, 2012, 6:32:47 PM1/30/12
to cloudbiolinux
Howdy yall. Q: are there any data on the download frequencies for the
various CloudBioLinux machines? I'm writing a proposal that features
the ideas behind CloudBioLinux and would dearly love to quote such
numbers...

Cheers and keep up the awesome work!

Yannick

Brad Chapman

unread,
Feb 1, 2012, 6:07:33 AM2/1/12
to Yannick Pouliot, cloudbiolinux, Dawn Field

Yannick;

> Howdy yall. Q: are there any data on the download frequencies for the
> various CloudBioLinux machines? I'm writing a proposal that features
> the ideas behind CloudBioLinux and would dearly love to quote such
> numbers...

Numbers like this are a bit tricky since instantiating AMIs isn't actually
a download. I'm not aware that Amazon has logs for AMI usage.

Dawn (cc'ed) might have some numbers from her CloudBioLinux grant
writing that would be helpful.

Brad

Tim Booth

unread,
Feb 2, 2012, 7:57:27 AM2/2/12
to cloudb...@googlegroups.com, Yannick Pouliot
Hiya,

We have download numbers for non-cloud Bio-Linux which we extract from
the server logs and the voluntary registration page. We also have this
headline figure of ">5000" installed Bio-Linux systems. This is gleaned
by looking at the Apcahe logs for hits to the Packages.gz file which is
fetched by "apt-get update". I take the number of unique IP addresses
hitting this file within a month period. This would include EC2
instances but I don't know if each shows up with a unique IP (probably
not due to NAT). Also I've not attempted to pull out EC2 IP ranges
specifically from the logs, but I could do this given a little time.

Anyway, slide 4 of these slides has our headline figures:
http://nebc.nerc.ac.uk/downloads/bio-linux/tim_soon_talk_southport_2012.pdf

It might be worth adding a ping-back to the CBL image that we can use to
spot each installation. This would be run once when the machine is
initialised. It would be handy for us but maybe considered bad privacy
practise, and also doesn't differentiate 10 genuine users from one
person who restarts the image 10 times.


Cheers,

TIM

--
Tim Booth <tbo...@ceh.ac.uk>
NERC Environmental Bioinformatics Centre

Centre for Ecology and Hydrology
Maclean Bldg, Benson Lane
Crowmarsh Gifford
Wallingford, England
OX10 8BB

http://nebc.nerc.ac.uk
+44 1491 69 2705

--
This message (and any attachments) is for the recipient only. NERC
is subject to the Freedom of Information Act 2000 and the contents
of this email and any reply you make may be disclosed by NERC unless
it is exempt from release under the Act. Any material supplied to
NERC may be stored in an electronic records management system.

Reply all
Reply to author
Forward
0 new messages