Hi all,
As mentioned in a couple of threads there are some media things going
on. This is going to mean an uptick in website traffic when it happens;
exactly how much is impossible to predict. I'd quite like to put some
effort into keeping the website up.
Several people I've asked have recommended putting Cloudflare in front
of the website to cache a bunch of things, and generally using their CDN
to offload work. This makes sense, is free [0, 1], and we can just turn
it on and off. The questions to my mind are "how much more traffic can we
take with Cloudflare?", and "are there any rough edges we can optimise?".
The best way to answer those questions is to actually benchmark
srweb [2]. That requires knowing a representative distribution of the
pages that the average visitor hits. We can then hit the website [3]
with and without Cloudflare using that distribution, and measure the
difference. There is also a wealth of web benchmarking tools out there
that we can use to optimise things.
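
To make the "hit the website using that distribution" idea concrete, here
is a rough Python sketch, not a finished tool. It assumes we already have a
mapping of path to relative weight; the function names and the `fetch`
hook are made up for illustration. It fires requests one at a time, so it
measures latency rather than load; a real benchmarking tool would add
concurrency on top.

```python
import random
import statistics
import time
import urllib.request


def sample_paths(weights, n, seed=0):
    """Draw n paths at random, in proportion to their observed share."""
    rng = random.Random(seed)  # fixed seed so both runs see the same sequence
    paths = list(weights)
    return rng.choices(paths, weights=[weights[p] for p in paths], k=n)


def time_requests(base_url, paths, fetch=None):
    """Fetch each path once, sequentially; return latencies in seconds."""
    if fetch is None:
        def fetch(url):
            with urllib.request.urlopen(url, timeout=10) as resp:
                resp.read()
    latencies = []
    for path in paths:
        start = time.perf_counter()
        fetch(base_url + path)
        latencies.append(time.perf_counter() - start)
    return latencies


def summarise(latencies):
    """Return the median and a simple nearest-rank 95th percentile."""
    ordered = sorted(latencies)
    p95_index = max(0, int(round(0.95 * len(ordered))) - 1)
    return {"median": statistics.median(ordered), "p95": ordered[p95_index]}
```

Usage would be to call `sample_paths` once, then pass the same list to
`time_requests` twice: once with the base URL of the directly-served
copy and once with the one behind Cloudflare, and compare the two
`summarise` results.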
I have several days of free time booked up; would someone have spare
time for putting together such a benchmarking setup? The data available
would be a few weeks' worth of webserver logs that can be processed;
there's also the contents of piwik (srobo.org/piwik) that might yield
useful information. I can also set up a VM + subdomains to be
benchmarked against.
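
As a starting point for processing the logs, something like the Python
below might do. It is only a sketch: it assumes the logs are in the
usual Common / Combined Log Format (I haven't checked what ours actually
look like), and the excluded URL prefixes are guesses at where the
non-srweb parts live, so adjust both to taste.

```python
import re
from collections import Counter

# Assumption: Common/Combined Log Format, i.e. ... "GET /path HTTP/1.1" 200 ...
REQUEST_RE = re.compile(r'"(?:GET|HEAD) (\S+) HTTP/[\d.]+" (\d{3}) ')

# Assumption: hypothetical prefixes for the parts ordinary visitors won't hit.
EXCLUDED_PREFIXES = ("/ide", "/forum", "/trac", "/piwik")


def page_distribution(lines):
    """Map each successfully served path to its share of all counted hits."""
    counts = Counter()
    for line in lines:
        match = REQUEST_RE.search(line)
        if not match:
            continue  # not a GET/HEAD request line we recognise
        path, status = match.group(1), match.group(2)
        path = path.split("?", 1)[0]  # ignore query strings
        if status != "200" or path.startswith(EXCLUDED_PREFIXES):
            continue
        counts[path] += 1
    total = sum(counts.values())
    if not total:
        return {}
    return {path: n / total for path, n in counts.most_common()}
```

Feeding it an open log file (`page_distribution(open(...))`) gives a
path-to-fraction mapping, most popular page first, which is the
distribution the benchmark needs.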
[0] Mild concern about SSL: they'd be using their own CA / certificate
for studentrobotics.org. If we're at all worried about security, though,
we can keep a subdomain pointing at the real webserver and access that
directly.
[1] Cloudflare use SNI to disambiguate the destination domain of HTTPS
requests. This rules out several old browsers; however, AFAIK our SSL
configuration (i.e. banning RC4) already rules all those old browsers out.
[2] The unwashed masses will not be using the IDE, forum, trac etc.
[3] Or a copy of it
--
Thanks,
Jeremy