You might also have luck porting models for web traffic from other
open source simulators, i.e. I know that GTNetS has some web server
and client applications, and a traffic model based on random number
generators for http request times, response times, # files that have
to be downloaded per page, etc. I think that the random number
distributions used are based upon some measured statistics from the
real world (maybe from a Jeffay paper?), but I'm not too sure.
GTNetS:
http://www.ece.gatech.edu/research/labs/MANIACS/GTNetS/
In the GTNetS tree, the following has the browser and server
applications:
SRC/application-web*.{cc,h}
And the following describes the statistics (cumulative density
function) of browser requests:
SRC/http-distributions.{cc,h}
Porting these models is vaguely in the plans for the future, but you
might be able to get the jump on this sooner, and port the GTNetS
stuff yourself.