2012/3/15 Martin Berger <martin....@gmail.com>:
> --
> You received this message because you are subscribed to the Google Groups
> "Guerrilla Capacity Planning" group.
> To view this discussion on the web visit
> https://groups.google.com/d/msg/guerrilla-capacity-planning/-/FOrml1fl9msJ.
> To post to this group, send email to
> guerrilla-cap...@googlegroups.com.
> To unsubscribe from this group, send email to
> guerrilla-capacity-...@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
--
Twitter: http://twitter.com/znmeb Data Journalism Developer Studio
2012LX http://j.mp/DJDS2012LX
"A mathematician is a device for turning coffee into theorems." -- Paul Erdős
I did a lot of multidimensional exploratory plots around Little's Law
a while back (2008 CMG conference, to be exact) from "iostat" data on
Linux during a disk benchmark. Something like a scatterplot matrix
would work for what you're trying to do, I think.
2012/3/15 Martin Berger:
> Maybe my idea sounds somewhat strange, so please tell me if I have some big
> failures anywhere:
> Many here might now the nice graph with "Arrival rate" (R) on the x-axis and
> "Response time" (λ) on y-axis.
> It's also some fun to change the number of "servers" in the underlying
> excels/R/whatever to show slightly different graphs.
> My question is now if someone ever tried to calculate an artificial value
> for the servers (M) for a given set of measurements (R,λ) on a totally
> unknown system?
> Is this value of any usage, to describe the system? (The Service time would
> be also an outcome of this calculation, I guess).
> Any comments on this approach?
>
> thank you,
> Martin
>
> --
> You received this message because you are subscribed to the Google Groups
> "Guerrilla Capacity Planning" group.
> To view this discussion on the web visit
> https://groups.google.com/d/msg/guerrilla-capacity-planning/-/FOrml1fl9msJ.
> To post to this group, send email to
> To unsubscribe from this group, send email to
> For more options, visit this group at
> http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
To view this discussion on the web visit https://groups.google.com/d/msg/guerrilla-capacity-planning/-/oL2q-USn3BEJ.
To post to this group, send email to guerrilla-cap...@googlegroups.com.
To unsubscribe from this group, send email to guerrilla-capacity-...@googlegroups.com.
thank you for your 2 replies.
I did the formal transformations before and my equitation was
R = S / [ 1 - ( λ/(mS) )^m ]
that was the point where I decided to ask the group.
Your assumption regarding my question is correct. I still try to do my
best to make it somehow useable for me:
Can I say "If I have a measurement with a very small λ (max. only one
request is in the whole system at the same time) I can say S ≈ R" ?
This would make the whole problem slightly easier :-)
I will have to force R (the program, not the residence time) to solve
my problem.
>>>> > guerrilla-cap...@googlegroups.com.
>>>> > To unsubscribe from this group, send email to
>>>> > guerrilla-capacity-...@googlegroups.com.
>>>> > For more options, visit this group at
>>>> > http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
>>>>
>>>> --
>>>> Twitter: http://twitter.com/znmeb Data Journalism Developer Studio
>>>> 2012LX http://j.mp/DJDS2012LX
>>>>
>>>> "A mathematician is a device for turning coffee into theorems." -- Paul
>>>> Erdős
>
> --
> You received this message because you are subscribed to the Google Groups
> "Guerrilla Capacity Planning" group.
> To view this discussion on the web visit
> https://groups.google.com/d/msg/guerrilla-capacity-planning/-/zzVSiS3ahHUJ.
>
> To post to this group, send email to
> guerrilla-cap...@googlegroups.com.
> To unsubscribe from this group, send email to
> guerrilla-capacity-...@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
--
Martin Berger martin....@gmail.com
Lederergasse 27/2/14 +43 660 660 83306
1080 Wien http://berx.at/
Neil,thank you for your 2 replies.
I did the formal transformations before and my equitation was
R = S / [ 1 - ( λ/(mS) )^m ]
that was the point where I decided to ask the group.
Your assumption regarding my question is correct. I still try to do my
best to make it somehow useable for me:Can I say "If I have a measurement with a very small λ (max. only one
request is in the whole system at the same time) I can say S ≈ R" ?
This would make the whole problem slightly easier :-)I will have to force R (the program, not the residence time) to solve
my problem.
>>>> > To unsubscribe from this group, send email to
>>>> > guerrilla-capacity-planning+unsub...@googlegroups.com.
>>>> > For more options, visit this group at
>>>> > http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
>>>>
>>>> --
>
> --
> You received this message because you are subscribed to the Google Groups
> "Guerrilla Capacity Planning" group.
> To view this discussion on the web visit
> https://groups.google.com/d/msg/guerrilla-capacity-planning/-/zzVSiS3ahHUJ.
>
> To post to this group, send email to
> To unsubscribe from this group, send email to
> For more options, visit this group at
> http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
in general I try at least to get a 'good knowledge' about the storage
subsystem.
That means a study of the documentation as well as a test with orion
or some other tools. (If you want to test physical and logical IOs of
an oracle instance, maybe you are interested in [1] )
As we have in general full storage networks, there sometimes are
effects our storage admins does not want to accept at first sign. As
an example: we tested one big virtual 'disk' attached to a linux host
versus 10 smaller 'disks (which where based on the same underlying
structures - so they where 'the same thing' - but we got much better
results by these 10 disks. - I was never allowed to investigate this
to find the real reason, but my curent working theory is the multipoe
IO-queues on OS are the key for the better results.
In this particular question I do NOT want to model any sub-system - in
fact I try to do it the other way: step 'back' until I can see the
whole system as a single black box, where all the details can not e
seen anymore.
let's see if it gives me any good :)
Martin
[1] http://kevinclosson.wordpress.com/2012/02/06/introducing-slob-the-silly-little-oracle-benchmark/
> --
> You received this message because you are subscribed to the Google Groups "Guerrilla Capacity Planning" group.
> To post to this group, send email to guerrilla-cap...@googlegroups.com.
> To unsubscribe from this group, send email to guerrilla-capacity-...@googlegroups.com.
--
You received this message because you are subscribed to the Google Groups "Guerrilla Capacity Planning" group.
To view this discussion on the web visit https://groups.google.com/d/msg/guerrilla-capacity-planning/-/ccRbTVQ2C9cJ.
To post to this group, send email to guerrilla-cap...@googlegroups.com.
To unsubscribe from this group, send email to guerrilla-capacity-...@googlegroups.com.
To post to this group, send email to guerrilla-capacity-planning@googlegroups.com.
To unsubscribe from this group, send email to guerrilla-capacity-planning+unsub...@googlegroups.com.
re: Plot. Vast improvement. Nice job. One tweak you might add is to note which model you are fitting against.Anyway, now we can get down to some brass tacks.Before launching off into various performance modeling exotica (like, nonlinear regression and overdriven demand), you need to convince yourself that the target model (i.e., M/M/m) even makes sense for an RDMBS. For example, I could take the position that any RDMBS can only have a finite number (N) of processes handing DB requests. In which case, the queue length is bounded by N. An open queueing model like M/M/m is not compatible with that assumption b/c there, the number of requests can be unbounded. So which it it?
On the other hand, looking at your nice new plot, it seems to indicate that the RDMBS you're measuring does behave like an unbounded queue, b/c it goes asymptotic at about 2500 RPS. If I assume that indication (fitted curve) is correct and apply Little's law at saturation, I calculate your mean service time to be about 0.0004 s. And that, indeed, seems to jive with your y-intercept. Of course, we could've seen this immediately if you'd plotted against the utilization rather that the request rate on the x-axis. (cf. Fig. 4.17 on p. 121 of PPQ2)
But you say, "an M/M/m queue doesn't fit the data all too well" which I now find puzzling. Your new plot has pretty much convinced me of the thing you were trying to unconvince me of.
To view this discussion on the web visit https://groups.google.com/d/msg/guerrilla-capacity-planning/-/j9AibcSdUWgJ.
To post to this group, send email to guerrilla-cap...@googlegroups.com.
To unsubscribe from this group, send email to guerrilla-capacity-...@googlegroups.com.
On Tue, Jun 26, 2012 at 5:57 PM, DrQ wrote:re: Plot. Vast improvement. Nice job. One tweak you might add is to note which model you are fitting against.Anyway, now we can get down to some brass tacks.Before launching off into various performance modeling exotica (like, nonlinear regression and overdriven demand), you need to convince yourself that the target model (i.e., M/M/m) even makes sense for an RDMBS. For example, I could take the position that any RDMBS can only have a finite number (N) of processes handing DB requests. In which case, the queue length is bounded by N. An open queueing model like M/M/m is not compatible with that assumption b/c there, the number of requests can be unbounded. So which it it?The number of concurrent request is bounded, but the server is configured so that we never hit that limit.How can I best model limited queue size?
On the other hand, looking at your nice new plot, it seems to indicate that the RDMBS you're measuring does behave like an unbounded queue, b/c it goes asymptotic at about 2500 RPS. If I assume that indication (fitted curve) is correct and apply Little's law at saturation, I calculate your mean service time to be about 0.0004 s. And that, indeed, seems to jive with your y-intercept. Of course, we could've seen this immediately if you'd plotted against the utilization rather that the request rate on the x-axis. (cf. Fig. 4.17 on p. 121 of PPQ2)New plot attached
df <- read.csv("~/Desktop/wouterdb/wdata")
m <- 2
mmm.fit <- nls(Rtime ~ S/(1-(b*Arate*S/m)^m), data=df, start=list(S=1e-4,b=1.0))
summary(mmm.fit)
plot(df,xlab="Arrival rate (RPS)",ylab="Response time (s)")
lines(df$Arate,predict(mmm.fit),col="blue")
title(main=paste(paste("Parametric M/M/",toString(m),sep=""),"Model of RDBMS Data"))
Done.
But you say, "an M/M/m queue doesn't fit the data all too well" which I now find puzzling. Your new plot has pretty much convinced me of the thing you were trying to unconvince me of.The plot fits quite well, but only because the request rate has been multiplied by 2.3. I know it works, but I don't see its real world meaning.I think a multiplier smaller than one would mean there is a delay station somewhere inside the server, which adds to the response time, but not to utilization.But what does a multiplier larger than one mean?
On Tuesday, June 26, 2012 5:49:11 AM UTC-7, wouterdb wrote:
I stand corrected, new graph attachedWhat I am trying to do is automatically determine the parameters of an M/M/m queue, given a set of measurements.In the example graph, the data is from a mysql database, executing a set of select queries.However, it turns out that an M/M/m queue doesn't fit the data all too well, so I added overdriven throughput (as in 'analyzing computer system performance with PDQ, 2e ed, p 381').I have two concrete questions about this:
- I was wondering if anyone else has ever tried to do automated parameter estimation before?
- How can I improve the closeness of fit of the model?
Wouter
is supposed to represent), but I can say this. It doesn't do your cause any favors to produce a plot with no labeled axes or legend of any kind. If you are doing science, rather than art, the reader shouldn't be left guessing what you are trying to convey. Otherwise, the response is likely to be a deafening silence.
On Apr 5, 11:04 pm, DrQ wrote:
To unsubscribe from this group, send email to guerrilla-capacity-planning+unsubs...@googlegroups.com.
To unsubscribe from this group, send email to guerrilla-capacity-planning+unsubs...@googlegroups.com.
To unsubscribe from this group, send email to guerrilla-capacity-planning+unsubs...@googlegroups.com.
To unsubscribe from this group, send email to guerrilla-capacity-planning+unsubs...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
To view this discussion on the web visit https://groups.google.com/d/msg/guerrilla-capacity-planning/-/CgBt_IKMmkcJ.
To post to this group, send email to guerrilla-cap...@googlegroups.com.
To unsubscribe from this group, send email to guerrilla-capacity-...@googlegroups.com.
I've never considered R before, but I'll look into it ;-)
On the other hand, looking at your nice new plot, it seems to indicate that the RDMBS you're measuring does behave like an unbounded queue, b/c it goes asymptotic at about 2500 RPS. If I assume that indication (fitted curve) is correct and apply Little's law at saturation, I calculate your mean service time to be about 0.0004 s. And that, indeed, seems to jive with your y-intercept. Of course, we could've seen this immediately if you'd plotted against the utilization rather that the request rate on the x-axis. (cf. Fig. 4.17 on p. 121 of PPQ2)
To unsubscribe from this group, send email to guerrilla-capacity-planning+unsubs...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
On Wednesday, June 27, 2012 2:55:22 AM UTC-7, wouterdb wrote:
On Tue, Jun 26, 2012 at 5:57 PM, DrQ wrote:re: Plot. Vast improvement. Nice job. One tweak you might add is to note which model you are fitting against.Anyway, now we can get down to some brass tacks.Before launching off into various performance modeling exotica (like, nonlinear regression and overdriven demand), you need to convince yourself that the target model (i.e., M/M/m) even makes sense for an RDMBS. For example, I could take the position that any RDMBS can only have a finite number (N) of processes handing DB requests. In which case, the queue length is bounded by N. An open queueing model like M/M/m is not compatible with that assumption b/c there, the number of requests can be unbounded. So which it it?The number of concurrent request is bounded, but the server is configured so that we never hit that limit.How can I best model limited queue size?
On the other hand, looking at your nice new plot, it seems to indicate that the RDMBS you're measuring does behave like an unbounded queue, b/c it goes asymptotic at about 2500 RPS. If I assume that indication (fitted curve) is correct and apply Little's law at saturation, I calculate your mean service time to be about 0.0004 s. And that, indeed, seems to jive with your y-intercept. Of course, we could've seen this immediately if you'd plotted against the utilization rather that the request rate on the x-axis. (cf. Fig. 4.17 on p. 121 of PPQ2)
To unsubscribe from this group, send email to guerrilla-capacity-planning+unsubs...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
On Wednesday, June 27, 2012 2:55:22 AM UTC-7, wouterdb wrote:
On Tue, Jun 26, 2012 at 5:57 PM, DrQ wrote:re: Plot. Vast improvement. Nice job. One tweak you might add is to note which model you are fitting against.Anyway, now we can get down to some brass tacks.Before launching off into various performance modeling exotica (like, nonlinear regression and overdriven demand), you need to convince yourself that the target model (i.e., M/M/m) even makes sense for an RDMBS. For example, I could take the position that any RDMBS can only have a finite number (N) of processes handing DB requests. In which case, the queue length is bounded by N. An open queueing model like M/M/m is not compatible with that assumption b/c there, the number of requests can be unbounded. So which it it?The number of concurrent request is bounded, but the server is configured so that we never hit that limit.How can I best model limited queue size?
On the other hand, looking at your nice new plot, it seems to indicate that the RDMBS you're measuring does behave like an unbounded queue, b/c it goes asymptotic at about 2500 RPS. If I assume that indication (fitted curve) is correct and apply Little's law at saturation, I calculate your mean service time to be about 0.0004 s. And that, indeed, seems to jive with your y-intercept. Of course, we could've seen this immediately if you'd plotted against the utilization rather that the request rate on the x-axis. (cf. Fig. 4.17 on p. 121 of PPQ2)
To unsubscribe from this group, send email to guerrilla-capacity-planning+unsubs...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/guerrilla-capacity-planning?hl=en.
--
You received this message because you are subscribed to the Google Groups "Guerrilla Capacity Planning" group.
To view this discussion on the web visit https://groups.google.com/d/msg/guerrilla-capacity-planning/-/CgBt_IKMmkcJ.

To view this discussion on the web visit https://groups.google.com/d/msg/guerrilla-capacity-planning/-/vTZRjZZKrJ0J.
To post to this group, send email to guerrilla-cap...@googlegroups.com.
To unsubscribe from this group, send email to guerrilla-capacity-...@googlegroups.com.