Andre Costa
unread,Sep 17, 2023, 5:37:32 AM9/17/23Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to cache-trace
Hello, I was utilizing these real traces to compare them to my synthetic workloads.
The objective was to understand how well my synthetic workload could imitate reality, so that I can evaluate how much we can trust results obtained by our synthetic workloads.
My question is: how did you calculate the zipf alpha value in the traces? I know that it is possible that the access distribution changes over time, but from my analysis some of the zipf alpha values are wrong.
For example, from the first 465 million requests from cluster1, the zipfian alpha is 0.22; but the value in the statistics is 2.6 (which makes a huge difference). Even though I know it is possible that the skewness of the workload changed later on, it seems unlikely that it changed that drastically.
I might be missing the point entirely, please excuse me if that's the case.
Thank you for your help!