Randomization is activated by setting perf_event_attr.random_period_width
to a non-zero value. It represents the width of the mask to apply to the
random value, i.e, maximum range of variation. The random value is applied
AROUND the period, i.e., period may be longer or shorter with a maximum
variation of half the bit width. Thus, on average the sampling period remains
equal to the initial period passed in perf_event_attr.sample_period.
Note that randomization is not available when a target interrupt rate
(freq) is enabled.
The last used period can be collected using the PERF_SAMPLE_PERIOD flag
in sample_type.
Randomization is implemented in generic code thus it applies to all
PMU models and software events.
Signed-off-by: Stephane Eranian <era...@google.com>
--
include/linux/perf_event.h | 3 +++
kernel/perf_event.c | 44 ++++++++++++++++++++++++++++++++++++++++++++
2 files changed, 47 insertions(+)
diff --git a/include/linux/perf_event.h b/include/linux/perf_event.h
index 04f06b4..9848e08 100644
--- a/include/linux/perf_event.h
+++ b/include/linux/perf_event.h
@@ -214,6 +214,8 @@ struct perf_event_attr {
__u32 bp_type;
__u64 bp_addr;
__u64 bp_len;
+
+ __u8 random_period_width;
};
/*
@@ -877,6 +879,7 @@ extern int perf_swevent_get_recursion_context(void);
extern void perf_swevent_put_recursion_context(int rctx);
extern void perf_event_enable(struct perf_event *event);
extern void perf_event_disable(struct perf_event *event);
+extern void perf_randomize_event_period(struct perf_event *event);
#else
static inline void
perf_event_task_sched_in(struct task_struct *task) { }
diff --git a/kernel/perf_event.c b/kernel/perf_event.c
index a661e79..c581a99 100644
--- a/kernel/perf_event.c
+++ b/kernel/perf_event.c
@@ -30,6 +30,7 @@
#include <linux/perf_event.h>
#include <linux/ftrace_event.h>
#include <linux/hw_breakpoint.h>
+#include <linux/random.h>
#include <asm/irq_regs.h>
@@ -3866,6 +3867,9 @@ static int __perf_event_overflow(struct perf_event *event, int nmi,
else
perf_event_output(event, nmi, data, regs);
+ if (event->attr.random_period_width)
+ perf_randomize_event_period(event);
+
return ret;
}
@@ -3876,6 +3880,22 @@ int perf_event_overflow(struct perf_event *event, int nmi,
return __perf_event_overflow(event, nmi, 1, data, regs);
}
+void perf_randomize_event_period(struct perf_event *event)
+{
+ u64 mask = ((1ULL << event->attr.random_period_width) - 1);
+ u64 new_seed;
+
+ new_seed = random32();
+
+ if (unlikely(mask >> 32))
+ new_seed |= (u64)random32() << 32;
+
+ event->hw.sample_period = event->attr.sample_period
+ + (new_seed & mask) - (mask >> 1);
+
+ event->hw.last_period = event->hw.sample_period;
+}
+
/*
* Generic software event infrastructure
*/
@@ -4665,6 +4685,28 @@ done:
return event;
}
+static int perf_check_random_period(struct perf_event_attr *attr)
+{
+ u8 width = attr->random_period_width;
+ u64 period = attr->sample_period;
+ u64 half;
+
+ if (width > 63 || attr->freq)
+ return -EINVAL;
+
+ half = (1ULL << (width - 1)) - 1;
+
+ /* overflow */
+ if ((period + half) < period)
+ return -EINVAL;
+
+ /* underflow and zero */
+ if (period <= half)
+ return -EINVAL;
+
+ return 0;
+}
+
static int perf_copy_attr(struct perf_event_attr __user *uattr,
struct perf_event_attr *attr)
{
@@ -4736,6 +4778,8 @@ static int perf_copy_attr(struct perf_event_attr __user *uattr,
if (attr->read_format & ~(PERF_FORMAT_MAX-1))
return -EINVAL;
+ if (attr->random_period_width)
+ ret = perf_check_random_period(attr);
out:
return ret;
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majo...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Reviewed-by: Robert Richter <robert....@amd.com>
--
Advanced Micro Devices, Inc.
Operating System Research Center
email: robert....@amd.com
> This patch adds support for randomizing the sampling period. Randomization
> is very useful to mitigate the bias that exists with sampling. The random
> number generator does not need to be sophisticated. This patch uses the
> builtin random32() generator.
> + if (width > 63 || attr->freq)
> + return -EINVAL;
Why not for freq counters? Those are semi-randomized already, but it might
make sense to make them 'more' randomized in special circumstances. That would
also allow us to enable the randomization in perf top and perf record, by
default.
Without that we'd have no immediate usecase and no way to ensure that this
code works as intended.
Thanks,
Ingo
Randomization may prevent achieving the rate, or it may slow
it down. What's the value add of that?
> Without that we'd have no immediate usecase and no way to ensure that this
> code works as intended.
>
Why?
With perf you also have fixed sampling period (-c option), you simply need to
express the fact you want it randomized.
> Thanks,
>
> Ingo
>
--
Stephane Eranian | EMEA Software Engineering
Google France | 38 avenue de l'Opéra | 75002 Paris
Tel : +33 (0) 1 42 68 53 00
This email may be confidential or privileged. If you received this
communication by mistake, please
don't forward it to anyone else, please erase all copies and
attachments, and please let me know that
it went to the wrong person. Thanks
> On Thu, Mar 4, 2010 at 3:32 AM, Ingo Molnar <mi...@elte.hu> wrote:
> >
> > * era...@google.com <era...@google.com> wrote:
> >
> >> This patch adds support for randomizing the sampling period. ??Randomization
> >> is very useful to mitigate the bias that exists with sampling. The random
> >> number generator does not need to be sophisticated. This patch uses the
> >> builtin random32() generator.
> >
> >> + ?? ?? if (width > 63 || attr->freq)
> >> + ?? ?? ?? ?? ?? ?? return -EINVAL;
> >
> > Why not for freq counters? Those are semi-randomized already, but it might
> > make sense to make them 'more' randomized in special circumstances. That would
> > also allow us to enable the randomization in perf top and perf record, by
> > default.
> >
>
> What's the goal of freq?
> Achieve and maintain the target interrupt/rate.
> In doing so, it has to adjust the period (not randomly).
No, the goal of auto-freq is to keep a steady average rate of sampling.
There is no requirement to keep it 'steady' - each sample comes with a
specific weight.
> Randomization may prevent achieving the rate, or it may slow
> it down. What's the value add of that?
Why do you assume that the two are incompatible? We can randomize auto-freq
and still have a perfectly stable average rate.
We know how long each sample takes so the result is precise, via
PERF_SAMPLE_PERIOD.
> > Without that we'd have no immediate usecase and no way to ensure that this
> > code works as intended.
>
> Why?
>
> With perf you also have fixed sampling period (-c option), you simply need
> to express the fact you want it randomized.
-c is legacy in essence. The default is auto-freq and i doubt anyone uses -c
anymore.
Why would they use it? Auto-freq is so much more convenient - it adapts to the
workload and achieves a steady state of sampling, regardless of how frequent
the hardware events are and regardless of how dynamic the workload itself is.
Ingo
> There is no requirement to keep it 'steady' - each sample comes with a
> specific weight.
>
>> Randomization may prevent achieving the rate, or it may slow
>> it down. What's the value add of that?
>
> Why do you assume that the two are incompatible? We can randomize auto-freq
> and still have a perfectly stable average rate.
>
What would that buy you compared to what you already have?
> We know how long each sample takes so the result is precise, via
> PERF_SAMPLE_PERIOD.
>
I understand that.
> -c is legacy in essence. The default is auto-freq and i doubt anyone uses -c
> anymore.
>
The -c option may be more convenient when not counting events correlated with
time.
> On Thu, Mar 4, 2010 at 1:13 PM, Ingo Molnar <mi...@elte.hu> wrote:
> >
> > * Stephane Eranian <era...@google.com> wrote:
> >
> >> On Thu, Mar 4, 2010 at 3:32 AM, Ingo Molnar <mi...@elte.hu> wrote:
> >> >
> >> > * era...@google.com <era...@google.com> wrote:
> >> >
> >> >> This patch adds support for randomizing the sampling period. ??Randomization
> >> >> is very useful to mitigate the bias that exists with sampling. The random
> >> >> number generator does not need to be sophisticated. This patch uses the
> >> >> builtin random32() generator.
> >> >
> >> >> + ?? ?? if (width > 63 || attr->freq)
> >> >> + ?? ?? ?? ?? ?? ?? return -EINVAL;
> >> >
> >> > Why not for freq counters? Those are semi-randomized already, but it might
> >> > make sense to make them 'more' randomized in special circumstances. That would
> >> > also allow us to enable the randomization in perf top and perf record, by
> >> > default.
> >> >
> >>
> >> What's the goal of freq?
> >> Achieve and maintain the target interrupt/rate.
> >> In doing so, it has to adjust the period (not randomly).
> >
> > No, the goal of auto-freq is to keep a steady average rate of sampling.
>
> rate of samples = rate of interrupts (if period < 32 bits on Intel).
What's your point? I corrected your statement which said that the goal of
auto-freq was to maintain a target interrupt-rate and as such wouldnt be
randomizable. So i said that auto-freq is slightly different from that: it
provides a steady _average_ rate, and as such small amounts of randomization
'fuzz' could still be injected - the auto-freq system would auto-correct the
effects of that.
Think of it as a dynamic steady-state equilibrium with noise injected. If the
noise isnt too brutal and the system can adapt, the average sampling rate
doesnt change.
> > There is no requirement to keep it 'steady' - each sample comes with a
> > specific weight.
> >
> >> Randomization may prevent achieving the rate, or it may slow it down.
> >> What's the value add of that?
> >
> > Why do you assume that the two are incompatible? We can randomize
> > auto-freq and still have a perfectly stable average rate.
>
> What would that buy you compared to what you already have?
The same goal as randomization in general: to decrease the chance for sampling
artifacts that can occur due to the sampling frequency oscillating together
with some internal workload parameter, skewing the sample.
Ingo