The 12th bit seems to be also reserved.
Shouldn't it be 0xffff1ff0 ?
What kind of bitwise operations do you think it could help?
All of the operations I can find on dr6 are simple masks
test/set/clear.
> #define DR_TRAP0 (0x1) /* db0 */
> #define DR_TRAP1 (0x2) /* db1 */
> #define DR_TRAP2 (0x4) /* db2 */
> Index: linux-2.6-tip/arch/x86/kernel/traps.c
> ===================================================================
> --- linux-2.6-tip.orig/arch/x86/kernel/traps.c
> +++ linux-2.6-tip/arch/x86/kernel/traps.c
> @@ -534,6 +534,9 @@ dotraplinkage void __kprobes do_debug(st
>
> get_debugreg(dr6, 6);
>
> + /* Filter out all the reserved bits which are preset to 1 */
> + dr6 &= ~DR6_RESERVED;
> +
> /* Catch kmemcheck conditions first of all! */
> if ((dr6 & DR_STEP) && kmemcheck_trap(regs))
> return;
>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majo...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Oh and now that I see this patch, the previous one indeed makes sense
with this check:
if (dr6 & (~DR_TRAP_BITS))
rc = NOTIFY_DONE;
That said, it means thread.debugreg6 won't get the reserved bits anymore.
I see some use of them from kvm (it restores the reserved bits on guest<->host
switch). Not sure if this inconsistency could affect kvm...
Looks good. This stops any further checks though. This
is fine in case we have the DR_STEP bit as we will
toggle to NOTIFY_DONE. But what about vm86 case? Is it
possible we might still have something to handle in this
side? (although I don't quite understand how we can have
a breakpoint triggered in vm86 mode).
BTW, I'm not sure this is the right way to check if we want to send
a signal or not. Although it's not yet supported, we'll probably
bring the support for userspace breakpoints by perf.
May be we should put a "ptrace" flag in struct hw_perf_event instead.
The 12th bit is reserved to be 0 always.
> What kind of bitwise operations do you think it could help?
>
> All of the operations I can find on dr6 are simple masks
> test/set/clear.
>
As you found out later, this bitmask helps us in
hw_breakpoint_handler().
Thanks,
K.Prasad
Can you point me to the relevant code?
Anyway will copy this to Jan Kiszka <jan.k...@web.de> to hear what
this change means to KVM...on a similar note, will be happy to be
re-assured by Roland/Oleg about the patch's harmlessness to the
user-space (ptrace/utrace).
Hi Jan,
Patch 2009122618...@in.ibm.com introduces a change that
Patch 1/2: Clears the arch-reserved bits from debug status register.
This helps easy bitwise operations - such as the check for non-trap bits in
hw_breakpoint_handler. A check for the same using
"if (dr6 & (~DR_TRAP_BITS))" throws incorrect results due to the
presence of preset reserved bits.
Let us know if you foresee any harm from the said change to the
behaviour seen under KVM.
Thanks,
K.Prasad
I see various uses of DR6_VOLATILE and DR6_FIXED_1 in arch/x86/kvm/,
DR6_FIXED_1 being the fixed unused bits in dr6. Not sure how
this patch would affect what's set there.
I'll wait for Jan's answer.
Thanks.
Ah, ok.
> > What kind of bitwise operations do you think it could help?
> >
> > All of the operations I can find on dr6 are simple masks
> > test/set/clear.
> >
>
> As you found out later, this bitmask helps us in
> hw_breakpoint_handler().
Yeah, ok. Just waiting for Jan's answer to be sure it has
not side effects :)
Thanks.
You may need to synchronize me: What does the patch change, the shadow
register KVM will restore into DR6 on return to the host? Or the
register content KVM finds on guest entry?
The rules are simple: On entry, KVM assumes nothing about the register
state, just overwrites it (on demand) with the guest state. On exit, it
calls into hw_breakpoint_restore to ensure the host sees a proper state
(if required). But there is at no time an architecturally invalid state
loaded into the real register (that's basically what DR6_VOLATILE and
DR6_FIXED_1 are used for while in guest mode).
Jan
--
Siemens AG, Corporate Technology, CT T DE IT 1
Corporate Competence Center Embedded Linux
Sorry, this mail got buried deeply in my mailbox (hence the delay).
Basically, this patch tries to remove DR6 from its reserved bits to help
easy checks for certain status bits (such as DR_STEP). For instance, in
order to verify if DR_STEP (Bit 14) is set we must now do
if ((DR6 & ~DR6_RESERVED) & DR_STEP) {}
or
if (DR6 & (DR_STEP | DR6_RESERVED)) {}
which is redundant.
Instead this patch would expunge all reserved bits in DR6 before checks
for various status bits (to detect the cause of exception) are made in
do_debug().
At the outset, I don't think changes in the way the value of DR6 is used
for comparison in do_debug() would affect exception handling for either
KVM's guest or host OS (given that there are no hooks for the same in
do_debug()).
> The rules are simple: On entry, KVM assumes nothing about the register
> state, just overwrites it (on demand) with the guest state. On exit, it
> calls into hw_breakpoint_restore to ensure the host sees a proper state
> (if required). But there is at no time an architecturally invalid state
> loaded into the real register (that's basically what DR6_VOLATILE and
> DR6_FIXED_1 are used for while in guest mode).
>
Such a behaviour shouldn't be affected by the above change...your
confirmation would help!
Thanks,
K.Prasad
Hi Jan,
I presume that the above explanation makes the role of this
patch/bugfix clear.
Kindly let me know if you have any further queries.
Nope. There should be really no conflicts of your optimization with kvm.
Jan
--
Siemens AG, Corporate Technology, CT T DE IT 1
Corporate Competence Center Embedded Linux
Hi Jan,
Thanks for the confirmation.
Hi Frederic,
Can you pull these fixes in? (LKML references:
2009122618...@in.ibm.com and 2009122618...@in.ibm.com).
Thanks,
K.Prasad
Please tell a bit more in your changelogs. It took me some time
to guess whether this is a fix or not.
And this is not a fix but an optimization because SIGTRAP
is only sent if needed.
Here is what happens in do_debug() after handling the
breakpoint:
if (tsk->thread.debugreg6 & (DR_STEP | DR_TRAP_BITS))
send_sigtrap(tsk, regs, error_code, si_code);
This can only happen if we took the ptrace handler path.
Also:
Is that < TASK_SIZE an accurate check? We want support for
userspace breakpoints on perf tools later, and those don't want
signals.
We do this cleanup in the beginning of the breakpoint handler:
current->thread.debugreg6 &= ~DR_TRAP_BITS;
And from ptrace.c:ptrace_triggered():
thread->debugreg6 |= (DR_TRAP0 << i);
This is called on perf_bp_event().
Instead of checking if this is a userspace thread, we should actually
check if this is a ptrace breakpoint by looking at this
in the end of hw_breakpoint_handler().
current->thread.debugreg6 & DR_TRAP_BITS
Only ptrace breakpoints require signals.
Thanks.
I got a deeper look at these patches and answered with some comments.
If you address these, I can apply for .34 (because it's about optimizations
and not fixes).
Thanks.
Sure, please apply them against a version that you deem appropriate.
Thanks,
K.Prasad
Sorry about that...will add a descriptive changelog.
> And this is not a fix but an optimization because SIGTRAP
> is only sent if needed.
>
> Here is what happens in do_debug() after handling the
> breakpoint:
>
> if (tsk->thread.debugreg6 & (DR_STEP | DR_TRAP_BITS))
> send_sigtrap(tsk, regs, error_code, si_code);
>
> This can only happen if we took the ptrace handler path.
>
Agreed...signals are prevented as above...except that the notifier
semantics aren't properly used (NOTIFY_DONE vs NOTIFY_STOP).
Well, signal generation for user-space breakpoints happened
unconditionally for 'historical' reasons (guess that Alan Stern's
original patch had it that way).
We could change that into a 'ptrace-only' signal generation now.
> We do this cleanup in the beginning of the breakpoint handler:
>
> current->thread.debugreg6 &= ~DR_TRAP_BITS;
>
> And from ptrace.c:ptrace_triggered():
>
> thread->debugreg6 |= (DR_TRAP0 << i);
>
> This is called on perf_bp_event().
> Instead of checking if this is a userspace thread, we should actually
> check if this is a ptrace breakpoint by looking at this
> in the end of hw_breakpoint_handler().
>
> current->thread.debugreg6 & DR_TRAP_BITS
>
> Only ptrace breakpoints require signals.
>
Yes, this does look like a clean way to limit signals to those requests
that are interested (I was looking at round-about ways like doing a
lookup based on callback functions).
I will send the next version of the patch with the above changes.
Thanks,
K.Prasad
Yeah, now that we can have multiple-purpose concurrent breakpoints,
this is necessary.
> > We do this cleanup in the beginning of the breakpoint handler:
> >
> > current->thread.debugreg6 &= ~DR_TRAP_BITS;
> >
> > And from ptrace.c:ptrace_triggered():
> >
> > thread->debugreg6 |= (DR_TRAP0 << i);
> >
> > This is called on perf_bp_event().
> > Instead of checking if this is a userspace thread, we should actually
> > check if this is a ptrace breakpoint by looking at this
> > in the end of hw_breakpoint_handler().
> >
> > current->thread.debugreg6 & DR_TRAP_BITS
> >
> > Only ptrace breakpoints require signals.
> >
>
> Yes, this does look like a clean way to limit signals to those requests
> that are interested (I was looking at round-about ways like doing a
> lookup based on callback functions).
>
> I will send the next version of the patch with the above changes.
Thanks.