BUG: bad usercopy in memdup

syzbot

unread,

Dec 18, 2017, 8:40:03 AM12/18/17

to da...@nullcore.net, kees...@chromium.org, keun-...@darkmatter.ae, lab...@redhat.com, linux-...@vger.kernel.org, linu...@kvack.org, mark.r...@arm.com, mi...@kernel.org, syzkall...@googlegroups.com, will....@arm.com

Hello,

syzkaller hit the following crash on
6084b576dca2e898f5c101baef151f7bfdbb606d
git://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/master
compiler: gcc (GCC) 7.1.1 20170620
.config is attached
Raw console output is attached.

Unfortunately, I don't have any reproducer for this bug yet.

netlink: 1 bytes leftover after parsing attributes in process
`syz-executor5'.
------------[ cut here ]------------
kernel BUG at mm/usercopy.c:84!
invalid opcode: 0000 [#1] SMP
Dumping ftrace buffer:
(ftrace buffer empty)
Modules linked in:
CPU: 0 PID: 3943 Comm: syz-executor0 Not tainted 4.15.0-rc3-next-20171214+
#67
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS
Google 01/01/2011
RIP: 0010:report_usercopy mm/usercopy.c:76 [inline]
RIP: 0010:__check_object_size+0x1e2/0x250 mm/usercopy.c:276
RSP: 0018:ffffc90000d6fca8 EFLAGS: 00010292
RAX: 0000000000000062 RBX: ffffffff82e57be7 RCX: ffffffff8123dede
RDX: 0000000000006340 RSI: ffffc900036c9000 RDI: ffff88021fc136f8
RBP: ffffc90000d6fce0 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8801e076dc50
R13: 0000000000000400 R14: 0000000000000000 R15: ffffffff82edf8a5
FS: 00007fe747e20700(0000) GS:ffff88021fc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000002000bffa CR3: 00000001dbd02006 CR4: 00000000001626f0
DR0: 0000000020000000 DR1: 0000000000010000 DR2: 000000000000f004
DR3: 0000000000000000 DR6: 00000000fffe0ff3 DR7: 0000000000000600
Call Trace:
check_object_size include/linux/thread_info.h:112 [inline]
check_copy_size include/linux/thread_info.h:143 [inline]
copy_from_user include/linux/uaccess.h:146 [inline]
memdup_user+0x46/0x90 mm/util.c:168
kvm_arch_vcpu_ioctl+0xc85/0x1810 arch/x86/kvm/x86.c:3499
kvm_vcpu_ioctl+0xf3/0x820 arch/x86/kvm/../../../virt/kvm/kvm_main.c:2715
vfs_ioctl fs/ioctl.c:46 [inline]
do_vfs_ioctl+0xaf/0x840 fs/ioctl.c:686
SYSC_ioctl fs/ioctl.c:701 [inline]
SyS_ioctl+0x8f/0xc0 fs/ioctl.c:692
entry_SYSCALL_64_fastpath+0x1f/0x96
RIP: 0033:0x452a09
RSP: 002b:00007fe747e1fc58 EFLAGS: 00000212 ORIG_RAX: 0000000000000010
RAX: ffffffffffffffda RBX: 000000000071bea0 RCX: 0000000000452a09
RDX: 0000000020ebec00 RSI: 000000004400ae8f RDI: 0000000000000019
RBP: 000000000000023b R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000212 R12: 00000000006f0628
R13: 00000000ffffffff R14: 00007fe747e206d4 R15: 0000000000000000
Code: 7b e5 82 48 0f 44 da e8 8d 82 eb ff 48 8b 45 d0 4d 89 e9 4c 89 e1 4c
89 fa 48 89 de 48 c7 c7 a8 51 e6 82 49 89 c0 e8 76 b7 e3 ff <0f> 0b 48 c7
c0 43 51 e6 82 eb a1 48 c7 c0 53 51 e6 82 eb 98 48
RIP: report_usercopy mm/usercopy.c:76 [inline] RSP: ffffc90000d6fca8
RIP: __check_object_size+0x1e2/0x250 mm/usercopy.c:276 RSP: ffffc90000d6fca8
---[ end trace 189465b430781fff ]---
Kernel panic - not syncing: Fatal exception
Dumping ftrace buffer:
(ftrace buffer empty)
Kernel Offset: disabled
Rebooting in 86400 seconds..

---
This bug is generated by a dumb bot. It may contain errors.
See https://goo.gl/tpsmEJ for details.
Direct all questions to syzk...@googlegroups.com.
Please credit me with: Reported-by: syzbot <syzk...@googlegroups.com>

syzbot will keep track of this bug report.
Once a fix for this bug is merged into any tree, reply to this email with:
#syz fix: exact-commit-title
To mark this as a duplicate of another syzbot report, please reply with:
#syz dup: exact-subject-of-another-report
If it's a one-off invalid bug report, please reply with:
#syz invalid
Note: if the crash happens again, it will cause creation of a new bug
report.
Note: all commands must start from beginning of the line in the email body.

config.txt

raw.log

Tetsuo Handa

unread,

Dec 18, 2017, 9:23:12 AM12/18/17

to linu...@kvack.org, syzbot, da...@nullcore.net, kees...@chromium.org, keun-...@darkmatter.ae, lab...@redhat.com, linux-...@vger.kernel.org, mark.r...@arm.com, mi...@kernel.org, syzkall...@googlegroups.com, will....@arm.com

On 2017/12/18 22:40, syzbot wrote:
> Hello,
>
> syzkaller hit the following crash on 6084b576dca2e898f5c101baef151f7bfdbb606d
> git://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/master
> compiler: gcc (GCC) 7.1.1 20170620
> .config is attached
> Raw console output is attached.
>
> Unfortunately, I don't have any reproducer for this bug yet.
>
>

This BUG is reporting

[ 26.089789] usercopy: kernel memory overwrite attempt detected to 0000000022a5b430 (kmalloc-1024) (1024 bytes)

line. But isn't 0000000022a5b430 strange for kmalloc(1024, GFP_KERNEL)ed kernel address?

Kees Cook

unread,

Dec 18, 2017, 7:57:58 PM12/18/17

to Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Mon, Dec 18, 2017 at 6:22 AM, Tetsuo Handa
<penguin...@i-love.sakura.ne.jp> wrote:
> On 2017/12/18 22:40, syzbot wrote:
>> Hello,
>>
>> syzkaller hit the following crash on 6084b576dca2e898f5c101baef151f7bfdbb606d
>> git://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/master
>> compiler: gcc (GCC) 7.1.1 20170620
>> .config is attached
>> Raw console output is attached.
>>
>> Unfortunately, I don't have any reproducer for this bug yet.
>>
>>
>
> This BUG is reporting
>
> [ 26.089789] usercopy: kernel memory overwrite attempt detected to 0000000022a5b430 (kmalloc-1024) (1024 bytes)
>
> line. But isn't 0000000022a5b430 strange for kmalloc(1024, GFP_KERNEL)ed kernel address?

The address is hashed (see the %p threads for 4.15).

There's something strange going on with the slab allocator (I haven't
tracked it down yet, unfortunately).

-Kees

--
Kees Cook
Pixel Security

Dmitry Vyukov

unread,

Dec 19, 2017, 3:13:19 AM12/19/17

to Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon, m...@tobin.cc

On Tue, Dec 19, 2017 at 1:57 AM, Kees Cook <kees...@chromium.org> wrote:
> On Mon, Dec 18, 2017 at 6:22 AM, Tetsuo Handa
> <penguin...@i-love.sakura.ne.jp> wrote:
>> On 2017/12/18 22:40, syzbot wrote:
>>> Hello,
>>>
>>> syzkaller hit the following crash on 6084b576dca2e898f5c101baef151f7bfdbb606d
>>> git://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/master
>>> compiler: gcc (GCC) 7.1.1 20170620
>>> .config is attached
>>> Raw console output is attached.
>>>
>>> Unfortunately, I don't have any reproducer for this bug yet.
>>>
>>>
>>
>> This BUG is reporting
>>
>> [ 26.089789] usercopy: kernel memory overwrite attempt detected to 0000000022a5b430 (kmalloc-1024) (1024 bytes)
>>
>> line. But isn't 0000000022a5b430 strange for kmalloc(1024, GFP_KERNEL)ed kernel address?
>
> The address is hashed (see the %p threads for 4.15).

+Tobin, is there a way to disable hashing entirely? The only
designation of syzbot is providing crash reports to kernel developers
with as much info as possible. It's fine for it to leak whatever.

> --
> You received this message because you are subscribed to the Google Groups "syzkaller-bugs" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to syzkaller-bug...@googlegroups.com.
> To view this discussion on the web visit https://groups.google.com/d/msgid/syzkaller-bugs/CAGXu5jKLBuQ8Ne6BjjPH%2B1SVw-Fj4ko5H04GHn-dxXYwoMEZtw%40mail.gmail.com.
> For more options, visit https://groups.google.com/d/optout.

Tobin C. Harding

unread,

Dec 19, 2017, 3:37:52 AM12/19/17

to Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 09:12:58AM +0100, Dmitry Vyukov wrote:
> On Tue, Dec 19, 2017 at 1:57 AM, Kees Cook <kees...@chromium.org> wrote:
> > On Mon, Dec 18, 2017 at 6:22 AM, Tetsuo Handa
> > <penguin...@i-love.sakura.ne.jp> wrote:
> >> On 2017/12/18 22:40, syzbot wrote:
> >>> Hello,
> >>>
> >>> syzkaller hit the following crash on 6084b576dca2e898f5c101baef151f7bfdbb606d
> >>> git://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/master
> >>> compiler: gcc (GCC) 7.1.1 20170620
> >>> .config is attached
> >>> Raw console output is attached.
> >>>
> >>> Unfortunately, I don't have any reproducer for this bug yet.
> >>>
> >>>
> >>
> >> This BUG is reporting
> >>
> >> [ 26.089789] usercopy: kernel memory overwrite attempt detected to 0000000022a5b430 (kmalloc-1024) (1024 bytes)
> >>
> >> line. But isn't 0000000022a5b430 strange for kmalloc(1024, GFP_KERNEL)ed kernel address?
> >
> > The address is hashed (see the %p threads for 4.15).
>
>
> +Tobin, is there a way to disable hashing entirely? The only
> designation of syzbot is providing crash reports to kernel developers
> with as much info as possible. It's fine for it to leak whatever.

We have new specifier %px to print addresses in hex if leaking info is
not a worry.

Hope this helps,
Tobin.

Dmitry Vyukov

unread,

Dec 19, 2017, 3:42:01 AM12/19/17

to Tobin C. Harding, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 9:37 AM, Tobin C. Harding <m...@tobin.cc> wrote:
>> > <penguin...@i-love.sakura.ne.jp> wrote:
>> >> On 2017/12/18 22:40, syzbot wrote:
>> >>> Hello,
>> >>>
>> >>> syzkaller hit the following crash on 6084b576dca2e898f5c101baef151f7bfdbb606d
>> >>> git://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/master
>> >>> compiler: gcc (GCC) 7.1.1 20170620
>> >>> .config is attached
>> >>> Raw console output is attached.
>> >>>
>> >>> Unfortunately, I don't have any reproducer for this bug yet.
>> >>>
>> >>>
>> >>
>> >> This BUG is reporting
>> >>
>> >> [ 26.089789] usercopy: kernel memory overwrite attempt detected to 0000000022a5b430 (kmalloc-1024) (1024 bytes)
>> >>
>> >> line. But isn't 0000000022a5b430 strange for kmalloc(1024, GFP_KERNEL)ed kernel address?
>> >
>> > The address is hashed (see the %p threads for 4.15).
>>
>>
>> +Tobin, is there a way to disable hashing entirely? The only
>> designation of syzbot is providing crash reports to kernel developers
>> with as much info as possible. It's fine for it to leak whatever.
>
> We have new specifier %px to print addresses in hex if leaking info is
> not a worry.

This is not a per-printf-site thing. It's per-machine thing.

Tobin C. Harding

unread,

Dec 19, 2017, 4:04:23 AM12/19/17

to Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

There is no way to disable the hashing currently built into the system.

Tobin

Dmitry Vyukov

unread,

Dec 19, 2017, 4:08:20 AM12/19/17

to Tobin C. Harding, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

Ack.
Any kind of continuous testing systems would be a use case for this.

Matthew Wilcox

unread,

Dec 19, 2017, 8:22:56 AM12/19/17

to Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 07:37:46PM +1100, Tobin C. Harding wrote:
> On Tue, Dec 19, 2017 at 09:12:58AM +0100, Dmitry Vyukov wrote:
> > On Tue, Dec 19, 2017 at 1:57 AM, Kees Cook <kees...@chromium.org> wrote:
> > > On Mon, Dec 18, 2017 at 6:22 AM, Tetsuo Handa

> > >> This BUG is reporting
> > >>
> > >> [ 26.089789] usercopy: kernel memory overwrite attempt detected to 0000000022a5b430 (kmalloc-1024) (1024 bytes)
> > >>
> > >> line. But isn't 0000000022a5b430 strange for kmalloc(1024, GFP_KERNEL)ed kernel address?
> > >
> > > The address is hashed (see the %p threads for 4.15).
> >
> >
> > +Tobin, is there a way to disable hashing entirely? The only
> > designation of syzbot is providing crash reports to kernel developers
> > with as much info as possible. It's fine for it to leak whatever.
>
> We have new specifier %px to print addresses in hex if leaking info is
> not a worry.

Could we have a way to know that the printed address is hashed and not just
a pointer getting completely scrogged? Perhaps prefix it with ... a hash!
So this line would look like:

[ 26.089789] usercopy: kernel memory overwrite attempt detected to #0000000022a5b430 (kmalloc-1024) (1024 bytes)

Or does that miss the point of hashing the address, so the attacker
thinks its a real address?

Dmitry Vyukov

unread,

Dec 19, 2017, 8:42:16 AM12/19/17

to Matthew Wilcox, Tobin C. Harding, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 2:22 PM, Matthew Wilcox <wi...@infradead.org> wrote:
>> > >> This BUG is reporting
>> > >>
>> > >> [ 26.089789] usercopy: kernel memory overwrite attempt detected to 0000000022a5b430 (kmalloc-1024) (1024 bytes)
>> > >>
>> > >> line. But isn't 0000000022a5b430 strange for kmalloc(1024, GFP_KERNEL)ed kernel address?
>> > >
>> > > The address is hashed (see the %p threads for 4.15).
>> >
>> >
>> > +Tobin, is there a way to disable hashing entirely? The only
>> > designation of syzbot is providing crash reports to kernel developers
>> > with as much info as possible. It's fine for it to leak whatever.
>>
>> We have new specifier %px to print addresses in hex if leaking info is
>> not a worry.
>
> Could we have a way to know that the printed address is hashed and not just
> a pointer getting completely scrogged? Perhaps prefix it with ... a hash!
> So this line would look like:
>
> [ 26.089789] usercopy: kernel memory overwrite attempt detected to #0000000022a5b430 (kmalloc-1024) (1024 bytes)
>
> Or does that miss the point of hashing the address, so the attacker
> thinks its a real address?

If we do something with this, I would suggest that we just disable
hashing. Any of the concerns that lead to hashed pointers are not
applicable in this context, moreover they are harmful, cause confusion
and make it harder to debug these bugs. That perfectly can be an
opt-in CONFIG_DEBUG_INSECURE_BLA_BLA_BLA.

Tetsuo Handa

unread,

Dec 19, 2017, 9:08:37 AM12/19/17

to dvy...@google.com, wi...@infradead.org, m...@tobin.cc, kees...@chromium.org, linu...@kvack.org, bot+719398b443fd30155f...@syzkaller.appspotmail.com, da...@nullcore.net, keun-...@darkmatter.ae, lab...@redhat.com, linux-...@vger.kernel.org, mark.r...@arm.com, mi...@kernel.org, syzkall...@googlegroups.com, will....@arm.com

Why not a kernel command line option? Hashing by default.

Dmitry Vyukov

unread,

Dec 19, 2017, 9:12:27 AM12/19/17

to Tetsuo Handa, Matthew Wilcox, Tobin C. Harding, Kees Cook, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

Would work for continuous testing systems too.
I just thought that since it has security implications, a config would
be more reliable. Say if a particular distribution builds kernel
without this config, then there is no way to enable it on the fly,
intentionally or not.

Tobin C. Harding

unread,

Dec 19, 2017, 3:34:02 PM12/19/17

to Matthew Wilcox, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

This poses the risk of breaking userland tools that parse the
address. The zeroing of the first 32 bits was a design compromise to
keep the address format while making _kind of_ explicit that some funny
business was going on.

> Or does that miss the point of hashing the address, so the attacker
> thinks its a real address?

No subterfuge intended.

Bonus points Wily, I had to go to 'The New Hackers Dictionary' to look
up 'scrogged' :)

thanks,
Tobin.

Tobin C. Harding

unread,

Dec 19, 2017, 3:45:13 PM12/19/17

to Dmitry Vyukov, Tetsuo Handa, Matthew Wilcox, Kees Cook, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon, Linus Torvalds

Adding Linus

I wasn't the architect behind the hashing, I've cc'd Linus in the event
he wants to correct me. I believe that some of the benefit of hashing
was to shake things up and force people to think about this issue. If we
implement a method of disabling hashing (command-line parameter or
CONFIG_) at this stage then we risk loosing this benefit since one has
to assume that people will just take the easy option and disable
it. Though perhaps after things settle a bit we could implement this
without the risk?

thanks,
Tobin.

Linus Torvalds

unread,

Dec 19, 2017, 4:36:48 PM12/19/17

to Matthew Wilcox, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 5:22 AM, Matthew Wilcox <wi...@infradead.org> wrote:
>
> Could we have a way to know that the printed address is hashed and not just
> a pointer getting completely scrogged? Perhaps prefix it with ... a hash!
> So this line would look like:

The problem with that is that it will break tools that parse things.

So no, it won't work.

When we find something like this, we should either remove it, fix the
permissions, or switch to %px.

In this case, there's obviously no permission issue: it's an error
report. So it's either "remove it, or switch to %px".

I'm personally not clear on whether the pointer really makes any sense
at all. But if it does, it should just be changed to %px, since it's a
bug report.

But honestly, what do people expect that the pointer value will
actually tell you if it is unhashed?

I suspect that an "offset and size within the kernel object" value
might make sense. But what does the _pointer_ tell you?

I've noticed this with pretty much every report. People get upset
about the hashing, but don't seem to actually be able to ever tell
what the f*ck they would use the non-hashed pointer value for.

I've asked for this before: whenever somebody complains about the
hashing, you had better tell exactly what the unhashed value would
have given you, and how it would have helped debug the problem.

Because if you can't tell that, then dammit, then we should just
_remove_ the stupid %p.

Instead, people ask for "can I get everything unhashed" even when they
can't give a reason for it.

Linus

Al Viro

unread,

Dec 19, 2017, 4:49:24 PM12/19/17

to Linus Torvalds, Matthew Wilcox, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 01:36:46PM -0800, Linus Torvalds wrote:

> I suspect that an "offset and size within the kernel object" value
> might make sense. But what does the _pointer_ tell you?

Well, for example seeing a 0xfffffffffffffff4 where a pointer to object
must have been is a pretty strong hint to start looking for a way for
that ERR_PTR(-ENOMEM) having ended up there... Something like
0x6e69622f7273752f is almost certainly a misplaced "/usr/bin", i.e. a
pathname overwriting whatever it ends up in, etc. And yes, I have run
into both of those in real life.

Debugging the situation when crap value has ended up in place of a
pointer is certainly a case where you do want to see what exactly has
ended up in there...

Kees Cook

unread,

Dec 19, 2017, 4:54:58 PM12/19/17

to Linus Torvalds, Matthew Wilcox, Tobin C. Harding, Dmitry Vyukov, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 1:36 PM, Linus Torvalds
<torv...@linux-foundation.org> wrote:
> In this case, there's obviously no permission issue: it's an error
> report. So it's either "remove it, or switch to %px".

Yup, my intention was to kill this %p and enhance the report to
actually include the useful information like, "what is the offset from
the start of the object", etc. That's what really matters.

-Kees

Randy Dunlap

unread,

Dec 19, 2017, 5:09:59 PM12/19/17

to Al Viro, Linus Torvalds, Matthew Wilcox, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On 12/19/2017 01:48 PM, Al Viro wrote:
> On Tue, Dec 19, 2017 at 01:36:46PM -0800, Linus Torvalds wrote:
>
>> I suspect that an "offset and size within the kernel object" value
>> might make sense. But what does the _pointer_ tell you?
>
> Well, for example seeing a 0xfffffffffffffff4 where a pointer to object
> must have been is a pretty strong hint to start looking for a way for
> that ERR_PTR(-ENOMEM) having ended up there... Something like
> 0x6e69622f7273752f is almost certainly a misplaced "/usr/bin", i.e. a

possibly poison values also?

> pathname overwriting whatever it ends up in, etc. And yes, I have run
> into both of those in real life.
>
> Debugging the situation when crap value has ended up in place of a
> pointer is certainly a case where you do want to see what exactly has
> ended up in there...

--
~Randy

Matthew Wilcox

unread,

Dec 19, 2017, 5:16:36 PM12/19/17

to Linus Torvalds, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 01:36:46PM -0800, Linus Torvalds wrote:

> On Tue, Dec 19, 2017 at 5:22 AM, Matthew Wilcox <wi...@infradead.org> wrote:
> >
> > Could we have a way to know that the printed address is hashed and not just
> > a pointer getting completely scrogged? Perhaps prefix it with ... a hash!
> > So this line would look like:
>
> The problem with that is that it will break tools that parse things.

Yeah, but the problem is that until people know to expect hashes, it
breaks people. I spent most of a day last week puzzling over a value
coming from a VM_BUG_ON that was explicitly tested for and couldn't
happen.

> When we find something like this, we should either remove it, fix the
> permissions, or switch to %px.

Right; I sent a patch to fix VM_BUG_ON earlier today after reading
this thread.

> But honestly, what do people expect that the pointer value will
> actually tell you if it is unhashed?

It would have been meaningful to me. For a start, I would have seen
that the bottom two bits were clear, so this was actually a pointer and
not something masquerading as a pointer.

Laura Abbott

unread,

Dec 19, 2017, 5:24:12 PM12/19/17

to Linus Torvalds, Matthew Wilcox, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

It's most useful in the "I really have no idea what's going on" case.
Sometimes just narrowing down if a pointer was kernel or vmalloc or
kmap or whatever is the only starting point. I agree that
size plus offset from object would be helpful.

Thanks,
Laura

Linus Torvalds

unread,

Dec 19, 2017, 6:24:30 PM12/19/17

to Al Viro, Matthew Wilcox, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 1:48 PM, Al Viro <vi...@zeniv.linux.org.uk> wrote:
> On Tue, Dec 19, 2017 at 01:36:46PM -0800, Linus Torvalds wrote:
>
>> I suspect that an "offset and size within the kernel object" value
>> might make sense. But what does the _pointer_ tell you?
>
> Well, for example seeing a 0xfffffffffffffff4 where a pointer to object
> must have been is a pretty strong hint to start looking for a way for
> that ERR_PTR(-ENOMEM) having ended up there... Something like
> 0x6e69622f7273752f is almost certainly a misplaced "/usr/bin", i.e. a
> pathname overwriting whatever it ends up in, etc. And yes, I have run
> into both of those in real life.

Sure. But that's for a faulting address when you have an invalid pointer.

That's not the case here at all.

Here, we've explicitly checked that it's a kernel pointer of some
particular type (in a slab cache in this case), and the pointer is
valid but shouldn't be copied to/from user space.

So it's not something like 0xfffffffffffffff4 or 0x6e69622f7273752f.
It's something like "in slab cache for size 1024".

So the pointer value isn't interesting. But the offset within the slab could be.

See? This is what I am talking about. People don't actually seem to
*think* about what the %p is. There seems to be very little critical
thinking about what should be printed out, and what is actually
useful.

The most common thing seems to be "I'm confused by a bad value". But
that should *not* cause a mindless "let's not hash it" reaction.

It should cause actual thinking about the situation! Not about %p in
general, but very much about the situation of THAT PARTICULAR use of
%p.

That's what I'm looking for, and what I'm not seeing in these discussions.

Linus

Matthew Wilcox

unread,

Dec 19, 2017, 10:51:06 PM12/19/17

to Al Viro, Linus Torvalds, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 09:48:49PM +0000, Al Viro wrote:
> Well, for example seeing a 0xfffffffffffffff4 where a pointer to object
> must have been is a pretty strong hint to start looking for a way for
> that ERR_PTR(-ENOMEM) having ended up there... Something like
> 0x6e69622f7273752f is almost certainly a misplaced "/usr/bin", i.e. a
> pathname overwriting whatever it ends up in, etc. And yes, I have run
> into both of those in real life.
>
> Debugging the situation when crap value has ended up in place of a
> pointer is certainly a case where you do want to see what exactly has
> ended up in there...

Linus, how would you feel about printing ERR_PTRs without molestation?
It's not going to leak any information about the kernel address space
layout. I'm a little less certain about trying to detect ASCII strings,
but I think this is an improvement.

diff --git a/lib/vsprintf.c b/lib/vsprintf.c
index 01c3957b2de6..c80c60b4b3ef 100644
--- a/lib/vsprintf.c
+++ b/lib/vsprintf.c
@@ -1859,6 +1859,9 @@ char *pointer(const char *fmt, char *buf, char *end, void *ptr,
return string(buf, end, "(null)", spec);
}

+ if (IS_ERR(ptr))
+ return pointer_string(buf, end, ptr, spec);
+
switch (*fmt) {
case 'F':
case 'f':

Linus Torvalds

unread,

Dec 19, 2017, 11:05:24 PM12/19/17

to Matthew Wilcox, Al Viro, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 7:50 PM, Matthew Wilcox <wi...@infradead.org> wrote:
> On Tue, Dec 19, 2017 at 09:48:49PM +0000, Al Viro wrote:
>> Well, for example seeing a 0xfffffffffffffff4 where a pointer to object
>> must have been is a pretty strong hint to start looking for a way for
>> that ERR_PTR(-ENOMEM) having ended up there... Something like
>> 0x6e69622f7273752f is almost certainly a misplaced "/usr/bin", i.e. a
>> pathname overwriting whatever it ends up in, etc. And yes, I have run
>> into both of those in real life.
>>
>> Debugging the situation when crap value has ended up in place of a
>> pointer is certainly a case where you do want to see what exactly has
>> ended up in there...
>
> Linus, how would you feel about printing ERR_PTRs without molestation?

Stop this stupidity already.

Guys, seriously. You're making idiotic arguments that have nothing to
do with reality.

If you actually have an invalid pointer that causes an oops (whether
it's an ERR_PTR or something like 0x6e69622f7273752f or something like
the list poison values etc),

WE ALREADY PRINT OUT THE WHOLE UNHASHED POINTER VALUE

This "but but but some pointers are magic and shouldn't be hashed"
stuff is *garbage*. You're making shit up. You don't have a single
actual real-life example that you can point to that is relevant.

Again, I've seen those bad pointer oopses myself. Yes, the magic
values are relevant, and should be printed out.

BUT THEY ALREADY ARE PRINTED OUT.

Christ.

So let me repeat:

- don't change %p behavior.

- don't use "I was confused" as an argument. Yes, things changed, and
yes, it clearly caused confusion, but that is temporary and is not an
argument for making magic changes.

- don't make up some garbage theoretical example: give a hard example
of output that actually didn't have enough information.

And yes, we'll just replace the %p with %px when that last situation
holds. Really. Really really.

But it needs to be a real example, not a "what if" that doesn't make sense.

Not some pet theory that doesn't hold water.

This whole "what if it was a poison pointer" argument is a _prime_
example of pure and utter garbage.

If we have an oops, and it was due a poison value or an err-pointer
that we dereferenced, we will *see* the poison value.

It will be right there in the register state.

It will be right there in the bad address.

It will be quite visible.

And yes, we had a few cases where the hashing actually did hide the
values, and I've been applying patches to turn those from %p to %px.

But it had better be actual real cases, and real thought out
situations. Not "let's just randomly pick values that we don't hash".

Linus

Linus Torvalds

unread,

Dec 19, 2017, 11:36:36 PM12/19/17

to Matthew Wilcox, Al Viro, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

On Tue, Dec 19, 2017 at 8:05 PM, Linus Torvalds
<torv...@linux-foundation.org> wrote:
>
> And yes, we had a few cases where the hashing actually did hide the
> values, and I've been applying patches to turn those from %p to %px.

So far at least:

10a7e9d84915 Do not hash userspace addresses in fault handlers
85c3e4a5a185 mm/slab.c: do not hash pointers when debugging slab
d81041820873 powerpc/xmon: Don't print hashed pointers in xmon
328b4ed93b69 x86: don't hash faulting address in oops printout
b7ad7ef742a9 remove task and stack pointer printout from oops dump
6424f6bb4327 kasan: use %px to print addresses instead of %p

although that next-to-last case is a "remove %p" case rather than
"convert to %px".

And we'll probably hit a few more, I'm not at all claiming that we're
somehow "done". There's bound to be other cases people haven't noticed
yet (or haven't patched yet, like the usercopy case that Kees is
signed up to fix up).

But considering that we had something like 12k of those %p users, I
think a handful now (and maybe a few tens eventually) is worth the
pain and confusion.

I just want to make sure that the ones we _do_ convert we actually
spend the mental effort really looking at, and really asking "does it
make sense to convert this?"

Not just knee-jerking "oh, it's hashed, let's just unhash it".

Linus

David Laight

unread,

Dec 20, 2017, 4:43:50 AM12/20/17

to Al Viro, Linus Torvalds, Matthew Wilcox, Tobin C. Harding, Dmitry Vyukov, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

From: Al Viro
> Sent: 19 December 2017 21:49

I've certainly seen a lot of ascii in pointers (usually because the
previous item has overrun).
Although I suspect they'd appear in the fault frame - which hopefully
carries real addresses.

A compromise would be to hash the 'page' part of the address.
On 64bit systems this is probably about 32 bits.
It would still show whether pointers are user, kernel, vmalloc (etc)
but without giving away the actual value.
The page offset (12 bits) would show the alignment (etc).

Including a per-boot random number would make it harder to generate
'rainbow tables' to reverse the hash.

David

Dmitry Vyukov

unread,

Dec 31, 2017, 3:11:27 AM12/31/17

to David Laight, Al Viro, Linus Torvalds, Matthew Wilcox, Tobin C. Harding, Kees Cook, Tetsuo Handa, Linux-MM, syzbot, David Windsor, keun-...@darkmatter.ae, Laura Abbott, LKML, Mark Rutland, Ingo Molnar, syzkall...@googlegroups.com, Will Deacon

Bad things on kmalloc-1024 are most likely caused by an invalid free
in pcrypt, it freed a pointer into a middle of a 1024 byte heap object
which was undetected by KASAN (now there is a patch for this in mm
tree) and later caused all kinds of bad things:
https://groups.google.com/forum/#!topic/syzkaller-bugs/NKn_ivoPOpk
https://patchwork.kernel.org/patch/10126761/

#syz dup: KASAN: use-after-free Read in __list_del_entry_valid (2)

Reply all

Reply to author

Forward

BUG: bad usercopy in memdup_user

syzbot

Tetsuo Handa

Kees Cook

Dmitry Vyukov

Tobin C. Harding

Dmitry Vyukov

Tobin C. Harding

Dmitry Vyukov

Matthew Wilcox

Dmitry Vyukov

Tetsuo Handa

Dmitry Vyukov

Tobin C. Harding

Tobin C. Harding

Linus Torvalds

Al Viro

Kees Cook

Randy Dunlap

Matthew Wilcox

Laura Abbott

Linus Torvalds

Matthew Wilcox

Linus Torvalds

Linus Torvalds

David Laight

Dmitry Vyukov