BUG: soft lockup (2)

55 views
Skip to first unread message

syzbot

unread,
Dec 10, 2017, 8:32:02 AM12/10/17
to aaro...@intel.com, adob...@gmail.com, ak...@linux-foundation.org, fred...@kernel.org, linux-...@vger.kernel.org, mi...@kernel.org, pet...@infradead.org, syzkall...@googlegroups.com, ying....@intel.com
Hello,

syzkaller hit the following crash on
43f462f1c2e111d2882b48baeeff774ae42e7c56
git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/master
compiler: gcc (GCC) 7.1.1 20170620
.config is attached
Raw console output is attached.

Unfortunately, I don't have any reproducer for this bug yet.


watchdog: BUG: soft lockup - CPU#0 stuck for 134s! [kworker/0:2:1400]
Modules linked in:
irq event stamp: 494028
hardirqs last enabled at (494027): [<ffffffff8516b14a>]
restore_regs_and_return_to_kernel+0x0/0x26
hardirqs last disabled at (494028): [<ffffffff8516c088>]
apic_timer_interrupt+0x98/0xb0 arch/x86/entry/entry_64.S:795
softirqs last enabled at (484570): [<ffffffff85171d23>]
__do_softirq+0x733/0xbb2 kernel/softirq.c:311
softirqs last disabled at (484563): [<ffffffff81426983>] invoke_softirq
kernel/softirq.c:365 [inline]
softirqs last disabled at (484563): [<ffffffff81426983>]
irq_exit+0x1d3/0x210 kernel/softirq.c:405
CPU: 0 PID: 1400 Comm: kworker/0:2 Not tainted 4.15.0-rc1+ #198
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS
Google 01/01/2011
Workqueue: events jump_label_update_timeout
task: ffff8801d2abe540 task.stack: ffff8801d2ac0000
RIP: 0010:rep_nop arch/x86/include/asm/processor.h:636 [inline]
RIP: 0010:cpu_relax arch/x86/include/asm/processor.h:641 [inline]
RIP: 0010:csd_lock_wait kernel/smp.c:108 [inline]
RIP: 0010:smp_call_function_single+0x364/0x560 kernel/smp.c:302
RSP: 0018:ffff8801d2ac6f00 EFLAGS: 00000293 ORIG_RAX: ffffffffffffff11
RAX: ffff8801d2abe540 RBX: 1ffff1003a558de8 RCX: ffffffff8164c2e2
RDX: 0000000000000000 RSI: 00000000000000fb RDI: ffff8801d2ac6ff8
RBP: ffff8801d2ac7050 R08: 1ffff1003a558dff R09: 0000000000000000
R10: ffff8801d2ac7078 R11: 0000000000000000 R12: ffff8801d2ac6ff8
R13: dffffc0000000000 R14: 0000000000000000 R15: ffffed003a558df4
FS: 0000000000000000(0000) GS:ffff8801db400000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000020335ffc CR3: 00000001d0869000 CR4: 00000000001426f0
Call Trace:
smp_call_function_many+0x773/0x930 kernel/smp.c:434
smp_call_function kernel/smp.c:492 [inline]
on_each_cpu+0x3d/0x1b0 kernel/smp.c:602
text_poke_bp+0xe4/0x170 arch/x86/kernel/alternative.c:819
__jump_label_transform.isra.0+0x6a5/0x8a0 arch/x86/kernel/jump_label.c:102
arch_jump_label_transform+0x2f/0x40 arch/x86/kernel/jump_label.c:110
__jump_label_update+0x207/0x2d0 kernel/jump_label.c:368
jump_label_update+0x22c/0x2b0 kernel/jump_label.c:735
static_key_slow_dec_cpuslocked+0x176/0x1d0 kernel/jump_label.c:204
__static_key_slow_dec kernel/jump_label.c:214 [inline]
jump_label_update_timeout+0x1f/0x30 kernel/jump_label.c:222
process_one_work+0xbfd/0x1be0 kernel/workqueue.c:2112
worker_thread+0x223/0x1990 kernel/workqueue.c:2246
kthread+0x37a/0x440 kernel/kthread.c:238
ret_from_fork+0x24/0x30 arch/x86/entry/entry_64.S:441
Code: 00 00 00 fc ff df 44 89 bd 50 ff ff ff 48 c1 e8 03 4c 01 e8 41 83 e7
01 c6 00 f8 74 4e 49 89 c7 49 83 c4 18 e8 4e 25 0b 00 f3 90 <4c> 89 e2 41
c6 07 04 48 c1 ea 03 42 0f b6 14 2a 84 d2 74 09 80
Kernel panic - not syncing: softlockup: hung tasks
CPU: 0 PID: 1400 Comm: kworker/0:2 Tainted: G L 4.15.0-rc1+
#198
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS
Google 01/01/2011
Workqueue: events jump_label_update_timeout
Call Trace:
<IRQ>
__dump_stack lib/dump_stack.c:17 [inline]
dump_stack+0x194/0x257 lib/dump_stack.c:53
panic+0x1e4/0x41c kernel/panic.c:183
watchdog_timer_fn+0x314/0x320 kernel/watchdog.c:443
__run_hrtimer kernel/time/hrtimer.c:1211 [inline]
__hrtimer_run_queues+0x349/0xe10 kernel/time/hrtimer.c:1275
hrtimer_interrupt+0x1d4/0x5f0 kernel/time/hrtimer.c:1309
local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1025 [inline]
smp_apic_timer_interrupt+0x14a/0x700 arch/x86/kernel/apic/apic.c:1050
apic_timer_interrupt+0x9d/0xb0 arch/x86/entry/entry_64.S:795
</IRQ>
RIP: 0010:rep_nop arch/x86/include/asm/processor.h:636 [inline]
RIP: 0010:cpu_relax arch/x86/include/asm/processor.h:641 [inline]
RIP: 0010:csd_lock_wait kernel/smp.c:108 [inline]
RIP: 0010:smp_call_function_single+0x364/0x560 kernel/smp.c:302
RSP: 0018:ffff8801d2ac6f00 EFLAGS: 00000293 ORIG_RAX: ffffffffffffff11
RAX: ffff8801d2abe540 RBX: 1ffff1003a558de8 RCX: ffffffff8164c2e2
RDX: 0000000000000000 RSI: 00000000000000fb RDI: ffff8801d2ac6ff8
RBP: ffff8801d2ac7050 R08: 1ffff1003a558dff R09: 0000000000000000
R10: ffff8801d2ac7078 R11: 0000000000000000 R12: ffff8801d2ac6ff8
R13: dffffc0000000000 R14: 0000000000000000 R15: ffffed003a558df4
smp_call_function_many+0x773/0x930 kernel/smp.c:434
smp_call_function kernel/smp.c:492 [inline]
on_each_cpu+0x3d/0x1b0 kernel/smp.c:602
text_poke_bp+0xe4/0x170 arch/x86/kernel/alternative.c:819
__jump_label_transform.isra.0+0x6a5/0x8a0 arch/x86/kernel/jump_label.c:102
arch_jump_label_transform+0x2f/0x40 arch/x86/kernel/jump_label.c:110
__jump_label_update+0x207/0x2d0 kernel/jump_label.c:368
jump_label_update+0x22c/0x2b0 kernel/jump_label.c:735
static_key_slow_dec_cpuslocked+0x176/0x1d0 kernel/jump_label.c:204
__static_key_slow_dec kernel/jump_label.c:214 [inline]
jump_label_update_timeout+0x1f/0x30 kernel/jump_label.c:222
process_one_work+0xbfd/0x1be0 kernel/workqueue.c:2112
worker_thread+0x223/0x1990 kernel/workqueue.c:2246
kthread+0x37a/0x440 kernel/kthread.c:238
ret_from_fork+0x24/0x30 arch/x86/entry/entry_64.S:441
Dumping ftrace buffer:
(ftrace buffer empty)
Kernel Offset: disabled
Rebooting in 86400 seconds..


---
This bug is generated by a dumb bot. It may contain errors.
See https://goo.gl/tpsmEJ for details.
Direct all questions to syzk...@googlegroups.com.
Please credit me with: Reported-by: syzbot <syzk...@googlegroups.com>

syzbot will keep track of this bug report.
Once a fix for this bug is merged into any tree, reply to this email with:
#syz fix: exact-commit-title
To mark this as a duplicate of another syzbot report, please reply with:
#syz dup: exact-subject-of-another-report
If it's a one-off invalid bug report, please reply with:
#syz invalid
Note: if the crash happens again, it will cause creation of a new bug
report.
Note: all commands must start from beginning of the line in the email body.
config.txt
raw.log

syzbot

unread,
Jan 5, 2018, 12:47:03 PM1/5/18
to aaro...@intel.com, adob...@gmail.com, ak...@linux-foundation.org, alsa-...@alsa-project.org, fred...@kernel.org, linux-...@vger.kernel.org, mi...@kernel.org, o-ta...@sakamocchi.jp, pe...@perex.cz, pet...@infradead.org, syzkall...@googlegroups.com, ti...@suse.com, ying....@intel.com
syzkaller has found reproducer for the following crash on
e1915c8195b38393005be9b74bfa6a3a367c83b3
git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/master
compiler: gcc (GCC) 7.1.1 20170620
.config is attached
Raw console output is attached.
C reproducer is attached
syzkaller reproducer is attached. See https://goo.gl/kgGztJ
for information about syzkaller reproducers


IMPORTANT: if you fix the bug, please add the following tag to the commit:
Reported-by:
syzbot+f76f3c62dfadce02...@syzkaller.appspotmail.com
It will help syzbot understand when the bug is fixed.

watchdog: BUG: soft lockup - CPU#0 stuck for 135s! [syzkaller670324:3527]
Modules linked in:
irq event stamp: 2531226
hardirqs last enabled at (2531225): [<00000000f1ec093f>]
snd_pcm_stream_unlock_irq+0x78/0xe0 sound/core/pcm_native.c:166
hardirqs last disabled at (2531226): [<000000003c6ef1cd>]
apic_timer_interrupt+0xa4/0xb0 arch/x86/entry/entry_64.S:920
softirqs last enabled at (41848): [<0000000081bd5f03>]
__do_softirq+0x7a0/0xb85 kernel/softirq.c:311
softirqs last disabled at (41829): [<00000000d02c6d52>] invoke_softirq
kernel/softirq.c:365 [inline]
softirqs last disabled at (41829): [<00000000d02c6d52>]
irq_exit+0x1cc/0x200 kernel/softirq.c:405
CPU: 0 PID: 3527 Comm: syzkaller670324 Not tainted 4.15.0-rc6+ #158
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS
Google 01/01/2011
RIP: 0010:memcpy+0x45/0x50 mm/kasan/kasan.c:305
RSP: 0018:ffff8801bf6676f0 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff11
RAX: ffffc9000137ba06 RBX: 0000000000000002 RCX: 0000000000000000
RDX: 0000000000000002 RSI: ffff8801bf6677da RDI: ffffc9000137ba08
RBP: ffff8801bf667708 R08: fffff5200026f741 R09: fffff5200026f741
R10: 0000000000000001 R11: fffff5200026f740 R12: ffffc9000137ba06
R13: ffff8801bf6677d8 R14: dffffc0000000000 R15: ffffc9000137ba06
FS: 0000000000000000(0000) GS:ffff8801db200000(0063) knlGS:00000000f7ec6b40
CS: 0010 DS: 002b ES: 002b CR0: 0000000080050033
CR2: 0000000020735ee0 CR3: 00000001bfba8002 CR4: 00000000001606f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
memcpy include/linux/string.h:344 [inline]
cvt_s16_to_native sound/core/oss/mulaw.c:164 [inline]
mulaw_decode+0x52f/0x770 sound/core/oss/mulaw.c:195
mulaw_transfer+0x222/0x270 sound/core/oss/mulaw.c:273
snd_pcm_plug_write_transfer+0x22d/0x420 sound/core/oss/pcm_plugin.c:611
snd_pcm_oss_write2+0x260/0x420 sound/core/oss/pcm_oss.c:1311
snd_pcm_oss_write1 sound/core/oss/pcm_oss.c:1372 [inline]
snd_pcm_oss_write+0x5fe/0x830 sound/core/oss/pcm_oss.c:2646
__vfs_write+0xef/0x970 fs/read_write.c:480
vfs_write+0x189/0x510 fs/read_write.c:544
SYSC_write fs/read_write.c:589 [inline]
SyS_write+0xef/0x220 fs/read_write.c:581
do_syscall_32_irqs_on arch/x86/entry/common.c:327 [inline]
do_fast_syscall_32+0x3ee/0xf9d arch/x86/entry/common.c:389
entry_SYSENTER_compat+0x54/0x63 arch/x86/entry/entry_64_compat.S:129
RIP: 0023:0xf7f0cc79
RSP: 002b:00000000f7ec61fc EFLAGS: 00000246 ORIG_RAX: 0000000000000004
RAX: ffffffffffffffda RBX: 0000000000000005 RCX: 0000000020735ee0
RDX: 00000000fffffee4 RSI: 0000000000000000 RDI: 0000000000000000
RBP: 00000000003d0f00 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
Code: 89 de 31 d2 e8 9d f9 ff ff 48 8b 4d 08 48 89 de 4c 89 e7 ba 01 00 00
00 e8 89 f9 ff ff 48 89 da 4c 89 ee 4c 89 e7 e8 7b eb c0 03 <5b> 41 5c 41
5d 5d c3 0f 1f 40 00 89 f1 b8 00 10 00 00 55 48 d3

config.txt
raw.log
repro.txt
repro.c

Eric Biggers

unread,
Jan 10, 2018, 2:46:14 AM1/10/18
to syzbot, Takashi Iwai, aaro...@intel.com, adob...@gmail.com, ak...@linux-foundation.org, alsa-...@alsa-project.org, fred...@kernel.org, linux-...@vger.kernel.org, mi...@kernel.org, o-ta...@sakamocchi.jp, pe...@perex.cz, pet...@infradead.org, syzkall...@googlegroups.com, ti...@suse.com, ying....@intel.com
Seems that this is fixed in sound/for-linus by:

#syz fix: ALSA: pcm: Abort properly at pending signal in OSS read/write loops
Reply all
Reply to author
Forward
0 new messages