[moderation] [perf?] BUG: soft lockup in xfrm_timer_handler (2)

1 view
Skip to first unread message

syzbot

unread,
5:12 AM (8 hours ago) 5:12 AM
to syzkaller-upst...@googlegroups.com
Hello,

syzbot found the following issue on:

HEAD commit: 81f88f6ab674 libbpf: Add debug messaging in dedup equivale..
git tree: bpf-next
console output: https://syzkaller.appspot.com/x/log.txt?x=135c0eb4580000
kernel config: https://syzkaller.appspot.com/x/.config?x=9e5198eaf003f1d1
dashboard link: https://syzkaller.appspot.com/bug?extid=3439e38c37aa777d590a
compiler: Debian clang version 20.1.8 (++20250708063551+0c9f909b7976-1~exp1~20250708183702.136), Debian LLD 20.1.8
CC: [ac...@kernel.org adrian...@intel.com alexander...@linux.intel.com iro...@google.com james...@linaro.org jo...@kernel.org linux-...@vger.kernel.org linux-pe...@vger.kernel.org mark.r...@arm.com mi...@redhat.com namh...@kernel.org net...@vger.kernel.org pet...@infradead.org]

Unfortunately, I don't have any reproducer for this issue yet.

Downloadable assets:
disk image: https://storage.googleapis.com/syzbot-assets/cc29c7f8f7ae/disk-81f88f6a.raw.xz
vmlinux: https://storage.googleapis.com/syzbot-assets/7c97e72415a5/vmlinux-81f88f6a.xz
kernel image: https://storage.googleapis.com/syzbot-assets/58cf94e0fd45/bzImage-81f88f6a.xz

IMPORTANT: if you fix the issue, please add the following tag to the commit:
Reported-by: syzbot+3439e3...@syzkaller.appspotmail.com

watchdog: BUG: soft lockup - CPU#1 stuck for 143s! [syz.6.1593:12416]
Modules linked in:
irq event stamp: 14963491
hardirqs last enabled at (14963490): [<ffffffff8b494f9d>] irqentry_exit+0x5dd/0x660 kernel/entry/common.c:219
hardirqs last disabled at (14963491): [<ffffffff8b49380e>] sysvec_apic_timer_interrupt+0xe/0xc0 arch/x86/kernel/apic/apic.c:1056
softirqs last enabled at (3059222): [<ffffffff8185372a>] __do_softirq kernel/softirq.c:656 [inline]
softirqs last enabled at (3059222): [<ffffffff8185372a>] invoke_softirq kernel/softirq.c:496 [inline]
softirqs last enabled at (3059222): [<ffffffff8185372a>] __irq_exit_rcu+0xca/0x1f0 kernel/softirq.c:723
softirqs last disabled at (3059225): [<ffffffff8185372a>] __do_softirq kernel/softirq.c:656 [inline]
softirqs last disabled at (3059225): [<ffffffff8185372a>] invoke_softirq kernel/softirq.c:496 [inline]
softirqs last disabled at (3059225): [<ffffffff8185372a>] __irq_exit_rcu+0xca/0x1f0 kernel/softirq.c:723
CPU: 1 UID: 0 PID: 12416 Comm: syz.6.1593 Not tainted syzkaller #0 PREEMPT(full)
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 10/25/2025
RIP: 0010:perf_swevent_event+0x587/0x5e0 kernel/events/core.c:10577
Code: ce ff eb 1d e8 7a e2 ce ff eb 0c e8 73 e2 ce ff eb 05 e8 6c e2 ce ff 49 bd 00 00 00 00 00 fc ff df 48 c7 44 24 40 0e 36 e0 45 <4a> c7 04 2b 00 00 00 00 65 48 8b 05 09 88 84 10 48 3b 84 24 80 00
RSP: 0018:ffffc90000a07e20 EFLAGS: 00000246
RAX: ffffffff81f26393 RBX: 1ffff92000140fcc RCX: ffff888055045b80
RDX: 0000000000000100 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffffc90000a07ef0 R08: ffffc90000a07f97 R09: 0000000000000000
R10: ffffc90000a07f88 R11: fffff52000140ff3 R12: 0000000000000001
R13: dffffc0000000000 R14: 0000000000000000 R15: ffff888076682bf8
FS: 00007fd8870e56c0(0000) GS:ffff8881261b1000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007f3a6ed5fe9c CR3: 00000000763a8000 CR4: 00000000003526f0
DR0: 0000200000000300 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000600
Call Trace:
<IRQ>
perf_tp_event+0x4f6/0x1380 kernel/events/core.c:11083
perf_trace_run_bpf_submit+0xee/0x170 kernel/events/core.c:11007
do_perf_trace_lock include/trace/events/lock.h:50 [inline]
perf_trace_lock+0x2f8/0x3b0 include/trace/events/lock.h:50
__do_trace_lock_release include/trace/events/lock.h:69 [inline]
trace_lock_release include/trace/events/lock.h:69 [inline]
lock_release+0x389/0x3b0 kernel/locking/lockdep.c:5879
rcu_lock_release include/linux/rcupdate.h:341 [inline]
rcu_read_unlock include/linux/rcupdate.h:897 [inline]
class_rcu_destructor include/linux/rcupdate.h:1195 [inline]
unwind_next_frame+0x19a9/0x2390 arch/x86/kernel/unwind_orc.c:680
arch_stack_walk+0x11c/0x150 arch/x86/kernel/stacktrace.c:25
stack_trace_save+0x9c/0xe0 kernel/stacktrace.c:122
kasan_save_stack mm/kasan/common.c:56 [inline]
kasan_save_track+0x3e/0x80 mm/kasan/common.c:77
unpoison_slab_object mm/kasan/common.c:342 [inline]
__kasan_slab_alloc+0x6c/0x80 mm/kasan/common.c:368
kasan_slab_alloc include/linux/kasan.h:252 [inline]
slab_post_alloc_hook mm/slub.c:4948 [inline]
slab_alloc_node mm/slub.c:5258 [inline]
kmem_cache_alloc_node_noprof+0x433/0x710 mm/slub.c:5310
kmalloc_reserve+0xbd/0x290 net/core/skbuff.c:586
__alloc_skb+0x27e/0x430 net/core/skbuff.c:690
alloc_skb include/linux/skbuff.h:1383 [inline]
xfrm_alloc_compat+0x1a6/0x16f0 net/xfrm/xfrm_compat.c:348
xfrm_nlmsg_multicast+0xda/0x1f0 net/xfrm/xfrm_user.c:1584
xfrm_exp_state_notify net/xfrm/xfrm_user.c:3594 [inline]
xfrm_send_state_notify+0x11f1/0x18b0 net/xfrm/xfrm_user.c:3761
km_state_notify+0x110/0x1f0 net/xfrm/xfrm_state.c:2751
km_state_expired net/xfrm/xfrm_state.c:2765 [inline]
xfrm_timer_handler+0x288/0x990 net/xfrm/xfrm_state.c:720
__run_hrtimer kernel/time/hrtimer.c:1777 [inline]
__hrtimer_run_queues+0x51c/0xc30 kernel/time/hrtimer.c:1841
hrtimer_run_softirq+0x187/0x2b0 kernel/time/hrtimer.c:1858
handle_softirqs+0x27d/0x850 kernel/softirq.c:622
__do_softirq kernel/softirq.c:656 [inline]
invoke_softirq kernel/softirq.c:496 [inline]
__irq_exit_rcu+0xca/0x1f0 kernel/softirq.c:723
irq_exit_rcu+0x9/0x30 kernel/softirq.c:739
instr_sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1056 [inline]
sysvec_apic_timer_interrupt+0xa6/0xc0 arch/x86/kernel/apic/apic.c:1056
</IRQ>
<TASK>
asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:697
RIP: 0010:preempt_schedule_irq+0xb0/0x150 kernel/sched/core.c:7190
Code: 24 20 f6 44 24 21 02 74 0c 90 0f 0b 48 f7 03 10 00 00 00 74 64 bf 01 00 00 00 e8 cb 4d 46 f6 e8 b6 93 7e f6 fb bf 01 00 00 00 <e8> 1b a8 ff ff 48 c7 44 24 40 00 00 00 00 9c 8f 44 24 40 8b 44 24
RSP: 0018:ffffc90004d2fa40 EFLAGS: 00000286
RAX: 116723ba34647800 RBX: 0000000000000000 RCX: 116723ba34647800
RDX: 0000000000000007 RSI: ffffffff8d76bd39 RDI: 0000000000000001
RBP: ffffc90004d2fae0 R08: ffffffff8f805a77 R09: 1ffffffff1f00b4e
R10: dffffc0000000000 R11: fffffbfff1f00b4f R12: 0000000000000000
R13: 0000000000000000 R14: dffffc0000000000 R15: 1ffff920009a5f48
irqentry_exit+0x5d8/0x660 kernel/entry/common.c:216
asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:697
RIP: 0010:__phys_addr+0x2d/0x180 arch/x86/mm/physaddr.c:17
Code: fa 41 57 41 56 53 48 89 fb 49 bf 00 00 00 00 00 fc ff df e8 15 8d 4b 00 48 bf ff ff ff 7f ff ff ff ff 48 89 de e8 e3 91 4b 00 <49> 89 de 49 81 ee 00 00 00 80 0f 83 a4 00 00 00 48 c7 c0 20 35 98
RSP: 0018:ffffc90004d2fbf8 EFLAGS: 00000297
RAX: ffffffff8175bd5d RBX: ffff888045d06640 RCX: ffff888055045b80
RDX: 0000000000000002 RSI: ffff888045d06640 RDI: ffffffff7fffffff
RBP: ffff888045d06640 R08: 0000000000000001 R09: ffffffff8227414c
R10: dffffc0000000000 R11: fffffbfff1f00b4f R12: ffffea0000000000
R13: 0000000000000000 R14: 0000000000000000 R15: dffffc0000000000
virt_to_slab mm/slab.h:178 [inline]
qlink_to_cache mm/kasan/quarantine.c:131 [inline]
qlist_free_all+0x39/0x100 mm/kasan/quarantine.c:176
kasan_quarantine_reduce+0x148/0x160 mm/kasan/quarantine.c:286
__kasan_slab_alloc+0x22/0x80 mm/kasan/common.c:352
kasan_slab_alloc include/linux/kasan.h:252 [inline]
slab_post_alloc_hook mm/slub.c:4948 [inline]
slab_alloc_node mm/slub.c:5258 [inline]
kmem_cache_alloc_noprof+0x367/0x6f0 mm/slub.c:5265
alloc_empty_file+0x55/0x1d0 fs/file_table.c:237
alloc_file fs/file_table.c:354 [inline]
alloc_file_pseudo+0x13d/0x210 fs/file_table.c:383
sock_alloc_file+0xb8/0x2e0 net/socket.c:483
sock_map_fd net/socket.c:508 [inline]
__sys_socket+0x13e/0x320 net/socket.c:1747
__do_sys_socket net/socket.c:1752 [inline]
__se_sys_socket net/socket.c:1750 [inline]
__x64_sys_socket+0x7a/0x90 net/socket.c:1750
do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
do_syscall_64+0xfa/0xf80 arch/x86/entry/syscall_64.c:94
entry_SYSCALL_64_after_hwframe+0x77/0x7f
RIP: 0033:0x7fd88618f749
Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007fd8870e5038 EFLAGS: 00000246 ORIG_RAX: 0000000000000029
RAX: ffffffffffffffda RBX: 00007fd8863e5fa0 RCX: 00007fd88618f749
RDX: 0000000000000015 RSI: 0000000000000003 RDI: 0000000000000010
RBP: 00007fd886213f91 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
R13: 00007fd8863e6038 R14: 00007fd8863e5fa0 R15: 00007fffc7a48588
</TASK>
Sending NMI from CPU 1 to CPUs 0:
NMI backtrace for cpu 0
CPU: 0 UID: 0 PID: 11004 Comm: kworker/u8:33 Not tainted syzkaller #0 PREEMPT(full)
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 10/25/2025
Workqueue: events_unbound toggle_allocation_gate
RIP: 0010:csd_lock_wait kernel/smp.c:342 [inline]
RIP: 0010:smp_call_function_many_cond+0xce0/0x12b0 kernel/smp.c:877
Code: 41 83 e5 01 49 bd 00 00 00 00 00 fc ff df 75 07 e8 f5 8f 0b 00 eb 38 f3 90 42 0f b6 04 2b 84 c0 75 11 41 f7 04 24 01 00 00 00 <74> 1e e8 d9 8f 0b 00 eb e4 44 89 e1 80 e1 07 80 c1 03 38 c1 7c e2
RSP: 0018:ffffc9000ba4f620 EFLAGS: 00000202
RAX: 0000000000000000 RBX: 1ffff11017128101 RCX: ffff88802d4b9e80
RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000000
RBP: ffffc9000ba4f7a0 R08: ffffffff8f805a77 R09: 1ffffffff1f00b4e
R10: dffffc0000000000 R11: fffffbfff1f00b4f R12: ffff8880b8940808
R13: dffffc0000000000 R14: ffff8880b883b9c0 R15: 0000000000000001
FS: 0000000000000000(0000) GS:ffff8881260b1000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007efc0dfcb6b0 CR3: 000000000dd3a000 CR4: 00000000003526f0
DR0: 0000200000000300 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000600
Call Trace:
<TASK>
on_each_cpu_cond_mask+0x3f/0x80 kernel/smp.c:1043
on_each_cpu include/linux/smp.h:71 [inline]
smp_text_poke_sync_each_cpu arch/x86/kernel/alternative.c:2711 [inline]
smp_text_poke_batch_finish+0x5f9/0x1130 arch/x86/kernel/alternative.c:2921
arch_jump_label_transform_apply+0x1c/0x30 arch/x86/kernel/jump_label.c:146
static_key_enable_cpuslocked+0x128/0x240 kernel/jump_label.c:210
static_key_enable+0x1a/0x20 kernel/jump_label.c:223
toggle_allocation_gate+0xad/0x240 mm/kfence/core.c:854
process_one_work kernel/workqueue.c:3257 [inline]
process_scheduled_works+0xad1/0x1770 kernel/workqueue.c:3340
worker_thread+0x8a0/0xda0 kernel/workqueue.c:3421
kthread+0x711/0x8a0 kernel/kthread.c:463
ret_from_fork+0x599/0xb30 arch/x86/kernel/process.c:158
ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
</TASK>


---
This report is generated by a bot. It may contain errors.
See https://goo.gl/tpsmEJ for more information about syzbot.
syzbot engineers can be reached at syzk...@googlegroups.com.

syzbot will keep track of this issue. See:
https://goo.gl/tpsmEJ#status for how to communicate with syzbot.

If the report is already addressed, let syzbot know by replying with:
#syz fix: exact-commit-title

If you want to overwrite report's subsystems, reply with:
#syz set subsystems: new-subsystem
(See the list of subsystem names on the web dashboard)

If the report is a duplicate of another one, reply with:
#syz dup: exact-subject-of-another-report

If you want to undo deduplication, reply with:
#syz undup
Reply all
Reply to author
Forward
0 new messages