[v5.15] INFO: rcu detected stall in batadv_purge_orig

0 views
Skip to first unread message

syzbot

unread,
Dec 6, 2023, 1:55:26 PM12/6/23
to syzkaller...@googlegroups.com
Hello,

syzbot found the following issue on:

HEAD commit: 9b91d36ba301 Linux 5.15.141
git tree: linux-5.15.y
console output: https://syzkaller.appspot.com/x/log.txt?x=15fdda02e80000
kernel config: https://syzkaller.appspot.com/x/.config?x=31f4c22724493853
dashboard link: https://syzkaller.appspot.com/bug?extid=eea665fc50931e17c799
compiler: Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40

Unfortunately, I don't have any reproducer for this issue yet.

Downloadable assets:
disk image: https://storage.googleapis.com/syzbot-assets/5fc81554d3d4/disk-9b91d36b.raw.xz
vmlinux: https://storage.googleapis.com/syzbot-assets/191387cf41a2/vmlinux-9b91d36b.xz
kernel image: https://storage.googleapis.com/syzbot-assets/8d074d8a66a4/bzImage-9b91d36b.xz

IMPORTANT: if you fix the issue, please add the following tag to the commit:
Reported-by: syzbot+eea665...@syzkaller.appspotmail.com

rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
rcu: 0-...!: (0 ticks this GP) idle=b49/1/0x4000000000000000 softirq=76560/76560 fqs=0
(detected by 1, t=10506 jiffies, g=114989, q=90)
Sending NMI from CPU 1 to CPUs 0:
NMI backtrace for cpu 0
CPU: 0 PID: 12855 Comm: kworker/u4:30 Not tainted 5.15.141-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 11/10/2023
Workqueue: bat_events batadv_purge_orig
RIP: 0010:check_preemption_disabled+0x49/0x110 lib/smp_processor_id.c:55
Code: 75 65 8b 05 31 83 e6 75 a9 ff ff ff 7f 74 22 65 48 8b 04 25 28 00 00 00 48 3b 44 24 08 0f 85 c7 00 00 00 89 d8 48 83 c4 10 5b <41> 5c 41 5e 41 5f c3 48 c7 04 24 00 00 00 00 9c 8f 04 24 f7 04 24
RSP: 0018:ffffc90000007c58 EFLAGS: 00000086
RAX: 0000000000000000 RBX: 0000000000000002 RCX: ffff88801bf75940
RDX: ffff88801bf75940 RSI: ffffffff8a8b2000 RDI: ffffffff8ad87d40
RBP: 0000000000000001 R08: ffffffff886acba7 R09: 0000000000000003
R10: ffffffffffffffff R11: dffffc0000000001 R12: 0000000000000046
R13: ffff88801bf75940 R14: 00000000ffffffff R15: ffff88801e870300
FS: 0000000000000000(0000) GS:ffff8880b9a00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000001b30622000 CR3: 000000007b6f6000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
<NMI>
</NMI>
<IRQ>
lockdep_recursion_finish kernel/locking/lockdep.c:436 [inline]
lock_is_held_type+0xfd/0x180 kernel/locking/lockdep.c:5667
lock_is_held include/linux/lockdep.h:287 [inline]
advance_sched+0x69/0x940 net/sched/sch_taprio.c:717
__run_hrtimer kernel/time/hrtimer.c:1685 [inline]
__hrtimer_run_queues+0x598/0xcf0 kernel/time/hrtimer.c:1749
hrtimer_interrupt+0x392/0x980 kernel/time/hrtimer.c:1811
local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1085 [inline]
__sysvec_apic_timer_interrupt+0x139/0x470 arch/x86/kernel/apic/apic.c:1102
sysvec_apic_timer_interrupt+0x8c/0xb0 arch/x86/kernel/apic/apic.c:1096
</IRQ>
<TASK>
asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:638
RIP: 0010:lock_acquire+0x1/0x4f0 kernel/locking/lockdep.c:5591
Code: fd ff ff 48 c7 c7 08 c9 e3 8d e8 1a 61 67 00 e9 a2 fd ff ff 0f 1f 44 00 00 65 8b 05 61 f1 9f 7e a9 00 ff ff 00 0f 95 c0 c3 55 <48> 89 e5 41 57 41 56 41 55 41 54 53 48 83 e4 e0 48 81 ec 20 01 00
RSP: 0018:ffffc9000638fab0 EFLAGS: 00000246
RAX: b9b640943c46c300 RBX: ffffffff89fa1ec4 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff888030bc5f18
RBP: ffffc9000638fc40 R08: 0000000000000001 R09: 0000000000000000
R10: 0000000000000000 R11: dffffc0000000001 R12: dffffc0000000000
R13: ffff88801aecde28 R14: ffff888030bc5f00 R15: ffff88801aecde18
__raw_spin_lock_bh include/linux/spinlock_api_smp.h:135 [inline]
_raw_spin_lock_bh+0x31/0x40 kernel/locking/spinlock.c:178
spin_lock_bh include/linux/spinlock.h:368 [inline]
batadv_purge_orig_ref+0x1b4/0x15a0 net/batman-adv/originator.c:1243
batadv_purge_orig+0x15/0x60 net/batman-adv/originator.c:1272
process_one_work+0x8a1/0x10c0 kernel/workqueue.c:2310
worker_thread+0xaca/0x1280 kernel/workqueue.c:2457
kthread+0x3f6/0x4f0 kernel/kthread.c:319
ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:298
</TASK>
rcu: rcu_preempt kthread timer wakeup didn't happen for 10505 jiffies! g114989 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402
rcu: Possible timer handling issue on cpu=0 timer-softirq=92271
rcu: rcu_preempt kthread starved for 10506 jiffies! g114989 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=0
rcu: Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt state:I stack:27000 pid: 15 ppid: 2 flags:0x00004000
Call Trace:
<TASK>
context_switch kernel/sched/core.c:5030 [inline]
__schedule+0x12c4/0x45b0 kernel/sched/core.c:6376
schedule+0x11b/0x1f0 kernel/sched/core.c:6459
schedule_timeout+0x1b9/0x300 kernel/time/timer.c:1884
rcu_gp_fqs_loop+0x2bf/0x1080 kernel/rcu/tree.c:1972
rcu_gp_kthread+0xa4/0x360 kernel/rcu/tree.c:2145
kthread+0x3f6/0x4f0 kernel/kthread.c:319
ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:298
</TASK>
rcu: Stack dump where RCU GP kthread last ran:
Sending NMI from CPU 1 to CPUs 0:
NMI backtrace for cpu 0
CPU: 0 PID: 12855 Comm: kworker/u4:30 Not tainted 5.15.141-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 11/10/2023
Workqueue: bat_events batadv_purge_orig
RIP: 0010:bytes_is_nonzero mm/kasan/generic.c:84 [inline]
RIP: 0010:memory_is_nonzero mm/kasan/generic.c:102 [inline]
RIP: 0010:memory_is_poisoned_n mm/kasan/generic.c:128 [inline]
RIP: 0010:memory_is_poisoned mm/kasan/generic.c:159 [inline]
RIP: 0010:check_region_inline mm/kasan/generic.c:180 [inline]
RIP: 0010:kasan_check_range+0x68/0x290 mm/kasan/generic.c:189
Code: fc ff df 4e 8d 0c 03 4c 8d 54 37 ff 49 c1 ea 03 49 bb 01 00 00 00 00 fc ff df 4f 8d 34 1a 4c 89 f5 4c 29 cd 48 83 fd 10 7f 26 <48> 85 ed 0f 84 3a 01 00 00 49 f7 d2 49 01 da 41 80 39 00 0f 85 c4
RSP: 0018:ffffc90000007a58 EFLAGS: 00000083
RAX: 0000000000000001 RBX: 1ffffffff1f79c19 RCX: ffffffff816296a9
RDX: 0000000000000000 RSI: 0000000000000008 RDI: ffffffff8fbce0c8
RBP: 0000000000000001 R08: dffffc0000000000 R09: fffffbfff1f79c19
R10: 1ffffffff1f79c19 R11: dffffc0000000001 R12: 0000000000000002
R13: ffff88801bf76428 R14: fffffbfff1f79c1a R15: ffff88801bf764a0
FS: 0000000000000000(0000) GS:ffff8880b9a00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000001b30622000 CR3: 000000007b6f6000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
<NMI>
</NMI>
<IRQ>
instrument_atomic_read include/linux/instrumented.h:71 [inline]
test_bit include/asm-generic/bitops/instrumented-non-atomic.h:134 [inline]
hlock_class kernel/locking/lockdep.c:197 [inline]
__lock_acquire+0x1209/0x1ff0 kernel/locking/lockdep.c:5009
lock_acquire+0x1db/0x4f0 kernel/locking/lockdep.c:5623
__raw_spin_lock_irq include/linux/spinlock_api_smp.h:128 [inline]
_raw_spin_lock_irq+0xcf/0x110 kernel/locking/spinlock.c:170
__run_hrtimer kernel/time/hrtimer.c:1689 [inline]
__hrtimer_run_queues+0x662/0xcf0 kernel/time/hrtimer.c:1749
hrtimer_interrupt+0x392/0x980 kernel/time/hrtimer.c:1811
local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1085 [inline]
__sysvec_apic_timer_interrupt+0x139/0x470 arch/x86/kernel/apic/apic.c:1102
sysvec_apic_timer_interrupt+0x8c/0xb0 arch/x86/kernel/apic/apic.c:1096
</IRQ>
<TASK>
asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:638
RIP: 0010:lock_acquire+0x1/0x4f0 kernel/locking/lockdep.c:5591
Code: fd ff ff 48 c7 c7 08 c9 e3 8d e8 1a 61 67 00 e9 a2 fd ff ff 0f 1f 44 00 00 65 8b 05 61 f1 9f 7e a9 00 ff ff 00 0f 95 c0 c3 55 <48> 89 e5 41 57 41 56 41 55 41 54 53 48 83 e4 e0 48 81 ec 20 01 00
RSP: 0018:ffffc9000638fab0 EFLAGS: 00000246
RAX: b9b640943c46c300 RBX: ffffffff89fa1ec4 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff888030bc5f18
RBP: ffffc9000638fc40 R08: 0000000000000001 R09: 0000000000000000
R10: 0000000000000000 R11: dffffc0000000001 R12: dffffc0000000000
R13: ffff88801aecde28 R14: ffff888030bc5f00 R15: ffff88801aecde18
__raw_spin_lock_bh include/linux/spinlock_api_smp.h:135 [inline]
_raw_spin_lock_bh+0x31/0x40 kernel/locking/spinlock.c:178
spin_lock_bh include/linux/spinlock.h:368 [inline]
batadv_purge_orig_ref+0x1b4/0x15a0 net/batman-adv/originator.c:1243
batadv_purge_orig+0x15/0x60 net/batman-adv/originator.c:1272
process_one_work+0x8a1/0x10c0 kernel/workqueue.c:2310
worker_thread+0xaca/0x1280 kernel/workqueue.c:2457
kthread+0x3f6/0x4f0 kernel/kthread.c:319
ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:298
</TASK>


---
This report is generated by a bot. It may contain errors.
See https://goo.gl/tpsmEJ for more information about syzbot.
syzbot engineers can be reached at syzk...@googlegroups.com.

syzbot will keep track of this issue. See:
https://goo.gl/tpsmEJ#status for how to communicate with syzbot.

If the report is already addressed, let syzbot know by replying with:
#syz fix: exact-commit-title

If you want to overwrite report's subsystems, reply with:
#syz set subsystems: new-subsystem
(See the list of subsystem names on the web dashboard)

If the report is a duplicate of another one, reply with:
#syz dup: exact-subject-of-another-report

If you want to undo deduplication, reply with:
#syz undup
Reply all
Reply to author
Forward
0 new messages