Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Bug#986837: aoe: kernel crash on blk_update_request: I/O error, BUG: scheduling while atomic

32 views
Skip to first unread message

Valentin Kleibel

unread,
Apr 12, 2021, 1:20:04 PM4/12/21
to
Package: linux-image-amd64
Version: 5.10.26-1
Source: linux

Dear Maintainers,

It seems we found a race condition in the aoe driver that leads to a
kernel crash. It is triggered when an aoe device is unavailable and
therefore produces an I/O error in the code that tries to remove the
device. (drivers/block/aoe/aoedev.c: aoedev_downdev)

example process to reproduce:
* add an aoe-target to a lvm2 volume group
* make the aoe target unavailable (e.g. set the network dev down) but
don't flush it
* run a command that scans all physical volumes, e.g. 'vgs'
* wait for aoe to time out (default for aoe_deadsecs is 180s)

result: kernel crash
relevant dmesg output:
[....]
[ 183.855191] mlx4_en: enp65s0d1: Close port called
[ 183.931534] mlx4_en: enp65s0d1: Link Down
[ 408.620155] blk_update_request: I/O error, dev etherd/e42.0, sector
4096 op 0x0:(READ) flags 0x0 phys_seg 2 prio class 0
[ 408.620235] blk_update_request: I/O error, dev etherd/e42.0, sector 0
op 0x0:(READ) flags 0x0 phys_seg 2 prio class 0
[ 408.620290] BUG: scheduling while atomic: swapper/16/0/0x00000100
[ 408.620325] Modules linked in: sctp bridge 8021q garp stp mrp llc
psmouse dlm configfs aoe ipmi_ssif amd64_edac_mod edac_mce_amd
amd_energy kvm_amd kvm irqbypass ghash_clmulni_intel aesni_intel libaes
crypto_simd cryptd glue_helper rapl pcspkr ast drm_vram_helper
drm_ttm_helper ttm drm_kms_helper cec drm evdev joydev ccp sg sp5100_tco
rng_core watchdog k10temp acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler
acpi_cpufreq button ext4 crc16 mbcache jbd2 dm_mod raid10 raid456
async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq
libcrc32c crc32c_generic raid0 multipath linear mlx4_ib ib_uverbs
mlx4_en raid1 md_mod sd_mod t10_pi crc_t10dif crct10dif_generic ib_core
hid_generic usbhid hid crct10dif_pclmul crct10dif_common crc32_pclmul
crc32c_intel xhci_pci igb mpt3sas xhci_hcd ahci libahci i2c_algo_bit dca
ptp libata pps_core raid_class usbcore scsi_transport_sas mlx4_core
scsi_mod i2c_piix4 usb_common
[ 408.620422] CPU: 16 PID: 0 Comm: swapper/16 Not tainted
5.10.0-5-amd64 #1 Debian 5.10.26-1
[ 408.620424] Hardware name: Supermicro AS -2013S-C0R/H11SSL-C, BIOS
2.1 02/21/2020
[ 408.620425] Call Trace:
[ 408.620428] <IRQ>
[ 408.620437] dump_stack+0x6b/0x83
[ 408.620442] __schedule_bug.cold+0x4c/0x58
[ 408.620446] __schedule+0x719/0x870
[ 408.620449] schedule+0x46/0xb0
[ 408.620453] blk_mq_freeze_queue_wait+0x62/0x90
[ 408.620458] ? add_wait_queue_exclusive+0x70/0x70
[ 408.620466] aoedev_downdev+0x106/0x150 [aoe]
[ 408.620471] rexmit_timer+0x4ea/0x500 [aoe]
[ 408.620476] ? rexmit_deferred+0x380/0x380 [aoe]
[ 408.620480] call_timer_fn+0x29/0xf0
[ 408.620483] __run_timers.part.0+0x1d3/0x240
[ 408.620485] ? ktime_get+0x38/0xa0
[ 408.620488] ? lapic_next_event+0x1d/0x20
[ 408.620491] ? clockevents_program_event+0x8d/0xf0
[ 408.620494] run_timer_softirq+0x26/0x50
[ 408.620496] __do_softirq+0xc5/0x275
[ 408.620499] asm_call_irq_on_stack+0x12/0x20
[ 408.620501] </IRQ>
[ 408.620505] do_softirq_own_stack+0x37/0x40
[ 408.620509] irq_exit_rcu+0x8e/0xc0
[ 408.620512] sysvec_apic_timer_interrupt+0x36/0x80
[ 408.620515] asm_sysvec_apic_timer_interrupt+0x12/0x20
[ 408.620520] RIP: 0010:cpuidle_enter_state+0xc7/0x350
[ 408.620523] Code: 8b 3d dd 5b b7 6b e8 d8 4f a2 ff 49 89 c5 0f 1f 44
00 00 31 ff e8 69 5a a2 ff 45 84 ff 0f 85 fa 00 00 00 fb 66 0f 1f 44 00
00 <45> 85 f6 0f 88 06 01 00 00 49 63 c6 4c 2b 2c 24 48 8d 14 40 48 8d
[ 408.620525] RSP: 0018:ffffba890038fea8 EFLAGS: 00000246
[ 408.620527] RAX: ffff9c18afc2bc00 RBX: 0000000000000002 RCX:
000000000000001f
[ 408.620529] RDX: 0000000000000000 RSI: 000000003677d46d RDI:
0000000000000000
[ 408.620530] RBP: ffff9c28d634b000 R08: 0000005f23a8382d R09:
0000000000000018
[ 408.620531] R10: 0000000000000dd5 R11: 0000000000001169 R12:
ffffffff955b8fa0
[ 408.620532] R13: 0000005f23a8382d R14: 0000000000000002 R15:
0000000000000000
[ 408.620537] ? cpuidle_enter_state+0xb7/0x350
[ 408.620540] cpuidle_enter+0x29/0x40
[ 408.620543] do_idle+0x1ef/0x2b0
[ 408.620546] cpu_startup_entry+0x19/0x20
[ 408.620550] secondary_startup_64_no_verify+0xb0/0xbb
[ 408.620561] bad: scheduling from the idle thread!
[ 408.620591] CPU: 16 PID: 0 Comm: swapper/16 Tainted: G W
5.10.0-5-amd64 #1 Debian 5.10.26-1
[ 408.620593] Hardware name: Supermicro AS -2013S-C0R/H11SSL-C, BIOS
2.1 02/21/2020
[ 408.620601] Call Trace:
[ 408.620609] <IRQ>
[ 408.620617] dump_stack+0x6b/0x83
[ 408.620625] dequeue_task_idle+0x28/0x40
[ 408.620632] __schedule+0x3bf/0x870
[ 408.620641] schedule+0x46/0xb0
[ 408.620648] blk_mq_freeze_queue_wait+0x62/0x90
[ 408.620657] ? add_wait_queue_exclusive+0x70/0x70
[ 408.620666] aoedev_downdev+0x106/0x150 [aoe]
[ 408.620679] rexmit_timer+0x4ea/0x500 [aoe]
[ 408.620688] ? rexmit_deferred+0x380/0x380 [aoe]
[ 408.620694] call_timer_fn+0x29/0xf0
[ 408.620701] __run_timers.part.0+0x1d3/0x240
[ 408.620709] ? ktime_get+0x38/0xa0
[ 408.620715] ? lapic_next_event+0x1d/0x20
[ 408.620720] ? clockevents_program_event+0x8d/0xf0
[ 408.620728] run_timer_softirq+0x26/0x50
[ 408.620734] __do_softirq+0xc5/0x275
[ 408.620739] asm_call_irq_on_stack+0x12/0x20
[ 408.620743] </IRQ>
[ 408.620749] do_softirq_own_stack+0x37/0x40
[ 408.620757] irq_exit_rcu+0x8e/0xc0
[ 408.620767] sysvec_apic_timer_interrupt+0x36/0x80
[ 408.620774] asm_sysvec_apic_timer_interrupt+0x12/0x20
[ 408.620786] RIP: 0010:cpuidle_enter_state+0xc7/0x350
[ 408.620794] Code: 8b 3d dd 5b b7 6b e8 d8 4f a2 ff 49 89 c5 0f 1f 44
00 00 31 ff e8 69 5a a2 ff 45 84 ff 0f 85 fa 00 00 00 fb 66 0f 1f 44 00
00 <45> 85 f6 0f 88 06 01 00 00 49 63 c6 4c 2b 2c 24 48 8d 14 40 48 8d
[ 408.620796] RSP: 0018:ffffba890038fea8 EFLAGS: 00000246
[ 408.620798] RAX: ffff9c18afc2bc00 RBX: 0000000000000002 RCX:
000000000000001f
[ 408.620801] RDX: 0000000000000000 RSI: 000000003677d46d RDI:
0000000000000000
[ 408.620802] RBP: ffff9c28d634b000 R08: 0000005f23a8382d R09:
0000000000000018
[ 408.620804] R10: 0000000000000dd5 R11: 0000000000001169 R12:
ffffffff955b8fa0
[ 408.620805] R13: 0000005f23a8382d R14: 0000000000000002 R15:
0000000000000000
[ 408.620810] ? cpuidle_enter_state+0xb7/0x350
[ 408.620813] cpuidle_enter+0x29/0x40
[ 408.620817] do_idle+0x1ef/0x2b0
[ 408.620820] cpu_startup_entry+0x19/0x20
[ 408.620822] secondary_startup_64_no_verify+0xb0/0xbb
[ 408.622975] bad: scheduling from the idle thread!
[ 408.623007] CPU: 16 PID: 0 Comm: swapper/16 Tainted: G W
5.10.0-5-amd64 #1 Debian 5.10.26-1
[ 408.623008] Hardware name: Supermicro AS -2013S-C0R/H11SSL-C, BIOS
2.1 02/21/2020
[ 408.623010] Call Trace:
[ 408.623012] <IRQ>
[ 408.623014] dump_stack+0x6b/0x83
[ 408.623017] dequeue_task_idle+0x28/0x40
[ 408.623020] __schedule+0x3bf/0x870
[ 408.623022] schedule+0x46/0xb0
[ 408.623024] blk_mq_freeze_queue_wait+0x62/0x90
[ 408.623027] ? add_wait_queue_exclusive+0x70/0x70
[ 408.623031] aoedev_downdev+0x106/0x150 [aoe]
[ 408.623035] rexmit_timer+0x4ea/0x500 [aoe]
[ 408.623040] ? rexmit_deferred+0x380/0x380 [aoe]
[ 408.623052] call_timer_fn+0x29/0xf0
[ 408.623061] __run_timers.part.0+0x1d3/0x240
[ 408.623069] ? ktime_get+0x38/0xa0
[ 408.623078] ? lapic_next_event+0x1d/0x20
[ 408.623087] ? clockevents_program_event+0x8d/0xf0
[ 408.623095] run_timer_softirq+0x26/0x50
[ 408.623103] __do_softirq+0xc5/0x275
[ 408.623106] asm_call_irq_on_stack+0x12/0x20
[ 408.623107] </IRQ>
[ 408.623110] do_softirq_own_stack+0x37/0x40
[ 408.623114] irq_exit_rcu+0x8e/0xc0
[ 408.623116] sysvec_apic_timer_interrupt+0x36/0x80
[ 408.623119] asm_sysvec_apic_timer_interrupt+0x12/0x20
[ 408.623123] RIP: 0010:cpuidle_enter_state+0xc7/0x350
[ 408.623125] Code: 8b 3d dd 5b b7 6b e8 d8 4f a2 ff 49 89 c5 0f 1f 44
00 00 31 ff e8 69 5a a2 ff 45 84 ff 0f 85 fa 00 00 00 fb 66 0f 1f 44 00
00 <45> 85 f6 0f 88 06 01 00 00 49 63 c6 4c 2b 2c 24 48 8d 14 40 48 8d
[ 408.623127] RSP: 0018:ffffba890038fea8 EFLAGS: 00000246
[ 408.623130] RAX: ffff9c18afc2bc00 RBX: 0000000000000002 RCX:
000000000000001f
[ 408.623131] RDX: 0000000000000000 RSI: 000000003677d46d RDI:
0000000000000000
[ 408.623132] RBP: ffff9c28d634b000 R08: 0000005f23a8382d R09:
0000000000000018
[ 408.623133] R10: 0000000000000dd5 R11: 0000000000001169 R12:
ffffffff955b8fa0
[ 408.623135] R13: 0000005f23a8382d R14: 0000000000000002 R15:
0000000000000000
[ 408.623140] ? cpuidle_enter_state+0xb7/0x350
[ 408.623147] cpuidle_enter+0x29/0x40
[ 408.623154] do_idle+0x1ef/0x2b0
[ 408.623164] cpu_startup_entry+0x19/0x20
[ 408.623173] secondary_startup_64_no_verify+0xb0/0xbb
[ 408.623484] bad: scheduling from the idle thread!
[ 408.623514] CPU: 16 PID: 0 Comm: swapper/16 Tainted: G W
5.10.0-5-amd64 #1 Debian 5.10.26-1
[ 408.623515] Hardware name: Supermicro AS -2013S-C0R/H11SSL-C, BIOS
2.1 02/21/2020
[ 408.623516] Call Trace:
[ 408.623518] <IRQ>
[ 408.623520] dump_stack+0x6b/0x83
[ 408.623523] dequeue_task_idle+0x28/0x40
[ 408.623525] __schedule+0x3bf/0x870
[ 408.623528] schedule+0x46/0xb0
[ 408.623531] blk_mq_freeze_queue_wait+0x62/0x90
[ 408.623533] ? add_wait_queue_exclusive+0x70/0x70
[ 408.623537] aoedev_downdev+0x106/0x150 [aoe]
[ 408.623542] rexmit_timer+0x4ea/0x500 [aoe]
[ 408.623554] ? rexmit_deferred+0x380/0x380 [aoe]
[ 408.623563] call_timer_fn+0x29/0xf0
[ 408.623572] __run_timers.part.0+0x1d3/0x240
[ 408.623581] ? ktime_get+0x38/0xa0
[ 408.623587] ? lapic_next_event+0x1d/0x20
[ 408.623596] ? clockevents_program_event+0x8d/0xf0
[ 408.623604] run_timer_softirq+0x26/0x50
[ 408.623610] __do_softirq+0xc5/0x275
[ 408.623613] asm_call_irq_on_stack+0x12/0x20
[ 408.623615] </IRQ>
[ 408.623618] do_softirq_own_stack+0x37/0x40
[ 408.623620] irq_exit_rcu+0x8e/0xc0
[ 408.623623] sysvec_apic_timer_interrupt+0x36/0x80
[ 408.623626] asm_sysvec_apic_timer_interrupt+0x12/0x20
[ 408.623630] RIP: 0010:cpuidle_enter_state+0xc7/0x350
[ 408.623632] Code: 8b 3d dd 5b b7 6b e8 d8 4f a2 ff 49 89 c5 0f 1f 44
00 00 31 ff e8 69 5a a2 ff 45 84 ff 0f 85 fa 00 00 00 fb 66 0f 1f 44 00
00 <45> 85 f6 0f 88 06 01 00 00 49 63 c6 4c 2b 2c 24 48 8d 14 40 48 8d
[ 408.623634] RSP: 0018:ffffba890038fea8 EFLAGS: 00000246
[ 408.623636] RAX: ffff9c18afc2bc00 RBX: 0000000000000002 RCX:
000000000000001f
[ 408.623637] RDX: 0000000000000000 RSI: 000000003677d46d RDI:
0000000000000000
[ 408.623639] RBP: ffff9c28d634b000 R08: 0000005f23a8382d R09:
0000000000000018
[ 408.623641] R10: 0000000000000dd5 R11: 0000000000001169 R12:
ffffffff955b8fa0
[ 408.623644] R13: 0000005f23a8382d R14: 0000000000000002 R15:
0000000000000000
[ 408.623651] ? cpuidle_enter_state+0xb7/0x350
[ 408.623660] cpuidle_enter+0x29/0x40
[ 408.623668] do_idle+0x1ef/0x2b0
[ 408.623677] cpu_startup_entry+0x19/0x20
[ 408.623685] secondary_startup_64_no_verify+0xb0/0xbb
[ 408.623991] bad: scheduling from the idle thread!
[ 408.624027] CPU: 16 PID: 0 Comm: swapper/16 Tainted: G W
5.10.0-5-amd64 #1 Debian 5.10.26-1
[ 408.624028] Hardware name: Supermicro AS -2013S-C0R/H11SSL-C, BIOS
2.1 02/21/2020
[ 408.624029] Call Trace:
[ 408.624030] <IRQ>
[ 408.624034] dump_stack+0x6b/0x83
[ 408.624036] dequeue_task_idle+0x28/0x40
[ 408.624038] __schedule+0x3bf/0x870
[ 408.624041] schedule+0x46/0xb0
[ 408.624043] blk_mq_freeze_queue_wait+0x62/0x90
[ 408.624047] ? add_wait_queue_exclusive+0x70/0x70
[ 408.624051] aoedev_downdev+0x106/0x150 [aoe]
[ 408.624054] rexmit_timer+0x4ea/0x500 [aoe]
[ 408.624058] ? rexmit_deferred+0x380/0x380 [aoe]
[ 408.624062] call_timer_fn+0x29/0xf0
[ 408.624064] __run_timers.part.0+0x1d3/0x240
[ 408.624066] ? ktime_get+0x38/0xa0
[ 408.624068] ? lapic_next_event+0x1d/0x20
[ 408.624070] ? clockevents_program_event+0x8d/0xf0
[ 408.624072] run_timer_softirq+0x26/0x50
[ 408.624075] __do_softirq+0xc5/0x275
[ 408.624077] asm_call_irq_on_stack+0x12/0x20
[ 408.624078] </IRQ>
[ 408.624080] do_softirq_own_stack+0x37/0x40
[ 408.624084] irq_exit_rcu+0x8e/0xc0
[ 408.624094] sysvec_apic_timer_interrupt+0x36/0x80
[ 408.624103] asm_sysvec_apic_timer_interrupt+0x12/0x20
[ 408.624112] RIP: 0010:cpuidle_enter_state+0xc7/0x350
[ 408.624121] Code: 8b 3d dd 5b b7 6b e8 d8 4f a2 ff 49 89 c5 0f 1f 44
00 00 31 ff e8 69 5a a2 ff 45 84 ff 0f 85 fa 00 00 00 fb 66 0f 1f 44 00
00 <45> 85 f6 0f 88 06 01 00 00 49 63 c6 4c 2b 2c 24 48 8d 14 40 48 8d
[ 408.624130] RSP: 0018:ffffba890038fea8 EFLAGS: 00000246
[ 408.624142] RAX: ffff9c18afc2bc00 RBX: 0000000000000002 RCX:
000000000000001f
[ 408.624150] RDX: 0000000000000000 RSI: 000000003677d46d RDI:
0000000000000000
[ 408.624158] RBP: ffff9c28d634b000 R08: 0000005f23a8382d R09:
0000000000000018
[ 408.624163] R10: 0000000000000dd5 R11: 0000000000001169 R12:
ffffffff955b8fa0
[ 408.624172] R13: 0000005f23a8382d R14: 0000000000000002 R15:
0000000000000000
[ 408.624182] ? cpuidle_enter_state+0xb7/0x350
[ 408.624189] cpuidle_enter+0x29/0x40
[ 408.624196] do_idle+0x1ef/0x2b0
[ 408.624200] cpu_startup_entry+0x19/0x20
[ 408.624203] secondary_startup_64_no_verify+0xb0/0xbb


These messages continue until the machine is reset.

It seems to be a regression from commit 3582dd291788 ("aoe: convert
aoeblk to blk-mq") and a similar bug has already been fixed [1].

running on kernel 4.19.0-16-amd64 we get the expected result that the
aoe device is removed upon timeout. dmesg output:
[....]
[ 301.543788] mlx4_en: enp65s0d1: Close port called
[ 301.608154] mlx4_en: enp65s0d1: Link Down
[ 527.124182] print_req_error: I/O error, dev etherd/e42.0, sector 4096
[ 527.124248] aoe: device 42.0 is not up
[ 527.124251] print_req_error: I/O error, dev etherd/e42.0, sector 0
[ 527.124299] aoe: device 42.0 is not up
[ 527.124300] aoe: device 42.0 is not up
[ 527.124316] aoe: device 42.0 is not up


Hope someone can resolve this issue,

thanks for your help,
Valentin Kleibel

[1] https://lkml.org/lkml/2019/8/27/400

Salvatore Bonaccorso

unread,
Apr 12, 2021, 1:30:03 PM4/12/21
to
Hi Valentin,
Thanks for the report. I assume you can reproduce the issue as well
with 5.10.28-1 in unstable?

Can you report the issue to upstream and loop in the bug?

Regards,
Salvatore

Valentin Kleibel

unread,
Apr 13, 2021, 8:20:03 AM4/13/21
to
Hi Salvatore,

> Thanks for the report. I assume you can reproduce the issue as well
> with 5.10.28-1 in unstable?

I did not test this before as the aoe driver code was not changed at all
in the last 7 months. I can now report that the behavior is exactly the
same running the kernel 5.10.0-6-amd64 #1 SMP Debian 5.10.28-1 from
unstable.

> Can you report the issue to upstream and loop in the bug?

Yes but i did not do this before. From what i can find i'd report to
the IO/STorage - Other section in bugzilla [1] and mail to the listed
maintainer "Justin Sanders" <jus...@coraid.com>. Is that the way to go
or did i forget something?

Regards,
Valentin

[1]
https://bugzilla.kernel.org/buglist.cgi?component=Other&product=IO%2FStorage&resolution=---

Salvatore Bonaccorso

unread,
Apr 13, 2021, 8:40:03 AM4/13/21
to
Hi Valentin,

On Tue, Apr 13, 2021 at 01:42:49PM +0200, Valentin Kleibel wrote:
> Hi Salvatore,
>
> > Thanks for the report. I assume you can reproduce the issue as well
> > with 5.10.28-1 in unstable?
>
> I did not test this before as the aoe driver code was not changed at all in
> the last 7 months. I can now report that the behavior is exactly the same
> running the kernel 5.10.0-6-amd64 #1 SMP Debian 5.10.28-1 from unstable.

Ack thanks for confirming that as well.

>
> > Can you report the issue to upstream and loop in the bug?
>
> Yes but i did not do this before. From what i can find i'd report to
> the IO/STorage - Other section in bugzilla [1] and mail to the listed
> maintainer "Justin Sanders" <jus...@coraid.com>. Is that the way to go or
> did i forget something?

You can run the get_maintainer.pl script to get information where to
report the issue, it gives me:

$ ./scripts/get_maintainer.pl ./drivers/block/aoe/
Justin Sanders <jus...@coraid.com> (supporter:ATA OVER ETHERNET (AOE) DRIVER)
Jens Axboe <ax...@kernel.dk> (maintainer:BLOCK LAYER)
linux...@vger.kernel.org (open list:BLOCK LAYER)
linux-...@vger.kernel.org (open list)

So yes, this is the right approach, mail to Justin and Jens, keeping
the mailinglist in the loop.

Let me know if you need help here further!

Regards,
Salvatore

Valentin Kleibel

unread,
Apr 13, 2021, 11:00:04 AM4/13/21
to
Hi

> $ ./scripts/get_maintainer.pl ./drivers/block/aoe/
> Justin Sanders <jus...@coraid.com> (supporter:ATA OVER ETHERNET (AOE) DRIVER)
> Jens Axboe <ax...@kernel.dk> (maintainer:BLOCK LAYER)
> linux...@vger.kernel.org (open list:BLOCK LAYER)
> linux-...@vger.kernel.org (open list)

Thanks for your help.
The bug has been reported upstream:
linux-kernel: https://lkml.org/lkml/2021/4/13/672
linux-block:
https://lore.kernel.org/linux-block/b6aea08d-7190-e341...@vrvis.at/T/#u
kernel.org bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=212665

Regards,
Valentin

Valentin Kleibel

unread,
May 17, 2021, 12:30:03 PM5/17/21
to
Hello,

> Thanks for your help.
> The bug has been reported upstream:
> linux-kernel: https://lkml.org/lkml/2021/4/13/672
> linux-block:
> https://lore.kernel.org/linux-block/b6aea08d-7190-e341...@vrvis.at/T/#u
> kernel.org bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=212665

After waiting some weeks and in addition trying to reach out to
sup...@coraid.com to find someone who might be willing to address this
bug i unfortunately did not receive any response.
Do you have suggestions what i could do next? Unfortunately i do not
think i have the skills to fix the bug myself.

Regards, Valentin

Raymond Burkholder

unread,
May 17, 2021, 10:00:03 PM5/17/21
to
On 5/17/21 10:17 AM, Valentin Kleibel wrote:
>> The bug has been reported upstream:
>> linux-kernel: https://lkml.org/lkml/2021/4/13/672
>> linux-block:
>> https://lore.kernel.org/linux-block/b6aea08d-7190-e341...@vrvis.at/T/#u
>>
>> kernel.org bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=212665
>
> After waiting some weeks and in addition trying to reach out to
> sup...@coraid.com to find someone who might be willing to address this
> bug i unfortunately did not receive any response.

Probably won't. I think coraid no longer exists as of about 3 or 4
years ago. [if this is about aoe style systems]


> Do you have suggestions what i could do next? Unfortunately i do not
> think i have the skills to fix the bug myself.

Try something else? :-(

Valentin (Sysadmin)

unread,
Mar 1, 2022, 6:10:03 AM3/1/22
to
Hi,

I finally managed to identify the root cause of this issue and do have a
patch and a more detailed description of the issue attched to the kernel
bugtracker.
The attached patch is applicable to stable (5.10.92) and experimental
(5.17-rc4) kernels.
As I did not receive any response to the original upstream report, I
fear that this might be the same for the proposed patch as well.
Do you have any suggestions on what to do?
I would now follow up with a mail to the maintainers, the linux-block
list and the lkml but I don't know anything more I could try.

Regards,
Valentin
aoe-fix-deadsecs-timeout-handling.patch
0 new messages