Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Bug#949369: i915: kernel crash in i915_active_acquire()

6 views
Skip to first unread message

Arturo Borrero Gonzalez

unread,
Jan 20, 2020, 5:30:05 AM1/20/20
to
Source: linux
Version: 5.4.8-1
Severity: normal
Tags: upstream

Dear maintainers, thanks for your hard work with the linux package, it is
really appreciated.

I had this kernel crash today that let the system unusable.

kernel: [ 973.595610] #PF: supervisor read access in kernel mode
kernel: [ 973.595610] #PF: supervisor read access in kernel mode
kernel: [ 973.595611] #PF: error_code(0x0000) - not-present page
kernel: [ 973.595612] PGD 0 P4D 0
kernel: [ 973.595614] Oops: 0000 [#1] SMP PTI
kernel: [ 973.595616] CPU: 3 PID: 1240 Comm: xfwm4 Tainted: P OE 5.4.0-2-amd64 #1 Debian 5.4.8-1
kernel: [ 973.595617] Hardware name: LENOVO 20H9CTO1WW/20H9CTO1WW, BIOS N1VET40W (1.30 ) 02/07/2018
kernel: [ 973.595644] RIP: 0010:i915_active_acquire+0x9/0x70 [i915]
kernel: [ 973.595646] Code: 00 00 00 48 c7 46 58 00 00 00 00 c7 46 38 00 00 00 00 48 c7 c6 0a 20 29 c2 e9 c3 f7 12 cf 0f 1f 00 0f 1f 44 00 00 41 54 55 53 <8b> 47 38 48 89 fb 85 c0 74 15 8d 50 01 f0 0f b1 53 38 75 f2 45 31
kernel: [ 973.595647] RSP: 0018:ffffb9924256ba40 EFLAGS: 00010286
kernel: [ 973.595648] RAX: 0000000000000000 RBX: ffff8b29346850c0 RCX: 0000000000000000
kernel: [ 973.595649] RDX: ffff8b288f04e880 RSI: ffff8b29346850c0 RDI: 0000000000000008
kernel: [ 973.595650] RBP: ffff8b288f04e880 R08: ffff8b2936174c08 R09: ffff8b2936174c08
kernel: [ 973.595651] R10: 000000000000a000 R11: 0000000000000000 R12: 0000000000000008
kernel: [ 973.595652] R13: 0000000000000004 R14: ffff8b288f04e880 R15: 000000004000001c
kernel: [ 973.595654] FS: 00007f342b744f00(0000) GS:ffff8b293e380000(0000) knlGS:0000000000000000
kernel: [ 973.595655] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kernel: [ 973.595655] CR2: 0000000000000040 CR3: 0000000460b2e005 CR4: 00000000003606e0
kernel: [ 973.595656] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
kernel: [ 973.595657] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
kernel: [ 973.595658] Call Trace:
kernel: [ 973.595679] i915_active_ref+0x21/0x210 [i915]
kernel: [ 973.595683] ? _cond_resched+0x15/0x30
kernel: [ 973.595703] i915_vma_move_to_active+0x6e/0xf0 [i915]
kernel: [ 973.595723] i915_gem_do_execbuffer+0xc62/0x1520 [i915]
kernel: [ 973.595726] ? _cond_resched+0x15/0x30
kernel: [ 973.595727] ? mutex_lock+0xe/0x30
kernel: [ 973.595729] ? unix_stream_read_generic+0x1f7/0x8f0
kernel: [ 973.595733] ? __kmalloc_node+0x1f5/0x300
kernel: [ 973.595749] i915_gem_execbuffer2_ioctl+0x1df/0x3d0 [i915]
kernel: [ 973.595767] ? i915_gem_madvise_ioctl+0x13a/0x290 [i915]
kernel: [ 973.595782] ? i915_gem_execbuffer_ioctl+0x2e0/0x2e0 [i915]
kernel: [ 973.595794] drm_ioctl_kernel+0xaa/0xf0 [drm]
kernel: [ 973.595801] drm_ioctl+0x208/0x390 [drm]
kernel: [ 973.595817] ? i915_gem_execbuffer_ioctl+0x2e0/0x2e0 [i915]
kernel: [ 973.595828] do_vfs_ioctl+0x40e/0x670
kernel: [ 973.595830] ? __schedule+0x2eb/0x740
kernel: [ 973.595832] ksys_ioctl+0x5e/0x90
kernel: [ 973.595835] ? exit_to_usermode_loop+0x6a/0xf0
kernel: [ 973.595837] __x64_sys_ioctl+0x16/0x20
kernel: [ 973.595838] do_syscall_64+0x52/0x160
kernel: [ 973.595841] entry_SYSCALL_64_after_hwframe+0x44/0xa9
kernel: [ 973.595842] RIP: 0033:0x7f342cc195c7
kernel: [ 973.595844] Code: 00 00 90 48 8b 05 c9 78 0c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1
f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 99 78 0c 00 f7 d8 64 89 01 48
kernel: [ 973.595845] RSP: 002b:00007ffe61cf9bb8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
kernel: [ 973.595846] RAX: ffffffffffffffda RBX: 00007ffe61cf9c00 RCX: 00007f342cc195c7
kernel: [ 973.595847] RDX: 00007ffe61cf9c00 RSI: 0000000040406469 RDI: 000000000000000a
kernel: [ 973.595848] RBP: 0000000040406469 R08: 0000560b58426630 R09: 0000000000000000
kernel: [ 973.595849] R10: 0000000000000000 R11: 0000000000000246 R12: 0000560b587b50c0
kernel: [ 973.595849] R13: 000000000000000a R14: ffffffffffffffff R15: 00007f342975d6a8
kernel: [ 973.595851] Modules linked in: rfcomm ctr ccm nvidia_modeset(POE) cmac overlay bnep intel_rapl_msr inte
l_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp mei_wdt kvm_intel snd_hda_codec_hdmi iwlmvm btusb nls_ascii snd_hda_codec_realt
ek btrtl nls_cp437 btbcm btintel kvm mac80211 snd_hda_codec_generic irqbypass vfat snd_soc_skl libarc4 fat bluetooth intel_cstate snd_soc_hd
ac_hda intel_uncore snd_hda_ext_core efi_pstore snd_soc_sst_ipc intel_rapl_perf snd_soc_sst_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_so
c_core i915 wmi_bmof pcspkr serio_raw efivars snd_compress snd_hda_intel snd_intel_nhlt iTCO_wdt intel_wmi_thunderbolt iwlwifi uvcvideo snd_
hda_codec iTCO_vendor_support snd_hda_core nvidia(POE) watchdog videobuf2_vmalloc snd_hwdep videobuf2_memops videobuf2_v4l2 snd_pcm videobuf
2_common snd_timer drm_kms_helper cfg80211 videodev ipmi_devintf sg drbg ipmi_msghandler mc drm ansi_cprng joydev evdev ucsi_acpi typec_ucsi
mei_me ecdh_generic ecc mei i2c_algo_bit intel_pch_thermal
kernel: [ 973.595874] typec thinkpad_acpi nvram nft_ct ledtrig_audio nf_conntrack snd nf_defrag_ipv6 soundcore t
pm_crb nf_defrag_ipv4 rfkill ac libcrc32c tpm_tis tpm_tis_core tpm rng_core acpi_pad button nft_counter nf_tables_set parport_pc ppdev nf_ta
bles nfnetlink lp parport efivarfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 crc32c_generic sd_mod uas usb_storage scsi_mod hid_gen
eric usbhid dm_crypt dm_mod crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel e1000e aesni_intel crypto_simd cryptd glue_helper
xhci_pci xhci_hcd psmouse nvme ptp pps_core nvme_core usbcore i2c_i801 usb_common wmi battery i2c_hid hid video
kernel: [ 973.595893] CR2: 0000000000000040
kernel: [ 973.595894] ---[ end trace a922597122ba8247 ]---
kernel: [ 973.761055] RIP: 0010:i915_active_acquire+0x9/0x70 [i915]
kernel: [ 973.761060] Code: 00 00 00 48 c7 46 58 00 00 00 00 c7 46 38 00 00 00 00 48 c7 c6 0a 20 29 c2 e9 c3 f7 1
2 cf 0f 1f 00 0f 1f 44 00 00 41 54 55 53 <8b> 47 38 48 89 fb 85 c0 74 15 8d 50 01 f0 0f b1 53 38 75 f2 45 31
kernel: [ 973.761061] RSP: 0018:ffffb9924256ba40 EFLAGS: 00010286
kernel: [ 973.761062] RAX: 0000000000000000 RBX: ffff8b29346850c0 RCX: 0000000000000000
kernel: [ 973.761063] RDX: ffff8b288f04e880 RSI: ffff8b29346850c0 RDI: 0000000000000008
kernel: [ 973.761064] RBP: ffff8b288f04e880 R08: ffff8b2936174c08 R09: ffff8b2936174c08
kernel: [ 973.761065] R10: 000000000000a000 R11: 0000000000000000 R12: 0000000000000008
kernel: [ 973.761067] R13: 0000000000000004 R14: ffff8b288f04e880 R15: 000000004000001c
kernel: [ 973.761069] FS: 00007f342b744f00(0000) GS:ffff8b293e380000(0000) knlGS:0000000000000000
kernel: [ 973.761070] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kernel: [ 973.761071] CR2: 0000000000000040 CR3: 0000000460b2e005 CR4: 00000000003606e0
kernel: [ 973.761072] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
kernel: [ 973.761072] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400

I believe it is related to this device:

00:02.0 VGA compatible controller: Intel Corporation HD Graphics 620 (rev 02) (prog-if 00 [VGA controller])
Subsystem: Lenovo ThinkPad T570
Flags: bus master, fast devsel, latency 0, IRQ 135
Memory at eb000000 (64-bit, non-prefetchable) [size=16M]
Memory at 80000000 (64-bit, prefetchable) [size=256M]
I/O ports at e000 [size=64]
[virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
Capabilities: [40] Vendor Specific Information: Len=0c <?>
Capabilities: [70] Express Root Complex Integrated Endpoint, MSI 00
Capabilities: [ac] MSI: Enable+ Count=1/1 Maskable- 64bit-
Capabilities: [d0] Power Management version 2
Capabilities: [100] Process Address Space ID (PASID)
Capabilities: [200] Address Translation Service (ATS)
Capabilities: [300] Page Request Interface (PRI)
Kernel driver in use: i915

-- System Information:
Debian Release: bullseye/sid
APT prefers testing
APT policy: (500, 'testing')
Architecture: amd64 (x86_64)

Kernel: Linux 5.4.0-2-amd64 (SMP w/4 CPU cores)
Kernel taint flags: TAINT_PROPRIETARY_MODULE, TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE
Locale: LANG=C.UTF-8, LC_CTYPE=C.UTF-8 (charmap=UTF-8), LANGUAGE=C.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /usr/bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Ben Hutchings

unread,
Jan 20, 2020, 2:00:04 PM1/20/20
to
Control: tag -1 moreinfo

On Mon, 2020-01-20 at 11:03 +0100, Arturo Borrero Gonzalez wrote:
> Source: linux
> Version: 5.4.8-1
> Severity: normal
> Tags: upstream
>
> Dear maintainers, thanks for your hard work with the linux package, it is
> really appreciated.
>
> I had this kernel crash today that let the system unusable.
[...]

Is this reproducible without the Nvidia proprietary driver?

Ben.

--
Ben Hutchings
Unix is many things to many people,
but it's never been everything to anybody.


signature.asc

Arturo Borrero Gonzalez

unread,
Jan 20, 2020, 3:10:07 PM1/20/20
to


On Mon, Jan 20, 2020, 20:02 Ben Hutchings <b...@decadent.org.uk> wrote:
Control: tag -1 moreinfo

On Mon, 2020-01-20 at 11:03 +0100, Arturo Borrero Gonzalez wrote:
> Source: linux
> Version: 5.4.8-1
> Severity: normal
> Tags: upstream
>
> Dear maintainers, thanks for your hard work with the linux package, it is
> really appreciated.
>
> I had this kernel crash today that let the system unusable.
[...]

Is this reproducible without the Nvidia proprietary driver?

I was unable to reproduce the issue even without doing any modification to the system.

Feel free to close the bug. I can reopen it in case I have followups.

Nils Jarle Haugen

unread,
Feb 18, 2020, 10:20:03 AM2/18/20
to
Dear maintainers,

I experienced the same crash today. I have the Lenovo t470p with the
i915 (Intel HD 630 graphics) and the NVIDIA 940MX dedicated graphics
card. I statically switch between intel and nvidia with xrandr and sddm
scrpts (using
https://wiki.debian.org/NvidiaGraphicsDrivers/Optimus#Configuration).

On the occasion when the system crashed, the xrandr and sddm scripts
were commented out and not actively in use (but the nvidia kernel module
was still loaded).

The crash left the system in a unusable state, needed to call sysrq to
force restart it.

I'll try to purge the proprietary drivers and then try to reproduce the
issue.


Kind regards,
Nils J. Haugen


Distributor ID: Debian
Description:    Debian GNU/Linux bullseye/sid
Release:        testing
Codename:       bullseye
5.4.0-3-amd64

Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.793555] i915
0000:00:02.0: GPU HANG: ecode 9:1:0x00000000, hang on rcs0
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.794565] i915
0000:00:02.0: Resetting rcs0 for hang on rcs0
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.795587] i915
0000:00:02.0: Resetting chip for hang on rcs0
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846861] ------------[ cut
here ]------------
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846871] invalid opcode:
0000 [#1] SMP PTI
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846875] CPU: 1 PID: 4125
Comm: Unity Tainted: G           OE     5.4.0-3-amd64 #1 Debian 5.4.13-1
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846878] Hardware name:
LENOVO 20J6CTO1WW/20J6CTO1WW, BIOS R0FET50W (1.30 ) 07/03/2019
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846885] RIP:
0010:__list_del_entry_valid.cold+0x31/0x55
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846889] Code: 61 0e a3 e8
d4 1d ce ff 0f 0b 48 c7 c7 50 62 0e a3 e8 c6 1d ce ff 0f 0b 48 89 f2 48
89 fe 48 c7 c7 10 62 0e a3 e8 b2 1d ce ff <0f> 0b 48 89 fe 4c 89 c2 48
c7 c7 d8 61 0e a3 e8 9e 1d ce ff 0f 0b
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846892] RSP:
0018:ffffba9dc91978a0 EFLAGS: 00010046
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846895] RAX:
0000000000000054 RBX: ffffa0769e0d4fc0 RCX: 0000000000000000
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846897] RDX:
0000000000000000 RSI: ffffa0783e257688 RDI: ffffa0783e257688
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846899] RBP:
ffffa0783b9b6068 R08: ffffa0783e257688 R09: 0000000000000063
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846901] R10:
ffffba9dc9197750 R11: 0000000000000000 R12: ffffa0783b9b6000
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846903] R13:
ffffa0783b9b6000 R14: ffffa0783267c180 R15: ffffa078300f7ae8
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846906] FS:
00007f571b4992c0(0000) GS:ffffa0783e240000(0000) knlGS:0000000000000000
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846908] CS:  0010 DS:
0000 ES: 0000 CR0: 0000000080050033
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846910] CR2:
00007f5717f28000 CR3: 0000000432e5e004 CR4: 00000000003606e0
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846913] DR0:
0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846915] DR3:
0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846916] Call Trace:
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.846989]
i915_request_retire+0xc9/0x380 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847042]
retire_requests+0x4e/0x60 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847088]
i915_retire_requests+0xa9/0x230 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847133]
i915_gem_evict_for_node+0x264/0x2b0 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847176]
i915_gem_gtt_reserve+0x45/0x70 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847222]
__i915_vma_do_pin+0x1d7/0x490 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847265]
eb_lookup_vmas+0xa90/0xb90 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847305]  ?
intel_gt_terminally_wedged+0x23/0xf0 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847344]
i915_gem_do_execbuffer+0x67c/0x1520 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847352]  ?
shmem_alloc_page+0x47/0x90
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847356]  ?
xas_store+0x56/0x5e0
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847362]  ?
mem_cgroup_charge_statistics+0x4c/0xd0
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847365]  ?
mem_cgroup_commit_charge+0x5f/0x4e0
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847370]  ?
__kmalloc_node+0x1f5/0x300
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847407]
i915_gem_execbuffer2_ioctl+0x1df/0x3d0 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847445]  ?
i915_gem_execbuffer_ioctl+0x2e0/0x2e0 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847466]
drm_ioctl_kernel+0xaa/0xf0 [drm]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847484]
drm_ioctl+0x208/0x390 [drm]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847521]  ?
i915_gem_execbuffer_ioctl+0x2e0/0x2e0 [i915]
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847527]
do_vfs_ioctl+0x40e/0x670
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847530] ksys_ioctl+0x5e/0x90
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847534]
__x64_sys_ioctl+0x16/0x20
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847538]
do_syscall_64+0x52/0x160
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847544]
entry_SYSCALL_64_after_hwframe+0x44/0xa9
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847547] RIP:
0033:0x7f571dcb65b7
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847551] Code: 00 00 90 48
8b 05 d9 78 0c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f
1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3
48 8b 0d a9 78 0c 00 f7 d8 64 89 01 48
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847553] RSP:
002b:00007ffe7f2b6798 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847556] RAX:
ffffffffffffffda RBX: 00007ffe7f2b67e0 RCX: 00007f571dcb65b7
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847558] RDX:
00007ffe7f2b67e0 RSI: 0000000040406469 RDI: 000000000000004a
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847560] RBP:
0000000040406469 R08: 000026a53483c700 R09: 0000000000000000
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847562] R10:
0000000000000000 R11: 0000000000000246 R12: 000026a53486e030
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847564] R13:
000000000000004a R14: ffffffffffffffff R15: 00007f56f8a90e08
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847567] Modules linked
in: fuse ctr ccm hid_generic hidp hid acpi_call(OE) rfcomm ipt_REJECT
nf_reject_ipv4 xt_tcpudp nft_compat nft_counter nft_chain_nat nf_nat
nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c nf_tables nfnetlink
tun bridge stp llc cmac bnep bbswitch(OE) btusb btrtl btbcm btintel
bluetooth uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2
videobuf2_common drbg videodev ansi_cprng mc ecdh_generic ecc
intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp
coretemp kvm_intel binfmt_misc kvm irqbypass iwlmvm snd_hda_codec_hdmi
mei_wdt crct10dif_pclmul ghash_clmulni_intel mac80211 nls_ascii
snd_hda_codec_realtek nls_cp437 vfat snd_hda_codec_generic fat libarc4
aesni_intel crypto_simd cryptd glue_helper efi_pstore intel_cstate
snd_hda_intel iwlwifi snd_intel_nhlt snd_hda_codec intel_uncore joydev
intel_rapl_perf snd_hda_core serio_raw pcspkr efivars snd_hwdep cfg80211
snd_pcm iTCO_wdt iTCO_vendor_support wmi_bmof intel_wmi_thunderbolt
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847611] thinkpad_acpi
rtsx_pci_ms mei_me snd_timer sg watchdog memstick mei tpm_crb
intel_pch_thermal nvram ledtrig_audio snd soundcore rfkill ac tpm_tis
tpm_tis_core tpm rng_core acpi_pad evdev parport_pc ppdev lp parport
efivarfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2
crc32c_generic dm_mod sd_mod i915 rtsx_pci_sdmmc mmc_core i2c_algo_bit
crc32_pclmul e1000e ahci xhci_pci drm_kms_helper libahci xhci_hcd nvme
psmouse ptp i2c_i801 libata nvme_core crc32c_intel rtsx_pci pps_core
usbcore drm mfd_core scsi_mod usb_common wmi battery video button
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6312.847655] ---[ end trace
c3c61a0be155b85c ]---
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6313.009869] snd_hda_intel
0000:00:1f.3: Unstable LPIB (350544 >= 176400); disabling LPIB delay
counting
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6313.088787] RIP:
0010:__list_del_entry_valid.cold+0x31/0x55
Feb 18 15:29:29 nilsjarle-t470p kernel: [ 6313.088796] Code: 61 0e a3 e8
d4 1d ce ff 0f 0b 48 c7 c7 50 62 0e a3 e8 c6 1d ce ff 0f 0b 48 89 f2 48
89 fe 48 c7 c7 10 62 0e a3 e8 b2 1d ce ff <0f> 0b 48 89 fe 4c 89 c2 48
c7 c7 d8 61 0e a3 e8 9e 1d ce ff 0f 0b

Pavel Reznicek

unread,
Feb 25, 2020, 10:00:03 AM2/25/20
to
Source: linux
Followup-For: Bug #949369

Dear Maintainer,

I experienced the same crash after upgrading to kernel series 5.4.x (namely to 5.4.8) in mid of January. The crashes of the system were quite frequent (every few hours, making the system unusable). After few experiments I ended up with the linux kernel 5.5.0-rc5 from unstable, which seems to work relativelly well. However, today I got the same crash already twice, despite more than month of using it. There are numer of reports about the problems related to the i915 driver in the 5.3.x-5.5.x kernel series:

https://bbs.archlinux.org/viewtopic.php?id=250765&p=7
https://linuxreviews.org/Linux_Kernel_5.5_Will_Not_Fix_The_Frequent_Intel_GPU_Hangs_In_Recent_Kernels
https://forum.manjaro.org/t/random-freezing-with-resetting-rcs0-for-hang-on-rcs0/119313/29
https://www.phoronix.com/scan.php?page=news_item&px=Linux-5.5-Intel-Missed-Graphics

Kernels 4.19.x in buster seems to be fine.

Pavel


-- System Information:
Debian Release: bullseye/sid
APT prefers testing
APT policy: (840, 'testing'), (740, 'unstable'), (738, 'experimental'), (540, 'proposed-updates'), (540, 'stable'), (500, 'oldstable-proposed-updates'), (500, 'oldoldstable'), (500, 'oldstable')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 5.5.0-rc5-amd64 (SMP w/8 CPU cores)
Kernel taint flags: TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE
Locale: LANG=en_US.UTF-8, LC_CTYPE=cs_CZ.UTF-8 (charmap=UTF-8) (ignored: LC_ALL set to en_US.UTF-8), LANGUAGE=en_US:en (charmap=UTF-8) (ignored: LC_ALL set to en_US.UTF-8)
Shell: /bin/sh linked to /bin/dash

Ben Caradoc-Davies

unread,
Mar 12, 2020, 7:00:05 PM3/12/20
to
On 13/03/2020 11:37, Ben Caradoc-Davies wrote:
> Seen today with 5.4.19-1 (sid) on Intel HD 630 Graphics (i7 7700).
> Console nonrecoverable but system up and could be mostly shut down via
> remote access. Oops attached.

Note: no Nvidia hardware or drivers present. This is an IGP-only system.

--
Ben Caradoc-Davies <b...@transient.nz>
Director
Transient Software Limited <https://transient.nz/>
New Zealand

Ben Caradoc-Davies

unread,
Mar 12, 2020, 7:10:04 PM3/12/20
to
See also similar report and discussion and patch here:
https://gitlab.freedesktop.org/drm/intel/issues/827

Especially the remark "obj->frontbuffer is no longer protected by the
struct_mutex".

Are they the upstream? I do not know if this is exactly the same bug as
reported here, or whether they are upstream, so I have not set upstream.

Kind regards,

Ben Caradoc-Davies

unread,
Mar 12, 2020, 7:50:03 PM3/12/20
to
Seen today with 5.4.19-1 (sid) on Intel HD 630 Graphics (i7 7700).
Console nonrecoverable but system up and could be mostly shut down via
remote access. Oops attached.

i915-oops.2020-03-13.txt

Guy Baconniere

unread,
Mar 28, 2020, 6:30:03 AM3/28/20
to

Guy Baconniere

unread,
Apr 2, 2020, 7:00:03 AM4/2/20
to

@John try to install Linux Kernel 5.5.13 (aka 5.5.0-1) from sid

https://packages.debian.org/sid/kernel/linux-image-5.5.0-1-amd64-unsigned

https://kernel.ubuntu.com/~kernel-ppa/mainline/v5.5.12/CHANGES

Chris Wilson (1):

drm/i915/execlists: Track active elements during dequeue
Matt Roper (1):

drm/i915: Handle all MCR ranges
Caz Yokoyama (1):

Revert "drm/i915/tgl: Add extra hdc flush workaround"

Check the comment on Debian Bug #954817
https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=954817#17

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1868551/comments/29

wim

unread,
Apr 16, 2020, 6:20:03 AM4/16/20
to
Package: src:linux
Followup-For: Bug #949369

Dear Maintainer,

Confirmation of this bug,
it seems to happen at random,
i don't know if it a kernel issue or a nouveau issue

from $grep -n3 chip /var/log/messages

15416-Apr 14 09:15:30 /usr/lib/gdm3/gdm-x-session[3388]: (--) modeset(0): HDMI max TMDS frequency 225000KHz
15417-Apr 14 09:15:38 kernel: [ 481.351289] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
15418-Apr 14 09:15:46 kernel: [ 489.351633] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
15419:Apr 14 09:15:46 kernel: [ 489.352390] i915 0000:00:02.0: Resetting chip for hang on rcs0
15420-Apr 14 09:15:46 /usr/lib/gdm3/gdm-x-session[3388]: (II) modeset(0): EDID vendor "DEL", prod id 16505
15421-Apr 14 09:15:46 /usr/lib/gdm3/gdm-x-session[3388]: (II) modeset(0): Using hsync ranges from config file
15422-Apr 14 09:15:46 /usr/lib/gdm3/gdm-x-session[3388]: (II) modeset(0): Using vrefresh ranges from config file

# lsb_release -a
No LSB modules are available.
Distributor ID: Debian
Description: Debian GNU/Linux 10 (buster)
Release: 10
Codename: buster
# uname -v
#1 SMP Debian 5.4.19-1~bpo10+1 (2020-03-09)

$ lspci | grep ' VGA ' | cut -d" " -f 1 | xargs -i lspci -v -s {}
00:02.0 VGA compatible controller: Intel Corporation UHD Graphics 630 (Mobile) (rev 02) (prog-if 00 [VGA controller])
Subsystem: Hewlett-Packard Company UHD Graphics 630 (Mobile)
Flags: bus master, fast devsel, latency 0, IRQ 154
Memory at e4000000 (64-bit, non-prefetchable) [size=16M]
Memory at a0000000 (64-bit, prefetchable) [size=256M]
I/O ports at 5000 [size=64]
[virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
Capabilities: <access denied>
Kernel driver in use: i915
Kernel modules: i915

01:00.0 VGA compatible controller: NVIDIA Corporation Device 1fb8 (rev a1) (prog-if 00 [VGA controller])
Subsystem: Hewlett-Packard Company Device 860f
Flags: bus master, fast devsel, latency 0, IRQ 207
Memory at e5000000 (32-bit, non-prefetchable) [size=16M]
Memory at 80000000 (64-bit, prefetchable) [size=256M]
Memory at 90000000 (64-bit, prefetchable) [size=32M]
I/O ports at 4000 [size=128]
Expansion ROM at e6080000 [disabled] [size=512K]
Capabilities: <access denied>
Kernel driver in use: nouveau
Kernel modules: nouveau

$ optirun glxinfo|egrep "OpenGL vendor|OpenGL renderer"
[ 921.728466] [ERROR]Cannot access secondary GPU - error: [XORG] (EE) Unknown chipset: NV167

[ 921.728481] [ERROR]Aborting because fallback start is disabled.

# lshw -C display
*-display
description: VGA compatible controller
product: NVIDIA Corporation
vendor: NVIDIA Corporation
physical id: 0
bus info: pci@0000:01:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller bus_master cap_list rom
configuration: driver=nouveau latency=0
resources: irq:207 memory:e5000000-e5ffffff memory:80000000-8fffffff memory:90000000-91ffffff ioport:4000(size=128) memory:e6080000-e60fffff
*-display
description: VGA compatible controller
product: Intel Corporation
vendor: Intel Corporation
physical id: 2
bus info: pci@0000:00:02.0
version: 02
width: 64 bits
clock: 33MHz
capabilities: pciexpress msi pm vga_controller bus_master cap_list rom
configuration: driver=i915 latency=0
resources: irq:154 memory:e4000000-e4ffffff memory:a0000000-afffffff ioport:5000(size=64) memory:c0000-dffff

Might help some people:
https://gist.github.com/szpak/71081b40217fb27c7a565b8c7b972067

The same error happens with only the intel vga enabled in the bios,
so this is not bumblebee or alike related.

hth,
Wim

-- System Information:
Debian Release: 10.3
APT prefers stable-updates
APT policy: (500, 'stable-updates'), (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 5.4.0-0.bpo.4-amd64 (SMP w/16 CPU cores)
Locale: LANG=nl_BE.UTF-8, LC_CTYPE=nl_BE.UTF-8 (charmap=UTF-8), LANGUAGE=nl_BE.UTF-8 (charmap=UTF-8)
0 new messages