Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

freezing / unstable Debian Testing on MSI Stealth GS77 laptop

43 views
Skip to first unread message

JD

unread,
Jan 25, 2023, 4:00:07 AM1/25/23
to
Hello,

I'm trying to find some help in order to debug or resolve the issues I'm
facing. Hopefully, someone will be able to help me here.

The main problem that prevents me from trying to solve it myself is the
absence of logs and messages.


So, my laptop MSI Stealth GS77 with a Debian testing (up to date)
installed on it can freeze when launching 3D programs (both with setting
the nVidia card or the integrated Intel chipset). Once it also
complained about missing SSD drive (nvme). I currently don't know if
both could be related, but I'd say it's unlikely.


Some information about my system:

I installed Debian testing since latest Debian stable had many issues
(no audio, not wifi, unstable 3D graphics...). Debian stable with
backports didn't helped much (it fixed some issues but not all).
Therefore, the most practicable is Debian testing.


The issue I have is that seldom (but quite often), when running a 3D
program, the OS is completely freezing. There is no way to retrieve any
logs or information about the issue. The screen freezes, the keyboard
doesn't respond. The only thing I can do is to use a long press on the
power button to switch the laptop off (alt sys o doesn't work either).
And on the next reboot, my filesystem fixes wrong inodes and therefore
this is not possible to consultate any logs.


I don't really like to say that, but the laptop works well on Windows 11
(no freeze, no crash, no nvme issues). Thus my belief that the issue is
on the Linux side.


More information:

Desktop: XFCE

nvidia: latest proprietary modules available on Debian. Also tested with
modules directly provided by nVidia.

3D programs run on nVidia or the Intel chipset (with setting
appropriately __NV_PRIME_RENDER_OFFLOAD and __GLX_VENDOR_LIBRARY_NAME).

I played a lot with bumblebee, update-glx, keeping updating packages,
but nothing worked at this time.


uname -a
Linux wormhole 6.1.0-1-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.4-1
(2023-01-07) x86_64 GNU/Linux


lspci
00:00.0 Host bridge: Intel Corporation 12th Gen Core Processor Host
Bridge/DRAM Registers (rev 02)
00:01.0 PCI bridge: Intel Corporation 12th Gen Core Processor PCI
Express x16 Controller #1 (rev 02)
00:02.0 VGA compatible controller: Intel Corporation Alder Lake-P
Integrated Graphics Controller (rev 0c)
00:04.0 Signal processing controller: Intel Corporation Alder Lake
Innovation Platform Framework Processor Participant (rev 02)
00:06.0 PCI bridge: Intel Corporation 12th Gen Core Processor PCI
Express x4 Controller #0 (rev 02)
00:07.0 PCI bridge: Intel Corporation Alder Lake-P Thunderbolt 4 PCI
Express Root Port #1 (rev 02)
00:08.0 System peripheral: Intel Corporation 12th Gen Core Processor
Gaussian & Neural Accelerator (rev 02)
00:0d.0 USB controller: Intel Corporation Alder Lake-P Thunderbolt 4 USB
Controller (rev 02)
00:0d.2 USB controller: Intel Corporation Alder Lake-P Thunderbolt 4 NHI
#0 (rev 02)
00:0d.3 USB controller: Intel Corporation Alder Lake-P Thunderbolt 4 NHI
#1 (rev 02)
00:12.0 Serial controller: Intel Corporation Alder Lake-P Integrated
Sensor Hub (rev 01)
00:14.0 USB controller: Intel Corporation Alder Lake PCH USB 3.2 xHCI
Host Controller (rev 01)
00:14.2 RAM memory: Intel Corporation Alder Lake PCH Shared SRAM (rev 01)
00:14.3 Network controller: Intel Corporation Alder Lake-P PCH CNVi WiFi
(rev 01)
00:15.0 Serial bus controller: Intel Corporation Alder Lake PCH Serial
IO I2C Controller #0 (rev 01)
00:16.0 Communication controller: Intel Corporation Alder Lake PCH HECI
Controller (rev 01)
00:1c.0 PCI bridge: Intel Corporation Device 51b8 (rev 01)
00:1c.6 PCI bridge: Intel Corporation Device 51be (rev 01)
00:1c.7 PCI bridge: Intel Corporation Alder Lake PCH-P PCI Express Root
Port #9 (rev 01)
00:1f.0 ISA bridge: Intel Corporation Alder Lake PCH eSPI Controller
(rev 01)
00:1f.3 Multimedia audio controller: Intel Corporation Alder Lake PCH-P
High Definition Audio Controller (rev 01)
00:1f.4 SMBus: Intel Corporation Alder Lake PCH-P SMBus Host Controller
(rev 01)
00:1f.5 Serial bus controller: Intel Corporation Alder Lake-P PCH SPI
Controller (rev 01)
01:00.0 3D controller: NVIDIA Corporation GA104 [Geforce RTX 3070 Ti
Laptop GPU] (rev a1)
01:00.1 Audio device: NVIDIA Corporation GA104 High Definition Audio
Controller (rev a1)
02:00.0 Non-Volatile memory controller: Samsung Electronics Co Ltd NVMe
SSD Controller PM9A1/PM9A3/980PRO
2e:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS5261
PCI Express Card Reader (rev 01)
2f:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. Killer
E3000 2.5GbE Controller (rev 06)


lsmod
Module                  Size  Used by
dm_mod                184320  0
snd_ctl_led            24576  0
snd_soc_skl_hda_dsp    24576  6
snd_soc_intel_hda_dsp_common    20480  1 snd_soc_skl_hda_dsp
snd_soc_hdac_hdmi      45056  1 snd_soc_skl_hda_dsp
snd_sof_probes         24576  0
snd_soc_dmic           16384  1
mei_hdcp               24576  0
intel_rapl_msr         20480  0
gpio_keys              20480  0
x86_pkg_temp_thermal    20480  0
intel_powerclamp       20480  0
kvm_intel             380928  0
kvm                  1130496  1 kvm_intel
irqbypass              16384  1 kvm
ghash_clmulni_intel    16384  0
aesni_intel           393216  0
crypto_simd            16384  1 aesni_intel
cryptd                 28672  2 crypto_simd,ghash_clmulni_intel
rapl                   20480  0
intel_cstate           20480  0
intel_uncore          212992  0
pcspkr                 16384  0
wmi_bmof               16384  0
rfcomm                 90112  16
cmac                   16384  4
algif_hash             16384  1
ecb                    16384  2
algif_skcipher         16384  1
af_alg                 36864  6 algif_hash,algif_skcipher
bnep                   28672  2
snd_hda_codec_realtek   167936  1
snd_hda_codec_generic    98304  1 snd_hda_codec_realtek
ledtrig_audio          16384  2 snd_ctl_led,snd_hda_codec_generic
iwlmvm                385024  0
snd_sof_pci_intel_tgl    16384  0
snd_sof_intel_hda_common   188416  1 snd_sof_pci_intel_tgl
soundwire_intel        49152  1 snd_sof_intel_hda_common
soundwire_generic_allocation    16384  1 soundwire_intel
mac80211             1171456  1 iwlmvm
soundwire_cadence      40960  1 soundwire_intel
snd_sof_intel_hda      20480  1 snd_sof_intel_hda_common
snd_sof_pci            24576  2
snd_sof_intel_hda_common,snd_sof_pci_intel_tgl
snd_sof_xtensa_dsp     16384  1 snd_sof_intel_hda_common
snd_sof               274432  3
snd_sof_pci,snd_sof_intel_hda_common,snd_sof_probes
snd_sof_utils          20480  1 snd_sof
snd_soc_hdac_hda       24576  1 snd_sof_intel_hda_common
snd_hda_ext_core       40960  3
snd_sof_intel_hda_common,snd_soc_hdac_hdmi,snd_soc_hdac_hda
snd_soc_acpi_intel_match    73728  2
snd_sof_intel_hda_common,snd_sof_pci_intel_tgl
snd_soc_acpi           16384  2
snd_soc_acpi_intel_match,snd_sof_intel_hda_common
libarc4                16384  1 mac80211
snd_soc_core          348160  8
soundwire_intel,snd_sof,snd_sof_intel_hda_common,snd_soc_hdac_hdmi,snd_soc_hdac_hda,snd_sof_probes,snd_soc_dmic,snd_soc_skl_hda_dsp
snd_compress           28672  2 snd_soc_core,snd_sof_probes
iwlwifi               360448  1 iwlmvm
snd_hda_codec_hdmi     81920  2
soundwire_bus         102400  3
soundwire_intel,soundwire_generic_allocation,soundwire_cadence
btusb                  65536  0
btrtl                  28672  1 btusb
btbcm                  24576  1 btusb
btintel                45056  1 btusb
btmtk                  16384  1 btusb
bluetooth             950272  44 btrtl,btmtk,btintel,btbcm,bnep,btusb,rfcomm
jitterentropy_rng      16384  1
snd_hda_intel          57344  1
snd_intel_dspcfg       36864  3
snd_hda_intel,snd_sof,snd_sof_intel_hda_common
iTCO_wdt               16384  0
snd_intel_sdw_acpi     20480  2 snd_sof_intel_hda_common,snd_intel_dspcfg
intel_pmc_bxt          16384  1 iTCO_wdt
snd_hda_codec         184320  8
snd_hda_codec_generic,snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec_realtek,snd_soc_intel_hda_dsp_common,snd_soc_hdac_hda,snd_sof_intel_hda,snd_soc_skl_hda_dsp
sha512_ssse3           49152  1
iTCO_vendor_support    16384  1 iTCO_wdt
sha512_generic         16384  1 sha512_ssse3
snd_hda_core          122880  11
snd_hda_codec_generic,snd_hda_codec_hdmi,snd_hda_intel,snd_hda_ext_core,snd_hda_codec,snd_hda_codec_realtek,snd_soc_intel_hda_dsp_common,snd_sof_intel_hda_common,snd_soc_hdac_hdmi,snd_soc_hdac_hda,snd_sof_intel_hda
snd_hwdep              16384  1 snd_hda_codec
ctr                    16384  0
watchdog               45056  1 iTCO_wdt
mei_me                 53248  1
drbg                   45056  1
qrtr                   49152  4
msi_wmi                20480  0
cfg80211             1122304  3 iwlmvm,iwlwifi,mac80211
mei                   159744  3 mei_hdcp,mei_me
snd_pcm               159744  11
snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec,soundwire_intel,snd_sof,snd_sof_intel_hda_common,snd_soc_hdac_hdmi,snd_compress,snd_soc_core,snd_sof_utils,snd_hda_core
ansi_cprng             16384  0
hid_sensor_als         20480  0
hid_sensor_trigger     20480  2 hid_sensor_als
snd_timer              49152  1 snd_pcm
processor_thermal_device_pci    16384  0
hid_sensor_iio_common    24576  2 hid_sensor_trigger,hid_sensor_als
processor_thermal_device    20480  1 processor_thermal_device_pci
ecdh_generic           16384  2 bluetooth
industrialio_triggered_buffer    16384  1 hid_sensor_trigger
processor_thermal_rfim    16384  1 processor_thermal_device
snd                   126976  27
snd_ctl_led,snd_hda_codec_generic,snd_hda_codec_hdmi,snd_hwdep,snd_hda_intel,snd_hda_codec,snd_hda_codec_realtek,snd_sof,snd_timer,snd_soc_hdac_hdmi,snd_compress,snd_soc_core,snd_pcm
kfifo_buf              16384  1 industrialio_triggered_buffer
processor_thermal_mbox    16384  2
processor_thermal_rfim,processor_thermal_device
rfkill                 36864  8 iwlmvm,bluetooth,cfg80211
processor_thermal_rapl    20480  1 processor_thermal_device
ecc                    40960  1 ecdh_generic
industrialio          110592  4
industrialio_triggered_buffer,hid_sensor_trigger,kfifo_buf,hid_sensor_als
intel_rapl_common      32768  2 intel_rapl_msr,processor_thermal_rapl
soundcore              16384  2 snd_ctl_led,snd
uinput                 20480  1
binfmt_misc            24576  1
nls_ascii              16384  1
nls_cp437              20480  1
vfat                   24576  1
fat                    90112  1 vfat
int3403_thermal        20480  0
int340x_thermal_zone    20480  2 int3403_thermal,processor_thermal_device
ac                     20480  0
intel_hid              24576  0
sparse_keymap          16384  2 intel_hid,msi_wmi
int3400_thermal        20480  0
acpi_thermal_rel       16384  1 int3400_thermal
intel_pmc_core         53248  0
soc_button_array       20480  0
acpi_tad               20480  0
acpi_pad              184320  0
nvidia_drm             73728  5
nvidia_modeset       1155072  7 nvidia_drm
nvidia              39190528  631 nvidia_modeset
hid_multitouch         32768  0
joydev                 28672  0
evdev                  28672  43
serio_raw              20480  0
coretemp               20480  0
parport_pc             40960  0
ppdev                  24576  0
lp                     20480  0
parport                73728  3 parport_pc,lp,ppdev
fuse                  176128  3
efi_pstore             16384  0
configfs               57344  1
efivarfs               24576  1
ip_tables              36864  0
x_tables               61440  1 ip_tables
autofs4                53248  2
ext4                  978944  2
crc16                  16384  2 bluetooth,ext4
mbcache                16384  1 ext4
jbd2                  167936  1 ext4
crc32c_generic         16384  0
hid_logitech_hidpp     53248  0
hid_logitech_dj        28672  0
hid_sensor_custom      28672  0
hid_sensor_hub         28672  4
hid_sensor_trigger,hid_sensor_iio_common,hid_sensor_als,hid_sensor_custom
intel_ishtp_hid        28672  0
usbhid                 65536  2 hid_logitech_dj,hid_logitech_hidpp
i915                 3317760  17
nvme                   53248  4
drm_buddy              20480  1 i915
i2c_algo_bit           16384  1 i915
drm_display_helper    212992  1 i915
nvme_core             159744  6 nvme
hid_generic            16384  0
cec                    61440  2 drm_display_helper,i915
xhci_pci               20480  0
rc_core                69632  1 cec
t10_pi                 16384  1 nvme_core
xhci_hcd              315392  1 xhci_pci
r8169                  94208  0
ttm                    94208  1 i915
rtsx_pci_sdmmc         32768  0
crc64_rocksoft         20480  1 t10_pi
crc64                  20480  1 crc64_rocksoft
i2c_hid_acpi           16384  0
mmc_core              208896  1 rtsx_pci_sdmmc
usbcore               344064  4 xhci_hcd,usbhid,btusb,xhci_pci
drm_kms_helper        229376  3 drm_display_helper,nvidia_drm,i915
intel_lpss_pci         28672  0
i2c_hid                32768  1 i2c_hid_acpi
crc_t10dif             20480  1 t10_pi
realtek                36864  1
i2c_i801               36864  0
intel_ish_ipc          28672  0
crct10dif_generic      16384  0
intel_lpss             16384  1 intel_lpss_pci
mdio_devres            16384  1 r8169
drm                   663552  20
drm_kms_helper,drm_display_helper,nvidia,drm_buddy,nvidia_drm,i915,ttm
thunderbolt           376832  0
libphy                180224  3 r8169,mdio_devres,realtek
psmouse               184320  0
crct10dif_pclmul       16384  1
crc32_pclmul           16384  0
rtsx_pci              114688  1 rtsx_pci_sdmmc
crc32c_intel           24576  4
intel_ishtp            61440  2 intel_ishtp_hid,intel_ish_ipc
i2c_smbus              20480  1 i2c_i801
hid                   155648  8
i2c_hid,usbhid,hid_multitouch,hid_sensor_hub,intel_ishtp_hid,hid_generic,hid_logitech_dj,hid_logitech_hidpp
idma64                 20480  0
usb_common             16384  2 xhci_hcd,usbcore
crct10dif_common       16384  3
crct10dif_generic,crc_t10dif,crct10dif_pclmul
video                  65536  3 msi_wmi,i915,nvidia_modeset
battery                28672  0
button                 24576  0
wmi                    36864  3 video,wmi_bmof,msi_wmi


Thank you.

Martin Petersen

unread,
Jan 29, 2023, 1:30:06 PM1/29/23
to
Hi JD,

sorry to hear that you have these kind of troubles. I dont know much
about this gpu/igpu setups, but I want to give some proceedings I would
choose:

You write that there dont seem to be any logs.

Where did You look? I would expect, that there might me /var/log/syslog
messages concerning kernel panics or maybe SystemD journal entries from
that last boot (journalctl -b 1 or smth).

What kind of logs did You check or are some logs Your google searches
might have produced not there or empty? They might have to bee enabled.

Also there might be error messages of the xserver /var/log/Xorg.0.log.

On top I would take a look into the messages of my Xsession itself
(~.xsession-errors)

Many places to check, I'm afraid to say :)

If You have another system You could ssh into Your crash prone laptop to
get messages live.

Posting some details about the versions of the nvidia modules and such
might be useful. Some other readers might be able to compare such strings.

And which CPU version does Your notebook have? (cat /proc/cpuinfo,
maybe). If this is a CPU in a combo setup "some big/beefy cores and some
lightweight/lowpower cores" it have implications w. applications in my
opinion.

And one thing I would like to recommend: try another, maybe more recent
system. Boot from a ubuntu or fedora or some outher fresh distributions
live iso and check / compare modules and configuration parameters and -
if possible - behaviour of the particular applications.

Good luck and please post log contents or snippes of logs, if You find some.

Cheers,

Martin
0 new messages