Failed To Start Systemd Script To Load Sep5 Driver At Boot Time

0 views
Skip to first unread message

Егор Ульянов

unread,
Aug 4, 2024, 11:28:59 PM8/4/24
to azusocun
Thismorning I updated my system with pacman -Syyu. There was a kernel update installed & after the update had finished I found that the usb ports on the front of my computer had stopped working so decided to reboot. Upon rebooting I am greeted with a Failed to start Load Kernel Modules. error. I can login to other ttys & have tried reinstalling both linux & linux-firmware, but I'm still unable to access my desktop. I have the standard linux kernel installed along with nvidia-470 drivers. Here is the out for journalctl -b. Someone please help me please....

On kernel updates you need to rebuild your nvidia-470xx package against the new kernel, did you do that? This is easiest done by using nvidia-470xx-dkms which should automate this. if you think this to be the case what's your output for


Sorry about that. I was just freaking out a little. I'm pretty sure that the nvidia package was rebuilt, because I have the nvidia-470xx-dkms installed & I also have the hook set up. Kernel updates have never been a problem until this morning. Whenever I run sudo dkms status I can see the following....


Only workaround we came up with was using binary modules built on a newer system. With nvidia-dkms, that meant switching to nvidia/nvidia-lts. One other guy using nvidia 470 was able to copy the modules from another, newer system onto the older one that wasn't working.


Here is my cpuinfo. I'm running on a Dell optiplex 7010. I don't have access to any other computers that are running either Linux or Nvidia. Is there anything I can do without having to switch to another distro like Debian or Fedora ?? Could I maybe even rollback the necessary packages & then hold them to stop them from being updated ??


i5-3470 is significantly newer than the systems we were troubleshooting, but still older than the Haswell the one person used to build good modules. The age of the system was a theory, don't know if it'll hold up or not.


Ok so OMFG I'm so unbelievably happy right now. Stripping out the BTF info actually worked. I'm now booted back into my desktop once again. Thank you so, so, so much for helping me. One final question before I mark the thread as solved. I take it that going forward from here I need to strip out the BTF info every time I update the nvidia package ??


You are a frikin genius my friend. I can't thank you enough for this hack or whatever it is. It got my computer booting again & I'm literally so happy. Yes, yes you should definitely open up a bug report because I'm sure I'm not the only person who will encounter whatever this is....


Hello,

I kinda am in the same situation, I updated kernel and nvidia drivers as always but this time nvidia kernel module does not load and I'm not able to startx anymore.

Here are some logs:

Pkgs versions:


When I turn on my computer, make it past the GRUB menu, and type in my encryption password, I am greeted with the HP logo with the Fedora logo underneath it, and in between, a spinning circle. About 60% of the time, the circle will abruptly stop spinning, and freeze, never allowing my computer to load into GNOME. The remaining 40%, it will boot and run perfectly fine. When I press the escape button and try to boot, it will typically freeze at


I have tried using different kernels in GRUB, and the error persists. It also occurred in Fedora 37, 38, 39, and Arch Linux. Disconnecting my devices such as my keyboard and speakers have no effect, in addition to plugging out my Ethernet. Running sudo dnf distro-sync also does nothing to fix my problem.


I installed Fedora 14 with the KDE desktop. Can I make Fedora boot to a terminal rather than the GUI? I would want to boot to the terminal just 1 time so I don't want to get rid of the GUI permanently.


After some trials, I managed to repair the dnf package manager and have internet access. However even after a new dnf upgrade there are some services which cannot start normally, or they failed.


You are posting in a thread that does not involve Nvidia. Please start a new thread that mentions Nvidia. Since you can boot to terminal mode, you can use journalctl to see the error messages. Please post the output of inxi -Fzxx so we can see your hardware configuration. Have you installed non-free Nvidia drivers or are you using the nouveau driver provided by Fedora 39?


However when I try running the vtune profiler with hardware event-based sampling I get the warning displayed in the image at the bottom. Before I alsohad the warnings like: cannot locate 'vtssoo.ko', and: cannot locate debugging information for the linux kernel. Now they're gone but I'm not sure if they are solved though.


pax driver is loaded and owned by group "vtune" with file permissions "660".

socperf3 driver is not correctly loaded.

sep5 driver is not correctly loaded.

socwatch driver is loaded.

vtsspp driver is loaded and owned by group "vtune" with file permissions "660".


2 drivers are not loaded correctly. Following the instructions to build the drivers didn't help either and I still get this result.

A colleague of mine has the same setup (same OS and version) but he's using a lenovo and not macbook pro. In his case after the installation he could already run hardware event-based sampling without warnings and with all the useful information.

I'm starting to think that it might be something with the macbook pro build. What could the problem be? I would really appreciate any help because I really need this feature. Thanks a lot for you effort and time.


I have got a warning message in vtune profiling.

"amplxe: Warning: Only user space will be profiled due to credentials lack. Consider changing /proc/sys/kernel/perf_event_paranoid file for enabling kernel space profiling."

and current "perf_event_paranoid" is 2.


I would like to use the options including hotspots and hpc-performance in Vtune profiler, Advisor and Tracer.

In the current situation, Are there any methods to use the profiler without changing the "perf_event_paranoid" file?

Because I am not an administrator of whole systems, I can't modify the files.

To ask the administrator, I need solid evidence to modify the file. Is there any document containing the recommended option about "perf_event_paranoid".

Also, In my case, what is the recommended value (0 or 1)?


I have encountered an issue with VTune 2021.1-beta06 on DevCloud where it completes profiling a workload with a specific dataset, but when I try to create reports they are empty. If I run the workload with a different, smaller dataset the reports work fine.


I see the 'data collection is completed successfully' message and the progress bar gets stuck. There's not specific pattern where it gets stuck, it could be while reading the trace file or a on a dll used by the program. I tried to leaving the process running for hours to see if it would eventually finish, no luck.


To get pass this I usually have to kill VTune from the task manager; re-open VTune and load the capture on which it crashed. At which point the resolution just takes a few seconds and I'm able to see navigate my capture normally.


I have been plagued with this behavior for a while and on various versions of Vtune. I have to carefully pick a version from which I can apply the workaround described above, for example, I tried 2020 patch 1 and I'm not able to kill the Vtune instance and resume from it, therefore I had to revert to the version I'm currently using.


Earlier I was using Intel VTune profiler on a AMD machine. Though I could use User Based Sampling for my application but as it was AMD machine, therefore HW sampling was not possible. Now I am using an Intel Machine. And I would really like to use all profiling options finally.


Having said that, I could proceed with the installation but I don't want to use user mode sampling only. I would like to have all profiling options. And I think until and unless I don't resolve these two messages in preequisites, I won't be able to use all profiling option.


I am running VTune locally on my Apple laptop, attempting to analyze a remote system which runs Linux. However, due to system configuration and administration requirements within my company, I am unable to configure a remote Linux target via SSH for VTune.


Is there a way to determine the command line flags that the VTune GUI would have tried to run via SSH on the remote system? I would like to log into the remote Linux host and run that exact command manually, then download the resulting data on to my local workstation to analyze the results with the VTune GUI.

3a8082e126
Reply all
Reply to author
Forward
0 new messages