Hi Julien,
Apparently your slurmdbd is quite happy, but it seems that your
slurmctld StateSaveLocation has been corrupted:
> [2022-07-19T15:17:58.356] error: Node state file /var/lib/slurm-llnl/slurmctld/node_state too small
> [2022-07-19T15:17:58.356] error: NOTE: Trying backup state save file. Information may be lost!
> [2022-07-19T15:17:58.356] debug3: Version string in node_state header is PROTOCOL_VERSION
> [2022-07-19T15:17:58.357] Recovered state of 71 nodes
> [2022-07-19T15:17:58.357] error: Job state file /var/lib/slurm-llnl/slurmctld/job_state too small
> [2022-07-19T15:17:58.357] error: NOTE: Trying backup state save file. Jobs may be lost!
> [2022-07-19T15:17:58.357] error: Incomplete job state save file
Did something bad happen to your storage of
/var/lib/slurm-llnl/slurmctld/ ? Could you possibly restore this folder
from the last backup?
I don't know if it's possible to recover from a corrupted slurmctld
StateSaveLocation, maybe some others have an experience?
Even if you could restore it, the Slurm database probably needs to be
consistent with your slurmctld StateSaveLocation, and I don't know if
this is feasible...
Could you initialize your slurm 17.02.11 and start it from scratch?
Regarding an upgrade from 17.02 or 17.11, you may find some useful notes
in my Wiki page
https://wiki.fysik.dtu.dk/niflheim/Slurm_installation#upgrading-slurm
/Ole