Hi all,
FWIW, this was also reported very recently via Debian.
The following mail was sent to the package tracker -- I'm not sure if
there is a public tracker of these mails, which is why I'm dumping the
complete mail in here.
Cheers,
Georg
----- Forwarded message from Santiago Garcia Mantinan <
ma...@debian.org> -----
Date: Thu, 2 Sep 2021 13:05:38 +0200
From: Santiago Garcia Mantinan <
ma...@debian.org>
To:
gan...@packages.debian.org
Subject: Migrating from buster to bullseye, not so easy
Hi!
I have recently migrated a small ganeti cluster from buster to bullseye and
I wanted to comment with you the problems I found, in case you want me to
open bugs or whatever for any of them.
This is in no way any rant or anything, I enjoy pretty much ganeti and in
buster everything was going perfect with it, thanks for your great job.
The first thing I did was fetch the package and read the NEWS in case there
was something I should be aware of, it was good to have a message there
saying that I could migrate from buster's version to bullseye, you also
stated the typical setup, both packages installed and then migrate and
remove the old one.
This was my first problem, a dist-upgrade would remove the old package, so I
tried a simple apt install ganeti and same thing happened. I had those
problems before with other python2 based stuff, so I went for the same
solution... apt install python-is-python2 and after that, I could indeed
install both versions of ganeti on the same machine, maybe we should add a
hint for this or some other solution for these cases?
After this, I started migrating all the guests to one node, while I was
upgrading the other nodes and rebooting, it was a bad thing to see that
after the nodes were upgraded, rebooted, ... I couldn't bring the guests
back to my upgraded nodes. Ganeti would refuse to move them because there
was a problem, when I looked at the ganeti logs I could only see a message:
ganeti.errors.OpExecError: Could not pre-migrate instance prejitsi:
Failed to accept instance: Failed to start instance prejitsi: exited
with exit code 1 (kvm: -usbdevice tablet: '-usbdevice' is deprecated,
please use '-device usb-...' instead
This was completely misleading, as this message is only a warning, not an
error, the real error was...
kvm: -vnc
127.0.0.1:5142: keymap include files are not supported any more
Which I just couldn't see anywhere until I looked at the kvm logs, don't
know why I didn't see it, now I see it was on Ganeti's logs as well.
Anyway, the error was caused by me having the keymap set to spanish, so what
I did was:
gnt-instance modify -H keymap= guest
for every guest, but having them change the keymap meant a reboot off all
guests.
Having done that I thought I had finished, as I had all the nodes updated to
bullseye and cleaned, the cluster is running ok and everything looks fine,
so... I started moving tghe guests to their default node and... guests
started to freeze as they reached the destination node :-(
When I tried to see what had happened to those guests connecting to the
console... I got:
# gnt-instance console sid
Instance sid is paused, unpausing
Farder investigation of what had happened revealed
# cat /var/log/ganeti/kvm/sid.log
kvm: Could not open '/var/run/ganeti/instance-disks/sid:0': Permission denied
# ls -l /var/run/ganeti/instance-disks/sid:0
lrwxrwxrwx 1 root root 11 sep 2 12:48 /var/run/ganeti/instance-disks/sid:0 -> /dev/drbd11
# ls -l /dev/drbd*
brw-rw---- 1 root disk 147, 0 sep 2 12:50 /dev/drbd0
brw-rw---- 1 root disk 147, 1 sep 2 12:50 /dev/drbd1
brw-rw---- 1 root disk 147, 10 sep 2 12:50 /dev/drbd10
brw-rw---- 1 root disk 147, 11 sep 2 12:48 /dev/drbd11
# id sid
uid=123(sid) gid=105(kvm) grupos=105(kvm)
I run the machine as user sid, of course user sid cannot open the drbd I
don't think it should either.
I tested to see if this was the real problem, I changed group from disk to
kvm on the secondary node of sid and then did a migration without any
problem.
I feel like the last problem I found is a bug and I'll try to submit it as
soon as possible unless you tell me not to, as for the other two, I don't
know if they qualify as such or not, maybe we should add some info on the
release notes or similar?
You tell me what to do.
Thanks in advance.
--
Manty/BestiaTester ->
http://manty.net
----- End forwarded message -----