-
Notifications
You must be signed in to change notification settings - Fork 103
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spice errors on offline guest backups #1661
Comments
Hi @pboguslawski that sounds interesting. Can you verify that the port is still in listening mode and is there still a process listed along with it while the error occurs? e.g. with |
Below log of failed backup job (standard ganeti on Debian 11, without our patch) with added
Same result in Seems that spice port is still shutting down when vm is started after snapshot and this causes the problem. |
10s sleep added after vm shutdown was not enough - sometimes error still occured; sleep was increased to 15s and no such errors from a few weeks. Looks like something connected with socket shutdown like on: https://unix.stackexchange.com/questions/17218/how-long-is-a-tcp-local-socket-address-that-has-been-bound-unavailable-after-clo |
We've noticed offline backup problems with one guest on ganeti host; problem does not occur on every backup; it occurs on one guest only (old server hardware so maybe some kind of race connected with processing speed).
Ganeti 2 from standard Debian 9 packages:
Ganeti 3 from standard Debian 11 packages:
Seems that qemu process cannot bind to spice port which is still used for some time after guest shutdown. Attached please find patch that resolves this issue for us in Debian 11 (10s sleep added after vm shutdown, before starting it again during offline backup).
backup.py-IB1107521.patch.gz
BTW: Seems that error message content is not present in job info in ganeti 3 (it was avaliable in ganeti 2 as above). Probably another issue?
The text was updated successfully, but these errors were encountered: