Graceful node shutdown does not work #13
Labels
No Label
todo/H&S
todo/electronics
todo/equipment
todo/event
todo/ideas
todo/information
todo/infra
todo/inspiration
todo/media
todo/merch
todo/museum
todo/presentation
todo/work-shop
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: k-space/kube#13
Loading…
Reference in New Issue
No description provided.
Delete Branch "%!s(<nil>)"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Not sure what is missing from https://github.com/kubernetes/enhancements/tree/master/keps/sig-node/2000-graceful-node-shutdown
Basically what has been tried:
f9393fd0da
Tried
systemctl daemon-reload
and node reboot but still node immediately shuts down without marking it not ready and allowing time to drain the podsThe file
/etc/systemd/logind.conf.d/99-kubelet.conf
on nodes shows:But
systemd-inhibit --list
doesn't show anything kubelet relatedNodes are Ubuntu 22.04 at the moment, but this looks like more reasons to move to Flatcar
Ubuntu by default has
unattended-upgrades
installed that contains/lib/systemd/logind.conf.d/unattended-upgrades-logind-maxdelay.conf
file withInhibitDelayMaxSec
config option.Kubelet writes its DelayMaxSec into file called
99-kubelet.conf
systemd-logind sorts all the files in its config directories and will pick the config value from last file.
This means that
unattended-upgrades-logind-maxdelay.conf
will overwrite99-kubelet.conf
InhibitDelayMaxSec.Fix is to rename 99-kubelet.conf to zz-kubelet.conf so that is loaded last https://github.com/kubernetes/kubernetes/issues/107043#issuecomment-1023756350