-
Environmental Info: Describe the bug: Interestingly my agents aren't joining with the error Steps To Reproduce:
Expected behavior: Actual behavior: No entries listed on a 5 node cluster. Additional context / logs: |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 5 replies
-
The docs are correct. The error you're showing also indicates that the secrets exist. Where specifically are you looking to see that the node password secrets do not exist? What is the specific disaster-recovery process that you are testing here? Is it possible that you've got some inconsistent state due to how you're restoring things? |
Beta Was this translation helpful? Give feedback.
I was able to find out why my agents weren't joining - because I was recovering from a disaster and had to restore a server node first, Rancher's webhook was not running (but was installed) because it does not tolerate the server taints.
Since Rancher's webhook was not running, it was actually blocking RKE2's ability to check/create the
node-password
secret. As soon as I tolerated the taints to get that pod running, the agent joined and anode-password
was created. It does not explain why the servers were able to join and did not create node-passwords but it at least allows me to continue.It's rather bad that a misbehaving
ValidatingWebhook
can interfere withkube-system
and therefore af…