-
Notifications
You must be signed in to change notification settings - Fork 120
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update CrashLoopBackOff condition as soon as timeout expires #790
Comments
Grooming Discussion Results
|
On further testing we saw that although this change is fine if The case we focussed on was With this change however (assuming Solution:
|
Regarding |
We should just add a doc that the default value is set to 0. There are 2 different recommendations:
|
How to categorize this issue?
/area robustness
/kind bug
/priority 2
What happened:
Currently MCM doesn't turn
CrashLoopBackoff
(CLBF) machines toFailed
as soon ascreationTimeout
expires , but delays have been observed which can range to 2min to any time.There are 2 parts of the problem:
CreateMachine()
driver call .CreateMachine()
itself could take any amount of time as it provisions VM on the cloudprovider (in Azure big delays like 30min or more have been seen before)ShortRetry
(3min). This happens because the retry period is not calculated considering thetime left before timeout
but is just a constant value ofShortRetry
(3min)What you expected to happen:
Turn to
Failed
as soon as timeout expires.How to reproduce it (as minimally and precisely as possible):
Anything else we need to know?:
We could take inspiration from machineDeployment logic which turns
Progressing
condition toFalse
with reasonProgressDeadlineExceeded
as soon as the deadline exceeds.Environment:
kubectl version
):The text was updated successfully, but these errors were encountered: