-
Notifications
You must be signed in to change notification settings - Fork 275
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MultiKueue] Default managedBy for ClusterQueues configured to use MultiKueue AC #2022
Comments
/assign @trasc |
I guess either the cache or the client (which is also cached). Whichever is less cumbersome. There might be race conditions in either case, so that should not be a consideration when choosing between one or the other. |
One problem that I see with this is that the What it might be more problematic, down the line is:
In case the |
It should be immutable. We want consistency with Batch Job, where this was discussed at length. |
Yes, we are aware of race conditions like these, but the assumption is that ClusterQueue configurations are rare, once the system is setup. That being said, the more helpful we can be in that situations the better. For example, maybe we could detect situations like these and evict the workloads, and mark as inactive. |
Btw, I think the scenarios are sort of orthogonal, they may happen regardless if managedBy was set by a user or by an automation. |
Expanding a little bit on the mutability of managedBy. According to the mission of wg-batch we should try make the ecosystem consistent as much as possible. In this case I think it means that Jobset should try to follow the Job's decisions in the KEP. Some of the complications for making it mutable include:
Having said that, there is a graduation point for GA. but IMO pushing in this direction will require a very good justification. Since MultiKueue is still alpha, the adoption is still low, and we don't have feedback indicating the scenarios are problematic for end-users, I don't think it provides such a justification. So, for now, I would suggest to assume it is immutable. |
/assign |
@vladikkuzn I think it would be good to follow up with a docs update. I think we can drop this example. A short note, that the webhook updates the field should be enough. |
@mimowo Is it better to do it in separate PR or in the same? |
Either way would work for me. Maybe separate is slightly preferred. |
Docs follow-up: 88d459f |
What would you like to be added:
Add the
managedBy
field if the JobSet is submitted to a ClusterQueue with MultiKueue AC.Why is this needed:
Usability. The users should know as little about MK as possible, only admins. Once the system is setup by an admin the user does not need to know if this is MK or not.
It will allow for smooth transition from single cluster to multiple cluster environments.
Proposed approach
The proposed solution is to use the
jobset_webhook
which would have access to the configuration of ClusterQueues via cache.The text was updated successfully, but these errors were encountered: