Disable max jobs limit, let Azure batch handle job queue #52

Open
lcjohnso opened this issue Oct 7, 2024 · 0 comments


lcjohnso commented Oct 7, 2024

Current Behavior

Azure Batch pools are currently capped at two nodes (i.e., two jobs) at a time. At job submission time (say, for a prediction job), BaJoR inspects the current state of the pool's job queue and rejects the job if N_jobs >= N_max_nodes -- see this code which is evaluated and used to throw errors here.
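
A minimal sketch of the kind of check described above, for illustration only. This is not BaJoR's actual code: the function name, the exception type, and the `MAX_NODES` constant are hypothetical, and the active job count is passed in as an argument rather than fetched from Azure Batch.

```python
# Hypothetical illustration of the current go/no-go gate; all names are assumptions.
MAX_NODES = 2  # the pool cap described in this issue


class JobQueueFullError(Exception):
    """Raised when the (hypothetical) gate rejects a submission."""


def check_can_submit(active_job_count: int, max_nodes: int = MAX_NODES) -> None:
    # Reject the submission when N_jobs >= N_max_nodes, as described above.
    if active_job_count >= max_nodes:
        raise JobQueueFullError(
            f"{active_job_count} active jobs, pool limit is {max_nodes}"
        )
```

With the cap at two, a third submission is refused outright instead of waiting for a free node.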

It is not clear why BaJoR takes any responsibility for checking the queue and making go/no-go decisions based on the current state of the pool's job queue.

Desired Behavior

Let the Azure Batch pool handle the job queue: in the case that N_jobs >= N_max_nodes, let BaJoR submit the job to the pool and let Azure Batch be responsible for keeping the new job in a queue until there is a node available to run it.
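
The intended behavior can be sketched with an in-memory stand-in for the pool. `FakePool` and its methods are hypothetical and only model the queueing idea. They are not the Azure Batch SDK, and real scheduling is left entirely to Azure Batch.

```python
from collections import deque


class FakePool:
    """In-memory stand-in for an Azure Batch pool (hypothetical, for illustration)."""

    def __init__(self, max_nodes):
        self.max_nodes = max_nodes
        self.running = []      # job ids currently occupying a node
        self.queued = deque()  # job ids waiting for a node

    def submit(self, job_id):
        # Always accept the job: run it if a node is free, otherwise hold it.
        if len(self.running) < self.max_nodes:
            self.running.append(job_id)
            return "running"
        self.queued.append(job_id)
        return "queued"

    def finish(self, job_id):
        # Free the node and start the oldest waiting job, if any.
        self.running.remove(job_id)
        if self.queued:
            self.running.append(self.queued.popleft())
```

In this model the caller never needs to inspect the queue before submitting. A job that arrives while both nodes are busy is simply held until `finish` frees a node.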
