Skip to content

Commit

Permalink
fix for job manager crash: Unable to contact slurm controller
Browse files Browse the repository at this point in the history
  • Loading branch information
vsc46128 vscuser committed Feb 21, 2024
1 parent 0e145d6 commit 75c00bb
Showing 1 changed file with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions eessi_bot_job_manager.py
Original file line number Diff line number Diff line change
Expand Up @@ -113,8 +113,14 @@ def get_current_jobs(self):
squeue_cmd,
"get_current_jobs(): squeue command",
log_file=self.logfile,
raise_on_error=False,
)

if squeue_exitcode != 0:
current_jobs = {}
log("The squeue command failed will try again in {} seconds".format(config.read_config()["job_manager"].get("poll_interval")))
return current_jobs

# create dictionary of jobs from output of 'squeue_cmd'
# with the following information per job: jobid, state,
# nodelist_reason
Expand Down

0 comments on commit 75c00bb

Please sign in to comment.