You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
srun -p super_priority --job-name=psg --gres=gpu:1 --ntasks-per-node=1 --cpus-per-task=5 --kill-on-bad-exit=1 python -u tools/train.py configs/mask2former/mask2former_r50_lsj_8x2_50e_coco-panoptic.py --work-dir=work_dirs/mask2former_r50_ips --launcher=slurm
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: Unable to allocate resources: Protocol authentication error
具体报错情况如上,请问应该在哪里来做调整?
The text was updated successfully, but these errors were encountered:
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: If munged is up, restart with --num-threads=10
srun: error: Munge encode failed: Failed to access "/var/run/munge/munge.socket.2": No such file or directory
srun: error: slurm_send_node_msg: auth_g_create: REQUEST_RESOURCE_ALLOCATION has authentication error
srun: error: Srun communication socket apparently being written to by something other than Slurm
srun: error: Unable to allocate resources: Protocol authentication error
具体报错情况如上,请问应该在哪里来做调整?
The text was updated successfully, but these errors were encountered: