More storage needed for the web server file system #67

Closed
milanmlft opened this issue Aug 15, 2023 · 9 comments · Fixed by #68
Labels
bug Something isn't working

Comments

@milanmlft
Collaborator

While running the IDEAS IBASH workshop, I hit an error when running the "Converts NIFTI to 256 conformed MGZ file" command:

org.mandas.docker.client.exceptions.ImagePullFailedException: Image pull failed: healthbioscienceideas/fastsurfer:gpu: ProgressMessage{id=null, status=null, stream=null, error=write /var/lib/docker/tmp/GetImageBlob4206791876: no space left on device, progress=null, progressDetail=null, buildImageId=null, digest=null}

In particular, "no space left on device" suggests that the instance pulling the image is running out of disk space.

We should probably increase the root_block_device_size in terraform.tfvars_sample:

"xnat_web" = 10
"xnat_db" = 10
"xnat_cserv" = 10
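
Before resizing, it may be worth confirming how much space is actually left on the affected instance. The following is a minimal sketch, not part of this project: it reports free space on the filesystem that holds /var/lib/docker (the path taken from the error above) and falls back to / if that directory does not exist. The helper names free_gib and has_room are made up for illustration.

```python
import os
import shutil

GIB = 1024 ** 3


def free_gib(path):
    """Return the free space, in GiB, on the filesystem that contains `path`."""
    return shutil.disk_usage(path).free / GIB


def has_room(path, required_gib):
    """Return True if at least `required_gib` GiB are free at `path`."""
    return free_gib(path) >= required_gib


if __name__ == "__main__":
    # /var/lib/docker is where the failed image pull tried to write;
    # fall back to the root filesystem if Docker is not installed here.
    docker_dir = "/var/lib/docker"
    path = docker_dir if os.path.exists(docker_dir) else "/"
    print(f"{path}: {free_gib(path):.1f} GiB free")
```

Running this on the instance that pulls the image would show whether the root block device is the filesystem that fills up.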

@milanmlft milanmlft added the bug Something isn't working label Aug 15, 2023
@milanmlft milanmlft self-assigned this Aug 15, 2023
@milanmlft
Collaborator Author

Also, root_block_device["xnat_db"] isn't used anywhere, so we should probably remove it.

@p-j-smith
Contributor

That's strange. The EFS should be storing all the data, and it should expand as necessary.

@milanmlft
Collaborator Author

Hmm yeah not sure yet what's happening, I'll play around with these settings and see if I can get it to work.

@milanmlft
Collaborator Author

So after increasing the xnat_web and xnat_cserv root block device sizes to 30 GB, the no space left on device error disappeared, but now I'm getting a new one:

Could not create container from image healthbioscienceideas/fastsurfer:gpu

Note that before, for some reason (I can't remember where it came from), I had 30 GB for the xnat_web block size and 10 GB for the xnat_cserv block size, so the original error might have been due to that difference. Maybe we should use a single root_block_device_size variable for both the web server and the container server (xnat_cserv)?

@p-j-smith
Contributor

We previously increased the size of the root block device to 30 GB (#32) so that we wouldn't run out of disk space after uploading the data. But the data should now all be stored on the EFS rather than the root block device, so the symlink between /data (root block device) and /storage/data (mounted EFS volume) might not be being created correctly.
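
A rough way to test that hypothesis on the web server is to inspect the link directly. The sketch below assumes /data is meant to be a symlink that resolves to /storage/data; the direction of the link and the function name check_data_symlink are assumptions for illustration, not taken from the deployment code.

```python
import os


def check_data_symlink(link="/data", target="/storage/data"):
    """Return a short status string describing `link`.

    Assumes `link` should be a symlink resolving to `target`.
    """
    if not os.path.lexists(link):
        return "missing"
    if not os.path.islink(link):
        # A plain directory here would mean data lands on the root block device.
        return "not a symlink"
    resolved = os.path.realpath(link)
    if resolved != os.path.realpath(target):
        return f"points to {resolved}"
    return "ok"


if __name__ == "__main__":
    print(check_data_symlink())
```

Anything other than "ok" would suggest that uploads are still being written to the root block device instead of the EFS.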

@p-j-smith
Contributor

> Could not create container from image healthbioscienceideas/fastsurfer:gpu

I just checked and the image does exist with that tag (https://hub.docker.com/layers/healthbioscienceideas/fastsurfer/gpu/images/sha256-8110b0a8e4ae31a96ca3993ffee039200bb5f8aa53287ec575b271e884f00608?context=explore), so maybe there's an issue in the Dockerfile for that image. Was there any more info in the error message?

@milanmlft
Collaborator Author

Nope, only that, I just tried with deepmi/fastsurfer:latest instead, but same error 😕
The dcm2niix container still works though, so it's not a general Docker container problem.

@p-j-smith
Contributor

> Nope, only that, I just tried with deepmi/fastsurfer:latest instead, but same error 😕
> The dcm2niix container still works though, so it's not a general Docker container problem.

😞 It might be that we do indeed need to run it on a GPU.

@milanmlft
Collaborator Author

milanmlft commented Aug 15, 2023

Yeah was thinking something like that as well, so probably makes more sense to track this in #51
