-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create_tarball might not include Lmod cache and config files correctly for GPU builds #722
Comments
So this needs an adjustment in the check for creating these, in fact they shouldn't be touched at all by GPU PRs |
Well, there's no check I think, we just create and if they are different they are picked up so we just need to chomp off |
@casparvl Looking into recent accel PRs, I don't see these files appearing...so perhaps there has already been a PR to fix this? |
We also discussed this in the call on Wednesday, and I think we agreed that we do want to make these files for every accelerator? Besides having accelerator-specific hooks (which then won't be considered at all for other CPU/GPU targets), it also allows us to build an accelerator-specific Lmod cache, that we can enable by adding the |
So that would mean that instead of merging #744, we would need to make sure that these files get created and that they're included in the tarball. For the ESPResSO builds from PR #748, I do still see this output in the slurm log:
So it looks like they are still being created. Looking at the installation script, I think it only does it at the end (here: https://github.com/EESSI/software-layer/blob/2023.06-software.eessi.io/EESSI-install-software.sh#L312), but not at the beginning (here: https://github.com/EESSI/software-layer/blob/2023.06-software.eessi.io/EESSI-install-software.sh#L155). Not sure if that's an issue. I'll open a PR to make sure that these files are included in the tarball. |
I checked the contents of the files that were generated by the ESPResSO job:
That's perfect, and would allow us to generate a cache for the GPU builds. The SitePackage.lua is just an exact copy of all the other ones. That's also expected, but probably not what we want? @casparvl what was your idea here? Can you even somehow stack multiple SitePackages.lua files on top of each other? Or do you want to let the CPU one include the one for the GPU? |
Well, We do this for allowing host-specific hooks, see e.g. software-layer/create_lmodsitepackage.py Line 100 in 6cccfcd
That's how we can import from two specific file locations, see the docs on this https://www.eessi.io/docs/site_specific_config/lmod_hooks/#location-of-the-hooks . But, the order in which we do this is important. We first register the EESSI hook software-layer/create_lmodsitepackage.py Line 204 in 6cccfcd
software-layer/create_lmodsitepackage.py Line 207 in 6cccfcd
This allows sites to overwrite what we do in EESSI. Thinking about the GPU part... The CPU So probably, it requires a small modification to our current |
I see that new files were created during the install step:
But the
create_tarball
only looks atThe text was updated successfully, but these errors were encountered: