-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create a single repository structure, image with relevant dependencies for data pipelines #198
Comments
Related: #201 The above is related because AT PRESENT, virtually all of our data pipelines have library requirements not currently met by any existing default runtime environments. The AICoE tutorial explains how to use Thoth, Sesheta, and other CI/CD tools to build and deploy one's own images into Quay, but that quickly leads to runtime image fragmentation. The Highlander approach is in development, but not yet ready. From an ARCHITECTURAL perspective, we need to describe how people should write pipelines today (using notebooks and suffering the costs of installing and loading their own modules on a case-by-case basis), as well as where we are going (what basic approach to deciding when to create a new image, maintained by whom, and how a part of the larger data reproducibility story). I think it is unsatisfying to achieve our data reproducibility solution by creating a runtime image manageability problem. |
Produce a dedicated image pipeline repository . Goal to have an image created in the OSC Quay account . @erikerlandson |
Defer to after COP27, create standard image with pre-loaded libraries. |
See also: #98 |
still in progress; need to break out "default" versions of jupyter notebooks (AI and non-AI);
|
No description provided.
The text was updated successfully, but these errors were encountered: