-
Notifications
You must be signed in to change notification settings - Fork 115
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
abstracting model downloads and file-normalization to models dir #190
abstracting model downloads and file-normalization to models dir #190
Conversation
baa0b3b
to
4b754a0
Compare
6ffe7e0
to
10862f1
Compare
One comment caught my eye. ghcr.io is used for testing the artifacts before they are promoted to quay.io. Hence, would you mind checking that the push pipelines are pushing the model servers container images to ghcr.io. I think that one is broken. Probably caused by the changes I merged yesterday. |
10862f1
to
227eed4
Compare
Changes since last review:
Just going to make sure the readmes line up with these changes, conflicts are resolved and tests passes and then I will tag people for re-review |
3a4d012
to
1c0bc90
Compare
We want to keep the model-servers separate from models - we'll either volume mount a model as an init container from a containerized model, as the quadlet pod yamls are doing now, or from a host-filesystem volume mount as podman-desktop AI Lab does with the sample applications - this is to keep everything as pluggable as possible at both build time and runtime. |
For the most part this PR keeps with this isolation, that being said I will need to adapt it to the new "model-as-a-container" paradigm, but ill try to rework it to support both.
I am all for this concept of normalizing as much as we can to make models easily swappable, but I also feel that @MichaelClifford did have some good points about not calling it |
holding this to focus on the funcitonal tests |
08355e2
to
1bc051c
Compare
9d2074e
to
f6fd467
Compare
b45f117
to
fc57821
Compare
Signed-off-by: greg pereira <[email protected]>
fc57821
to
d8fe927
Compare
--net=host change is an outcome of https://redhat-internal.slack.com/archives/C06S75ZF9JT/p1712851316770539 Signed-off-by: greg pereira <[email protected]>
Changes:
/models
Makefile
in the/models
directorymodel_server
directories/model_servers
download-model
with some prepathed information for tests, users responsible for their own models.REGISTRY
,IMAGE_NAME
,MODEL_NAME
,MODEL_PATH
andPORT
common
directory for allmodel_servers
for building, running, running tests, etc./.github/workflows
Makefiles
for 2 reasons: