Hi @ZeroAct - could you elaborate on what you are trying to achieve? I'm not sure I understand your use case well enough to answer the question. Is it related to dynamically loading and unloading models in a BentoServer?
-
I have only 8 GB of GPU RAM.
But I want the BentoService to have multiple artifacts (over 8 GB in total).
Does BentoML provide any function for this?
Or do I have to implement it myself?
Thank you!
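To make the question more concrete: if this does have to be implemented by hand, one possible approach is a least-recently-used cache that loads a model on demand and drops older ones when a memory budget would be exceeded. The sketch below is only an illustration under several assumptions. `ModelCache`, `register`, `get`, and the size estimates are hypothetical names and values, not part of BentoML, and the loader callables stand in for whatever actually loads an artifact onto the GPU.

```python
from collections import OrderedDict
from typing import Any, Callable, Dict, List, Tuple


class ModelCache:
    """Hypothetical helper: keep loaded models under a memory budget (LRU eviction)."""

    def __init__(self, budget_bytes: int) -> None:
        self.budget_bytes = budget_bytes
        # name -> (loader callable, estimated size in bytes); sizes are user-supplied guesses
        self._loaders: Dict[str, Tuple[Callable[[], Any], int]] = {}
        # name -> loaded model, ordered from least to most recently used
        self._loaded: "OrderedDict[str, Any]" = OrderedDict()

    def register(self, name: str, loader: Callable[[], Any], size_bytes: int) -> None:
        if size_bytes > self.budget_bytes:
            raise ValueError(f"{name} alone exceeds the memory budget")
        self._loaders[name] = (loader, size_bytes)

    def used_bytes(self) -> int:
        return sum(self._loaders[name][1] for name in self._loaded)

    def loaded_names(self) -> List[str]:
        return list(self._loaded)

    def get(self, name: str) -> Any:
        if name in self._loaded:
            # Cache hit: mark as most recently used.
            self._loaded.move_to_end(name)
            return self._loaded[name]
        loader, size = self._loaders[name]
        # Evict least recently used models until the new one fits.
        while self._loaded and self.used_bytes() + size > self.budget_bytes:
            # Dropping the reference only makes the model collectable; a real
            # GPU framework may need an explicit call to release device memory.
            self._loaded.popitem(last=False)
        model = loader()
        self._loaded[name] = model
        return model


GB = 1024 ** 3
cache = ModelCache(budget_bytes=8 * GB)
cache.register("a", lambda: "model-a", 5 * GB)  # placeholder loaders and sizes
cache.register("b", lambda: "model-b", 5 * GB)
cache.register("c", lambda: "model-c", 2 * GB)

cache.get("a")
cache.get("c")
cache.get("b")  # does not fit next to "a", so "a" is evicted first
print(cache.loaded_names())
```

The trade-off with this kind of design is latency: a request for an evicted model pays the full load time again, so it only helps when requests tend to hit the same few models in a row.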