feat: change unload model and model status to POST #558

vansangpfiev · 2024-05-14T02:03:46Z

cortex.llamacpp is now supporting multiple models, we need to change method of unloadmodel and modelstatus to POST.
Sample requests for multiple models feature:
load model:

curl http://localhost:3928/inferences/server/loadmodel \
  -H 'Content-Type: application/json' \
  -d '{
    "llama_model_path": "/model/llama-2-7b-model.gguf",
    "model_alias": "llama-2-7b-model",
    "model_type": "llm" // use 'embedding` for embedding model
  }'

unload model

curl http://localhost:3928/inferences/server/unloadmodel \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "llama-2-7b-model"
  }'

louis-jan

LGTM

feat: change unload model and model status to POST

2b59e05

vansangpfiev self-assigned this May 14, 2024

vansangpfiev added 2 commits May 14, 2024 09:16

fix: e2e testing

04f79c8

fix: e2e testing windows

938eacb

vansangpfiev requested a review from louis-jan May 14, 2024 02:43

vansangpfiev marked this pull request as ready for review May 14, 2024 02:44

louis-jan approved these changes May 14, 2024

View reviewed changes

vansangpfiev merged commit 48dcae3 into dev May 14, 2024
18 checks passed

vansangpfiev deleted the feat/get-model-method branch July 8, 2024 05:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: change unload model and model status to POST #558

feat: change unload model and model status to POST #558

vansangpfiev commented May 14, 2024

louis-jan left a comment

feat: change unload model and model status to POST #558

feat: change unload model and model status to POST #558

Conversation

vansangpfiev commented May 14, 2024

louis-jan left a comment

Choose a reason for hiding this comment