
chore: update run.sh script #11

Merged · 4 commits · Apr 11, 2024

Conversation

@axel7083 (Contributor)

What does this PR do?

The current script in this repository is out of date compared to the one in https://github.com/containers/ai-lab-recipes/blob/main/model_servers/llamacpp_python/src/run.sh

Screenshot / video of UI

What issues does this PR fix or reference?

Related to containers/podman-desktop-extension-ai-lab#825

How to test this PR?

  • podman build ./chat

@axel7083 axel7083 requested a review from a team as a code owner April 10, 2024 13:16
@axel7083 axel7083 requested review from benoitf and lstocchi April 10, 2024 13:16
@benoitf (Contributor) left a comment

I'm not sure about adding stuff around CONFIG_PATH, as it's never used in the extension, and the same goes for the other env variables (except MODEL_PATH, HOST and PORT):

https://github.com/containers/podman-desktop-extension-ai-lab/blob/3e6f29ed112c2fc2e1696b632c82d6d6e66f3a7c/packages/backend/src/utils/inferenceUtils.ts#L150

so should we not just add:

--clip_model_path "None" --chat_format "llama-2"
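For context, the env-var-to-flag pattern under discussion can be sketched as follows. This is a hedged sketch, not the merged run.sh: the variable names (MODEL_PATH, HOST, PORT, GPU_LAYERS, CHAT_FORMAT) come from the conversation, while the defaults and the `build_args` helper are illustrative assumptions.

```shell
#!/bin/sh
# Sketch: map container env variables to llama_cpp.server flags, with defaults.
# build_args is a hypothetical helper; the default values are illustrative.
build_args() {
    MODEL_PATH=${MODEL_PATH:-/models/model.gguf}
    HOST=${HOST:-0.0.0.0}
    PORT=${PORT:-8001}
    GPU_LAYERS=${GPU_LAYERS:-0}
    CHAT_FORMAT=${CHAT_FORMAT:-llama-2}
    echo "--model ${MODEL_PATH} --host ${HOST} --port ${PORT}" \
         "--n_gpu_layers ${GPU_LAYERS} --chat_format ${CHAT_FORMAT}"
}

# The server would then be started roughly as:
#   python -m llama_cpp.server $(build_args)
```

The `${VAR:-default}` expansions are what makes every variable optional for the caller, which is the crux of the debate: each default an image ships becomes part of its contract with AI Lab.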

@axel7083 (Author) replied:

> I'm not sure about adding stuff around CONFIG_PATH, as it's never used in the extension, and the same goes for the other env variables (except MODEL_PATH, HOST and PORT)

  • GPU_LAYERS will be required at some point (I would prefer to have it now)
  • CLIP_MODEL_PATH: I don't know
  • CHAT_FORMAT is necessary to fix the issue

I removed the code related to the config file.
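The linked issue is the concrete motivation here: the Merlinite model needs chat_format set to "openchat". With CHAT_FORMAT exposed by the script, that becomes a run-time env override instead of an image rebuild. A sketch of such an override (the image and model file names below are hypothetical):

```shell
# Hypothetical image and model names; the -e CHAT_FORMAT override is the point.
podman run -p 8001:8001 \
  -e MODEL_PATH=/models/merlinite-7b.gguf \
  -e CHAT_FORMAT=openchat \
  localhost/chat:latest
```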

(Review threads on chat/run.sh and chat/requirements.txt were marked outdated/resolved.)
axel7083 and others added 4 commits April 11, 2024 14:11
Co-authored-by: Florent BENOIT <[email protected]>
Signed-off-by: axel7083 <[email protected]>
Signed-off-by: axel7083 <[email protected]>
@jeffmaury (Contributor) left a comment

LGTM


benoitf commented Apr 11, 2024

I'm still wondering: if AI Lab is the only thing using these images, should we not remove the env vars that are not yet provided by AI Lab, like GPU_LAYERS and CLIP_MODEL_PATH, and add them the day they are handled?

But then, a side question: how does AI Lab ensure that the image matches its preconditions?

If I have had the extension installed for a while and then update the extension, will it update the image, or notify me that the service is no longer compliant?

@axel7083 (Author) replied:

> I'm still wondering: if AI Lab is the only thing using these images, should we not remove the env vars that are not yet provided by AI Lab, like GPU_LAYERS and CLIP_MODEL_PATH

I am in favour of keeping them.

> and add them the day they are handled

> But then, a side question: how does AI Lab ensure that the image matches its preconditions? If I have had the extension installed for a while and then update the extension, will it update the image, or notify me that the service is no longer compliant?

We have nothing keeping track of the version.

@axel7083 axel7083 merged commit 91431f5 into containers:main Apr 11, 2024
2 checks passed
Merging this pull request may close: Merlinite model needs "chat_format" set to "openchat"