
fix: update image to support usage info #81

Merged
5 commits merged into containers:main on Dec 6, 2024

Conversation

jeffmaury
Contributor

@jeffmaury jeffmaury commented Nov 27, 2024

What does this PR do?

Updates the inference server to return usage statistics (token counts) in streaming mode.

Screenshot / video of UI

N/A

What issues does this PR fix or reference?

Fixes containers/podman-desktop-extension-ai-lab#1730

How to test this PR?

  1. Build the image
  2. Update the inference server.json file
  3. Start the inference server for a model
  4. Execute a curl query with streaming mode activated and usage info requested
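Step 4 can be sketched with a request like the one below. This is only a sketch: the port, endpoint path, and model name are assumptions (llama-cpp-python serves an OpenAI-compatible `/v1/chat/completions` endpoint, and `stream_options.include_usage` is the OpenAI-style flag that requests usage info in the final streamed chunk).

```shell
# Hypothetical example: adjust host, port, and model name to your setup.
curl -sS http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "<model-name>",
    "messages": [{"role": "user", "content": "Hello"}],
    "stream": true,
    "stream_options": {"include_usage": true}
  }'
```

With `include_usage` set, the last data chunk before `[DONE]` should carry a `usage` object (`prompt_tokens`, `completion_tokens`, `total_tokens`) in addition to the streamed deltas.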

@jeffmaury jeffmaury requested a review from a team as a code owner November 27, 2024 09:48
@axel7083
Contributor

Referencing abetlen/llama-cpp-python#1552 since we are using it

Contributor

@axel7083 axel7083 left a comment


I honestly have serious doubts about the longevity of such images. I don't think we should build too much on the llama-cpp-python project in the future.

chat/setup.sh (review comment, outdated and resolved)
Signed-off-by: Jeff MAURY <[email protected]>
@jeffmaury jeffmaury merged commit ad0b3ec into containers:main Dec 6, 2024
6 checks passed
@jeffmaury jeffmaury deleted the GH-1730 branch December 6, 2024 12:27
Successfully merging this pull request may close these issues.

Expose metrics in the inference server API