feat: adding InferenceType enum #1186
Conversation
Overall LGTM
Two things we should improve (even in separate PRs):
- we should add a label displaying the backend in the models table
- I don't like seeing the error "no enabled provider could be found..." when clicking on create service using a non-supported model. I would prefer not to see the "create new inference server" button at all; otherwise I may think that I'm doing something wrong and that there is some way to make it work.
Yeah, I totally agree. This PR is the first step of #1111; we can add more items there to improve how we deal with this relation.
LGTM
Signed-off-by: axel7083 <[email protected]>
What does this PR do?
Adding the `InferenceType` enum, allowing an `InferenceProvider` (see #1161) to specify its type; in this context, it could be `llama-cpp`, `whisper-cpp`, etc.

Notable changes
- added the `InferenceType` enum
- added a `backend` string property to the `ModelInfo` interface
- added a `backend` string property to the `Recipe` interface
- added a `getByType` method to the `InferenceProviderRegistry`
- the `backend` property of models is now used to select the `InferenceProvider`
- updated the `InferenceProvider` constructor

Screenshot / video of UI
N/A, no visual change.
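For illustration, here is a minimal TypeScript sketch of how the enum and the `getByType` lookup could fit together. Only `InferenceType`, `backend`, `getByType`, and `InferenceProviderRegistry` come from this PR's description; all other names and signatures below are assumptions, not the project's actual implementation:

```typescript
// Hypothetical sketch; the real project's types and signatures may differ.
enum InferenceType {
  LLAMA_CPP = 'llama-cpp',
  WHISPER_CPP = 'whisper-cpp',
  NONE = 'none',
}

// Assumed minimal provider shape carrying the new type information.
interface InferenceProvider {
  type: InferenceType;
  name: string;
}

class InferenceProviderRegistry {
  private providers: InferenceProvider[] = [];

  register(provider: InferenceProvider): void {
    this.providers.push(provider);
  }

  // Return every registered provider matching the requested backend type.
  getByType(type: InferenceType): InferenceProvider[] {
    return this.providers.filter(p => p.type === type);
  }
}

// Usage: with only a llama-cpp provider registered, looking up
// whisper-cpp yields an empty list, which corresponds to the
// "no enabled provider could be found" error path mentioned above.
const registry = new InferenceProviderRegistry();
registry.register({ type: InferenceType.LLAMA_CPP, name: 'llamacpp-provider' });
const matches = registry.getByType(InferenceType.WHISPER_CPP);
```

An empty result from `getByType` is what lets the UI (or service creation flow) reject models whose `backend` has no matching provider.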
What issues does this PR fix or reference?
Fixes #1181
Part of #1111
How to test this PR?
Side effect
Now, trying to start an InferenceServer with the WhisperModel will raise an error, as we do not have any InferenceProvider for it. The same applies to the `facebook/detr-resnet-101` model.
ℹ️ This is good, this is what we expect!