[Feature Request] Support any OpenAI compatible endpoints by adding two flags #1008

regunakyle · 2024-11-18T07:46:09Z

Summary

Support any OpenAI compatible endpoints, such as tabbyAPI, vLLM, ollama, etc.

I am running Qwen2.5-coder 32B with tabbyAPI, which is an OpenAI-compatible API server.

Here is what I did to make it work with garak (openai generator):

export OPENAI_BASE_URL="http://localhost:5000/v1" so that the OpenAI client uses my server
Set the model name of my Qwen2.5 to gpt-4-32k (because gpt-4-32k is one of the supported models and is hardcoded to have 32k context, which is the same context length as Qwen2.5 coder)
Run garak with garak --model_type openai --model_name gpt-4-32k

It would be nice if garak supported arbitrary OpenAI-compatible models out of the box.

Basic example

I suggest adding the following logic:

Add --custom_base_url and --context_len flags; the user must use either both or neither of them
If --custom_base_url is used, initialize the OpenAI client with it. Something like this:

# ...
self.client = openai.OpenAI(api_key=self.api_key, base_url=custom_base_url)
# ...

Set context length to the value of --context_len
The user runs garak with:

OPENAI_API_KEY=<API key> garak --model_type openai --model_name <model name> --custom_base_url <custom_base_url> --context_len <context_len>

For example

OPENAI_API_KEY="sk-123XXXXXXXXXXXX" garak --model_type openai --model_name Qwen_Qwen2.5-Coder-32B-Instruct-exl2 --custom_base_url http://localhost:5000/v1 --context_len 32768
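The "both or neither" rule for the two flags could be enforced at argument-parsing time. Below is a minimal Python sketch using argparse. It is not garak's actual CLI code: --custom_base_url and --context_len are the flags proposed in this issue, and parse_endpoint_args is a hypothetical helper name.

```python
import argparse


def parse_endpoint_args(argv):
    """Parse the two proposed flags and enforce "both or neither"."""
    parser = argparse.ArgumentParser(prog="garak-sketch")
    parser.add_argument("--model_name", required=True)
    # Flags proposed in this issue; assumed here for illustration only.
    parser.add_argument("--custom_base_url", default=None)
    parser.add_argument("--context_len", type=int, default=None)
    args = parser.parse_args(argv)

    has_url = args.custom_base_url is not None
    has_len = args.context_len is not None
    if has_url != has_len:
        # parser.error prints a usage message and exits with status 2.
        parser.error("--custom_base_url and --context_len must be used together")
    return args
```

With this check, passing only one of the two flags fails early instead of silently falling back to a hardcoded context length.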

Motivation

There are quite a lot of OpenAI-compatible API servers out there, so supporting them would cover a lot more use cases.
Also, I think it is more straightforward to set up (compared to the REST generator, which has a lot of manual config values).


leondz · 2024-11-18T08:32:06Z

Thanks, this is a good idea. Will take a look.

jmartin-tech · 2024-11-18T19:56:44Z

This is already possible with nim generators. Since NIMs are published as OpenAI-compatible service containers, you can pass in a config to garak that provides a uri to target OpenAI-client-compatible endpoints. Promoting OpenAICompatible as a generic generator, however, may be a more straightforward and accessible pattern.

For the moment can you try one of these examples and see if there are edge cases that might need to be investigated?

openai-compat-endpoint.yaml

plugins:
  generators:
    nim:
      uri: http://0.0.0.0:8000/v1
      context_len: 32768
      api_key: <enter here or in env var NIM_API_KEY>

This can be passed via --config

python -m garak -m nim -n my_deployed_model_name --config openai-compat-endpoint.yaml

Or as json openai-compat-endpoint.json:

{
  "generators": {
    "nim": {
      "uri": "http://0.0.0.0:8000/v1",
      "context_len": 32768
    }
  }
}

This can be passed via --generator_option_file

python -m garak -m nim -n my_deployed_model_name --generator_option_file openai-compat-endpoint.json
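For scripted runs, the JSON variant can be generated rather than written by hand. Below is a minimal Python sketch that assumes the same keys as the example (generators, nim, uri, context_len) and the same command line. The helper names write_generator_options and garak_command are made up for illustration and are not part of garak.

```python
import json
from pathlib import Path


def write_generator_options(path, uri, context_len):
    """Write a JSON options file with the same layout as the example above."""
    options = {"generators": {"nim": {"uri": uri, "context_len": context_len}}}
    Path(path).write_text(json.dumps(options, indent=2), encoding="utf-8")
    return options


def garak_command(model_name, option_file):
    """Build the garak invocation from the example as an argument list."""
    return [
        "python", "-m", "garak",
        "-m", "nim",
        "-n", model_name,
        "--generator_option_file", str(option_file),
    ]
```

The returned list can be handed to subprocess.run(cmd, check=True). The API key is deliberately left out of the file here, on the assumption that it is supplied through the NIM_API_KEY environment variable mentioned in the YAML example.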

leondz · 2024-11-19T02:12:08Z

Might be worth farming this out to a putative openai.Compatible that requires an endpoint uri

regunakyle · 2024-11-19T02:54:35Z

Thanks, I just tried this and it works. (It would be nice if this is explicitly documented though)

I guess my proposal is not needed in this case? I will close the issue now.

leondz · 2024-11-20T17:56:00Z

It would be nice if this is explicitly documented though

Agree, we should have a clearer route. If you don't mind, I'll reopen this to track resolving that.

jmartin-tech · 2024-11-20T19:42:14Z

It may make sense to enhance OpenAICompatible so that it is fully functional on its own, with exposed params that expect a uri to be provided.
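To make the "requires a uri" idea concrete, here is a minimal Python sketch of a settings holder that refuses to be built without an endpoint. CompatibleEndpoint and its fields are hypothetical and do not reflect garak's actual OpenAICompatible class. The only external fact assumed is that openai.OpenAI accepts api_key and base_url keyword arguments, as in the snippet from the original proposal.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class CompatibleEndpoint:
    """Hypothetical settings holder; not garak's OpenAICompatible class."""

    name: str
    uri: str = ""
    context_len: int = 0
    api_key: str = ""

    def __post_init__(self):
        # Reject a missing or malformed endpoint before any client is created.
        parsed = urlparse(self.uri)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError("uri must be an http(s) endpoint such as http://localhost:5000/v1")
        if self.context_len <= 0:
            raise ValueError("context_len must be a positive integer")

    def client_kwargs(self):
        """Keyword arguments in the shape openai.OpenAI(...) accepts."""
        return {"api_key": self.api_key, "base_url": self.uri}
```

Validating in __post_init__ means a missing uri surfaces as a clear configuration error rather than as a request sent to the default OpenAI endpoint.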

regunakyle added the architecture Architectural upgrades label Nov 18, 2024

leondz added the generators Interfaces with LLMs label Nov 18, 2024

regunakyle closed this as completed Nov 19, 2024

leondz reopened this Nov 20, 2024

jmartin-tech mentioned this issue Nov 22, 2024 in "generator: promote OpenAICompatible as first class generator #1021" (Merged)

leondz closed this as completed in #1021 Dec 6, 2024

leondz closed this as completed in 7499da1 Dec 6, 2024
