
Add base API URL field for Ollama and OpenAI embedding models #1136

Merged
7 commits merged into jupyterlab:main on Dec 9, 2024

Conversation

@srdas (Collaborator) commented Dec 4, 2024

Description

Jupyter AI currently allows the user to call a model at a URL different from the default one by specifying a Base API URL. This can be done for Ollama and OpenAI provider models. However, for these providers, there is no way to change the API URL for embedding models when using the /learn command in RAG mode. This PR adds an extra field to make this feasible.
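The shape of the change can be sketched as follows. This is an illustrative sketch only, not the PR's actual code; the class and method names (`EmbeddingProviderConfig`, `client_kwargs`) are assumptions made for the example:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class EmbeddingProviderConfig:
    """Illustrative config for an embedding provider (names are hypothetical)."""
    model_id: str
    base_api_url: Optional[str] = None  # the new optional field this PR adds

    def client_kwargs(self) -> dict:
        """Build keyword arguments for the underlying embeddings client."""
        kwargs = {"model": self.model_id}
        # Only forward a base URL when the user actually set one;
        # otherwise the client falls back to its built-in default.
        if self.base_api_url:
            kwargs["base_url"] = self.base_api_url
        return kwargs
```

With no base URL set, the client's default endpoint is used; setting the field routes embedding calls (and therefore `/learn`) to the custom address.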

Testing instructions

Testing as follows for Ollama:
[1] Start the Ollama server on port 11435 instead of 11434 (the default):
`OLLAMA_HOST=127.0.0.1:11435 ollama serve`
[2] Set the Base API URL to the new address (e.g. `http://localhost:11435`) in the Jupyter AI settings panel.
[3] Check that the new API URL works for completions and `/learn`.
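Before testing in Jupyter AI, a quick reachability check can confirm step [1] worked. This helper is hypothetical and not part of the PR; it only assumes the Ollama server answers plain HTTP GET requests at its base URL with a 200 status:

```python
import urllib.request
import urllib.error

def ollama_reachable(base_url: str, timeout: float = 2.0) -> bool:
    """Return True if an HTTP server answers at base_url with status 200."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, or timeout: server not reachable
        return False
```

After step [1], `ollama_reachable("http://127.0.0.1:11435")` should return True, while the old default port should stop responding.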

@srdas srdas added the enhancement New feature or request label Dec 4, 2024
srdas and others added 2 commits December 5, 2024 15:27
@dlqqq (Member) commented Dec 9, 2024

I noticed a bug on a24b436 on Friday but didn't have time to document it. If you change the OpenAI embeddings base URL to an empty value and save, all subsequent connections fail. I did some debugging and found that the OpenAIEmbeddings provider class fails if you pass it openai_api_base="", because it treats the empty string as the API base instead of ignoring it.

I'm pushing a change that alters the ConfigManager to exclude empty string fields from the keyword arguments it provides. This fixes the bug for me locally. Most providers already ignore optional keyword arguments that are set to an empty string, so this shouldn't cause any change for users.
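The fix described above can be sketched as a small filter over the provider keyword arguments. The function name is illustrative, not the actual ConfigManager code:

```python
def drop_empty_string_fields(kwargs: dict) -> dict:
    """Exclude keys whose value is an empty string, so that e.g.
    openai_api_base="" is never passed to a provider class that would
    treat the empty string as a real API base instead of ignoring it."""
    return {k: v for k, v in kwargs.items() if v != ""}
```

A cleared-out Base API URL field then behaves the same as a field that was never set.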

@dlqqq dlqqq changed the title Base API URL added for embedding models Add base API URL field for Ollama and OpenAI embedding models Dec 9, 2024
@dlqqq dlqqq marked this pull request as ready for review December 9, 2024 19:52
@srdas (Collaborator, Author) left a comment


  1. Tested /learn and /ask with haiku-3.5 and titan-embed-v1. ✅ (No Base API URL)
  2. Tested /learn and /ask with ollama-llama3.2 and ollama-mxbai-embed-large. ✅ (No Base API URL) -- all next tests with Ollama (to test changes in ollama.py ).
  3. Tested again with Base API URL = 11434 (the default explicitly). ✅
  4. Tested again after clearing out the Base API URL, still works ✅
  5. Restarted Ollama with port=12345. Added this to the Base API URL and it all works as expected. ✅
  6. Removed the custom Base API URL (blank field) and the /learn and /ask commands now fail, as they should, because Ollama is still running on the custom port. ✅
  7. Leaving custom fields blank, restarted Ollama to return to default API URL and everything works as expected. ✅
  8. With OpenAI embeddings, left the Base API URL blank (it works ✅), then added the URL (it works ✅) and then deleted the URL (and it still works ✅, confirms the change in config_manager.py is implemented).

Code looks good as well.

@dlqqq (Member) commented Dec 9, 2024

Kicking CI since the RTD workflow has stalled.

@dlqqq dlqqq closed this Dec 9, 2024
@dlqqq dlqqq reopened this Dec 9, 2024
@dlqqq dlqqq merged commit 1cbd853 into jupyterlab:main Dec 9, 2024
20 checks passed
@dlqqq (Member) commented Dec 9, 2024

@meeseeksdev please backport to v3-dev

meeseeksmachine pushed a commit to meeseeksmachine/jupyter-ai that referenced this pull request Dec 9, 2024
dlqqq pushed a commit that referenced this pull request Dec 10, 2024