Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update to 0.1, remove deprecated functionality and focus on api catalog backend #48

Merged
merged 42 commits into from
May 31, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
fffe37e
bump version to 0.1.0-rc0
mattf May 20, 2024
856e735
remove deprecated methods and properties
mattf May 20, 2024
81ebefd
define model table and semantics, replaces model specs
mattf May 20, 2024
9853259
add phi-3-small/medium/vision to MODEL_TABLE
mattf May 21, 2024
ac7fbbb
refactor to use MODEL_TABLE and integrate.api.nvidia.com
mattf May 20, 2024
410ad6e
assume deprecated models without alternatives are not available
mattf May 23, 2024
ccfa8fe
make NV-Embed-QA an alternative for nvolveqa_40k
mattf May 23, 2024
c8eb33f
remove deprecated bad & labels param from ChatNVIDIA
mattf May 23, 2024
3b0926a
remove deprecated max_length param from NVIDIAEmbeddings
mattf May 23, 2024
044766e
remove special handling of removed kosmos_2 model
mattf May 23, 2024
1440857
add nvidia/embed-qa-4 to list of embedding models
mattf May 23, 2024
38502fd
playground_* models no longer exist - add aliases for model alternati…
mattf May 23, 2024
5e097ad
remove redundant model_name
mattf May 23, 2024
9c12d5a
update default embedding model to nvidia/embed-qa-4
mattf May 24, 2024
29def65
update chat unit tests to remove duplicate deprecation check and fix …
mattf May 24, 2024
88ba0c5
NVIDIAEmbeddings has _NVIDIAClient instead of is _NVIDIAClient
mattf May 24, 2024
01556e4
add tests for base_url, share public classes fixture
mattf May 24, 2024
2e090e1
ChatNVIDIA has _NVIDIAClient instead of is _NVIDIAClient
mattf May 24, 2024
4acf6ab
pass base_url tests
mattf May 24, 2024
827c0ba
add hosted nim identification
mattf May 24, 2024
1130abb
warn if no key provided for hosted nim
mattf May 24, 2024
4bd3c5f
update api key tests and implementation
mattf May 24, 2024
1fe47f2
align available_models integration test
mattf May 24, 2024
0514018
ensure model field reflects the model that will be used
mattf May 24, 2024
f82d8f0
add support for VLM models
mattf May 24, 2024
7d84a02
add support for embedding models on custom endpoints
mattf May 24, 2024
20bc3ec
align alias and model availability unit test
mattf May 24, 2024
e23b5df
align test_langchain_reranker_direct_endpoint_unavailable across publ…
mattf May 24, 2024
deb8053
remove unused deprecated field from MODEL_TABLE
mattf May 24, 2024
1505eec
correct meta/llama3-{8,70}b-instruct alias names
mattf May 28, 2024
c1b374a
remove references to deprecated models from docs
mattf May 30, 2024
a696849
bump version from 0.1.0-rc0 to 0.1.0
mattf May 30, 2024
064c1ee
mark NVIDIAEmbeddings.model_type as deprecated
mattf May 30, 2024
598ee33
remove dead code handling nvolveqa_40k
mattf May 30, 2024
4474971
ensure available models includes known and listed
mattf May 31, 2024
46bc579
add compat mode for nvolveqa_40k
mattf May 31, 2024
f1504f3
use get's default instead of or
mattf May 31, 2024
eaab838
add ibm/granite-8b-code-instruct, ibm/granite-34b-code-instruct and g…
mattf May 31, 2024
858afb8
lookup_model: remove unnecessary iteration, clarify intent
mattf May 31, 2024
15d28a2
remove unused streaming field
mattf May 31, 2024
d4a32f2
add doc for internal determine_model func
mattf May 31, 2024
e88df10
update documentation for primary public interfaces
mattf May 31, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions libs/ai-endpoints/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -221,7 +221,7 @@ for txt in chain.stream({"input": "Why is a PB&J?"}):

NVIDIA also supports multimodal inputs, meaning you can provide both images and text for the model to reason over.

An example model supporting multimodal inputs is `ai-neva-22b`.
An example model supporting multimodal inputs is `nvidia/neva-22b`.

These models accept LangChain's standard image formats. Below are examples.

Expand All @@ -237,7 +237,7 @@ Initialize the model like so:
```python
from langchain_nvidia_ai_endpoints import ChatNVIDIA

llm = ChatNVIDIA(model="ai-neva-22b")
llm = ChatNVIDIA(model="nvidia/neva-22b")
```

#### Passing an image as a URL
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -451,7 +451,7 @@
" ]\n",
")\n",
"\n",
"model = ChatNVIDIA(model=\"ai-mixtral-8x7b-instruct\")\n",
"model = ChatNVIDIA(model=\"mistralai/mixtral-8x7b-instruct-v0.1\")\n",
"\n",
"chain = (\n",
" {\"context\": retriever, \"question\": RunnablePassthrough()}\n",
Expand Down
Loading
Loading