add NVIDIARerank connector, a BaseDocumentCompressor, with support for NVIDIA API Catalog #19

mattf · 2024-04-12T13:31:26Z

introduce NVIDIARerank, a BaseDocumentCompressor, supporting NVIDIA API Catalog, e.g.

from langchain_nvidia_ai_endpoints import NVIDIARerank
from langchain_core.documents import Document

query = "which way should i go?"
passages = [
	"two roads diverged in a yellow wood, and sorry i could not travel both and be one traveler, long i stood and looked down one as far as i could to where it bent in the undergrowth;",
	"then took the other, as just as fair, and having perhaps the better claim because it was grassy and wanted wear, though as for that the passing there had worn them really about the same,",
	"and both that morning equally lay in leaves no step had trodden black. oh, i marked the first for another day! yet knowing how way leads on to way i doubted if i should ever come back.",
	"i shall be telling this with a sigh somewhere ages and ages hense: two roads diverged in a wood, and i, i took the one less traveled by, and that has made all the difference."
]
documents = [Document(page_content=passage) for passage in passages]

# available models
models = NVIDIARerank.get_available_models()

# API Catalog with default model
ranker = NVIDIARerank()

# API Catalog with selected model
ranker = NVIDIARerank(model=models[0].id)

# control number of output documents
ranker.top_n = 2

# perform reranking compression
ranked_documents = ranker.compress_documents(documents=documents, query=query)

# examine ranking relevance score
assert ranked_documents[0].metadata["relevance_score"] > ranked_documents[1].metadata["relevance_score"]

…://host:port without path

current: {"model": "...", "query": {"text": "hello world?"}, "passages": [ {"text": "passage one"}, {"text": "passage two"}, {"text": "passage three"} ], "logits": True } -> [{"index": 1, "score": 1.0, "logit": 1.234}, {"index": 2, "score": 0.5, "logit": 0.910}, {"index": 0, "score": 0.0, "logit": -5.678}] change: - remove logit option - return logits instead of normalized scores new: {"model": "...", "query": {"text": "hello world?"}, "passages": [ {"text": "passage one"}, {"text": "passage two"}, {"text": "passage three"} ] } -> [{"index": 1, "logit": 1.234}, {"index": 2, "logit": 0.910}, {"index": 0, "logit": -5.678}] user impact: - user must perform their own normalization, if desired - no impact: results are still returned in sorted order - no impact: users can still produce a total order across batches

…r NVIDIA API Catalog

doing so would require selection of a bogus logit, which will break the ability of a user to call compress_documents with disjoint sets of documents and reconstruct an ordering. at least in the degenerate case of 1 document per batch.

… a space

chantal-rose · 2024-04-23T19:06:20Z

Looks good @mattf

mattf self-assigned this Apr 12, 2024

mattf mentioned this pull request Apr 17, 2024

add NVIDIARerank support for a local NIM #23

Merged

mattf force-pushed the mattf/add-reranking-api-catalog branch from 5586cde to 6662c10 Compare April 17, 2024 15:30

Zenodia and others added 25 commits April 20, 2024 07:10

adding langchain.py inside src/ranking

2e63d8c

align api w/ langchain expectations

b2a2e1b

remove document alterations

323774d

add direct use example

a1cb91f

rename NVReranker_URL to NVIDIA_NEMO_RERANKING_ENDPOINT, make it http…

cfc9726

…://host:port without path

add config for top_n, model, endpoint

c0d39cf

remove empty async acompress_documents

c7a456d

provide default for NVIDIA_NEMO_RERANKING_ENDPOINT

81fca6f

add basic tests, handle top_n < 1

db4eebc

add negative tests for endpoint

e3322f8

add marker for tests that require an active service

3b16256

add langchain example to readme

672955d

align model name

81c7400

add NVIDIARerank connector, a BaseDocumentCompressor, with support fo…

cef5f4e

…r NVIDIA API Catalog

add NIM mode to NVIDIARerank connector

6defa5b

provide a default base_url for nim mode

d320a4c

add validation of top_n

43b2bda

add support for batching

0cb117f

update negative top_n test to disable validation

19a9ff5

add illustrative doc for reranking (not expected to execute)

a58b5d2

remove notebooks/LangChain example, direct use.ipynb, which contained…

4656f6d

… a space

support for py3.8, use List for typing instead of list

e2de402

add nvidia_api_key and api_key params to NVIDIARerank

d95fd5a

mattf force-pushed the mattf/add-reranking-api-catalog branch from 6662c10 to d95fd5a Compare April 20, 2024 12:33

update example notebook, thank you @chantal-rose

7ed686f

mattf merged commit edb8b9b into main Apr 23, 2024
12 checks passed

mattf deleted the mattf/add-reranking-api-catalog branch April 23, 2024 19:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add NVIDIARerank connector, a BaseDocumentCompressor, with support for NVIDIA API Catalog #19

add NVIDIARerank connector, a BaseDocumentCompressor, with support for NVIDIA API Catalog #19

mattf commented Apr 12, 2024 •

edited

Loading

chantal-rose commented Apr 23, 2024

add NVIDIARerank connector, a BaseDocumentCompressor, with support for NVIDIA API Catalog #19

add NVIDIARerank connector, a BaseDocumentCompressor, with support for NVIDIA API Catalog #19

Conversation

mattf commented Apr 12, 2024 • edited Loading

chantal-rose commented Apr 23, 2024

mattf commented Apr 12, 2024 •

edited

Loading