Model features: native async #110

Open
wants to merge 14 commits into base: main

Conversation

raspawar (Collaborator)

Native Async

Validated the current implementation and added test cases.

from langchain_core.messages import HumanMessage
from langchain_nvidia_ai_endpoints import ChatNVIDIA

llm = ChatNVIDIA()
message = HumanMessage(content="Hello")
response = await llm.agenerate([[message]])

cc: @sumitkbh

@raspawar raspawar requested a review from mattf October 18, 2024 06:39
mattf (Collaborator) commented Oct 18, 2024

@efriis we have async implementations for the generation functions. what are the criteria for "native async" (see https://python.langchain.com/docs/integrations/chat/nvidia_ai_endpoints/#model-features)?

@raspawar raspawar requested a review from mattf October 25, 2024 14:52
mattf (Collaborator) left a comment


per discussion w/ Erick, we need to implement _agenerate, which will entail a move from requests to httpx
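
for reference, a rough sketch of what the httpx-based async path could look like — the endpoint URL, payload shape, and the _apost helper name here are placeholders for illustration, not the actual client code:

import httpx

class _NVIDIAClient:
    # hypothetical sketch: base_url and header layout are assumptions
    base_url: str = "https://integrate.api.nvidia.com/v1"
    api_key: str = ""

    async def _apost(self, path: str, payload: dict) -> dict:
        # non-blocking replacement for the requests-based call
        async with httpx.AsyncClient() as client:
            response = await client.post(
                f"{self.base_url}{path}",
                json=payload,
                headers={"Authorization": f"Bearer {self.api_key}"},
            )
            response.raise_for_status()
            return response.json()

_agenerate would then await a helper like this instead of wrapping the blocking requests call, so concurrent generations can actually overlap.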

@raspawar raspawar requested a review from mattf November 21, 2024 13:20
mattf (Collaborator) left a comment


looking good. please add a test that confirms httpx requests are interleaved.



def test_generate(chat_model: str, mode: dict) -> None:
    """Test generate method of anthropic."""

anthropic?

@@ -100,7 +103,7 @@ class _NVIDIAClient(BaseModel):
     last_inputs: Optional[dict] = Field(
         default={}, description="Last inputs sent over to the server"
     )
-    last_response: Response = Field(
+    last_response: Optional[Response] = Field(

why make this optional?

assert chat_messages == messages_copy


# @pytest.mark.scheduled

should this be commented or not?



# @pytest.mark.scheduled
async def test_async_generate(chat_model: str, mode: dict) -> None:

this will pass even if agenerate() is implemented without truly async communication w/ the server.

add a unit test that checks that async generation requests are interleaved. for inspiration...

import asyncio
import time

import httpx

# illustrative test wrapper: assumes the pytest-httpx `httpx_mock` fixture
# and an async-capable pytest setup (e.g. pytest-asyncio)
async def test_async_requests_are_interleaved(httpx_mock) -> None:
    async def afetch_data(url: str) -> str:
        async with httpx.AsyncClient() as client:
            return (await client.get(url)).text

    async def amock_response(request):
        # every mocked response takes ~1 second to arrive
        await asyncio.sleep(1)
        return httpx.Response(200, text="Hello world!")

    httpx_mock.add_callback(amock_response, is_reusable=True)

    start_time = time.time()
    task1, task2 = afetch_data("http://example.com"), afetch_data("http://example.com")
    _, _ = await asyncio.gather(task1, task2)

    # two ~1 second requests finishing in under 2 seconds means they overlapped
    assert (time.time() - start_time) < 2, "Tasks did not run concurrently"

@raspawar raspawar requested a review from mattf December 12, 2024 19:49