
Refactor to reuse stream parsing across ChatModels #380

Merged · 42 commits into main · Nov 29, 2024

Conversation

jackmpcollins
Owner

@jackmpcollins jackmpcollins commented Nov 29, 2024

Big refactor. Replaces near-duplicate generator functions with better abstractions.

Parsing an LLM streamed response now requires two components:

  • StreamParser, which identifies string vs. tool-call output and parses each
  • StreamState, which tracks the current message snapshot, usage, etc. for a streamed response

This removes the assumption that the LLM returns either a string or tool calls, in preparation for #232.
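A minimal sketch of how this split might look. The class names mirror the PR, but all method names, signatures, and the chunk format are illustrative assumptions, not the actual magentic API:

```python
# Hypothetical sketch of the StreamParser / StreamState split described above.
# Names and chunk shapes are illustrative, not the actual magentic internals.
from dataclasses import dataclass, field
from typing import Iterable, Iterator


@dataclass
class StreamState:
    """Accumulates the message snapshot and usage for a streamed response."""

    content: str = ""
    tool_call_chunks: list[str] = field(default_factory=list)
    usage_tokens: int = 0

    def update(self, chunk: dict) -> None:
        # Fold each raw chunk into the running snapshot.
        self.content += chunk.get("content", "")
        if tool := chunk.get("tool_call"):
            self.tool_call_chunks.append(tool)
        self.usage_tokens += chunk.get("tokens", 0)


class StreamParser:
    """Identifies string vs. tool-call output and parses each chunk."""

    def is_tool_call(self, chunk: dict) -> bool:
        return "tool_call" in chunk

    def parse(self, chunks: Iterable[dict], state: StreamState) -> Iterator[str]:
        # Update shared state for every chunk, yield only the text pieces.
        for chunk in chunks:
            state.update(chunk)
            if not self.is_tool_call(chunk):
                yield chunk.get("content", "")
```

Because the state object is shared rather than rebuilt per output kind, the same parsing loop can serve every ChatModel instead of each one carrying a near-duplicate generator.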

Improved parsing of output types: FunctionCall or ParallelFunctionCall must now be declared in the prompt-function's return type, rather than being returned even when absent from that type.

Also added new error types FunctionCallNotAllowedError, ObjectNotAllowedError, and UnknownToolError, which can be incorporated into the LLM-assisted retry logic in future.
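One way such a check could work, as a rough sketch: the error class name comes from the PR, while the helper name, the stand-in FunctionCall class, and the unwrapping logic are assumptions for illustration:

```python
# Hypothetical sketch of the stricter return-type check described above.
# FunctionCallNotAllowedError is named in the PR; everything else here
# (the helper, the stand-in FunctionCall class) is illustrative.
import typing


class FunctionCallNotAllowedError(Exception):
    """Raised when the LLM returns a tool call but the prompt-function's
    return type does not include FunctionCall/ParallelFunctionCall."""


class FunctionCall:
    """Stand-in for magentic's FunctionCall type."""


def check_tool_call_allowed(return_type) -> None:
    # Unwrap e.g. Union[str, FunctionCall] into its member types;
    # a bare class has no args, so fall back to the type itself.
    allowed = typing.get_args(return_type) or (return_type,)
    if not any(isinstance(t, type) and issubclass(t, FunctionCall) for t in allowed):
        raise FunctionCallNotAllowedError(
            f"LLM returned a tool call but {return_type!r} does not allow FunctionCall"
        )
```

Raising a dedicated error type here, instead of silently returning a FunctionCall the caller did not ask for, is what makes the future LLM-assisted retry possible: the retry logic can catch the specific error and re-prompt the model.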

@jackmpcollins jackmpcollins merged commit a9d513f into main Nov 29, 2024
1 check passed
@jackmpcollins jackmpcollins deleted the use-streaming-events branch November 29, 2024 01:56