Consume LLM output stream via returned objects to allow caching #384

jackmpcollins · 2024-12-01T00:08:42Z

Consume any remaining chunks via StreamedStr or output object in OutputStream so that internal caching in these objects can be used.

This removes the need to process StreamedStr immediately when it is received, so I removed the warning from the docs.

jackmpcollins added 3 commits November 30, 2024 15:56

Consume stream via objects to allow caching

8089ddf

Update tests to check caching

d65d97d

Change StreamedStr consume warning to tip in docs

580fd02

jackmpcollins self-assigned this Dec 1, 2024

jackmpcollins merged commit f7e7e37 into main Dec 1, 2024
1 check passed

jackmpcollins deleted the allow-caching-for-output-objs branch December 1, 2024 00:10

Provide feedback