
[Feature] Default max_prompt_tokens should be LLM dependent #223

Open

igiloh-pinecone opened this issue Dec 13, 2023 · 0 comments
Labels
enhancement New feature or request good first issue Good for newcomers

Comments

@igiloh-pinecone
Contributor

Is this your first time submitting a feature request?

  • I have searched the existing issues, and I could not find an existing issue for this feature
  • I am requesting a straightforward extension of existing functionality

Describe the feature

The ChatEngine takes a max_prompt_tokens argument, which limits the total prompt size (message history + system prompt + retrieved context).
Currently, this argument has an arbitrary hard-coded default value of 4096. However, the real default should be LLM dependent, since different LLMs have different context window sizes.

Describe alternatives you've considered

Each LLM class should have a context_window cached property (the value may vary according to model_name).
If the user hasn't specified max_prompt_tokens (whose default should be None), the ChatEngine will fall back to llm.context_window, as in the sketch below.
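
A minimal sketch of the proposed behavior. The class names, the per-model lookup table, and the fallback value are illustrative assumptions, not the actual Canopy implementation:

```python
from functools import cached_property
from typing import Optional

# Hypothetical per-model context window sizes; in the real implementation each
# LLM class would own (and maintain) its own mapping.
_OPENAI_CONTEXT_WINDOWS = {
    "gpt-3.5-turbo": 4096,
    "gpt-3.5-turbo-16k": 16384,
    "gpt-4": 8192,
}


class OpenAILLM:  # illustrative stand-in for the real LLM class
    def __init__(self, model_name: str = "gpt-3.5-turbo"):
        self.model_name = model_name

    @cached_property
    def context_window(self) -> int:
        # Look up the window for the configured model; the fallback of 4096
        # here is an assumption for the sketch.
        return _OPENAI_CONTEXT_WINDOWS.get(self.model_name, 4096)


class ChatEngine:  # illustrative stand-in for the real ChatEngine
    def __init__(self, llm, max_prompt_tokens: Optional[int] = None):
        self.llm = llm
        # Default to the LLM's own context window instead of a hard-coded 4096.
        self.max_prompt_tokens = (
            max_prompt_tokens if max_prompt_tokens is not None else llm.context_window
        )


engine = ChatEngine(OpenAILLM("gpt-3.5-turbo-16k"))
print(engine.max_prompt_tokens)  # 16384
```

An explicitly passed max_prompt_tokens would still take precedence, so existing user configurations keep working unchanged.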

Who will this benefit?

No response

Are you interested in contributing this feature?

No response

Anything else?

No response

@igiloh-pinecone igiloh-pinecone added enhancement New feature or request good first issue Good for newcomers labels Dec 13, 2023