Conversation History Trimming with max_tokens and max_history_length #8
Conversation
Split this into two:
The usage example should show that the trimming works!
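A sketch of what such a usage example might look like (the add_message helper, the .content attribute, and the budget of 50 are assumptions for illustration, not the PR's actual API):

# Hypothetical demo: trimming should visibly drop old messages
# once the token budget is exceeded.
conversation = Conversation(max_history_tokens=50)
for i in range(20):
    conversation.add_message(f"Message number {i}, padded with some filler text.")

history = conversation.get_history(max_tokens=50)
assert len(history) < 20            # older messages were trimmed away
assert "19" in history[-1].content  # the newest message survives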
Done. Made the changes requested.
Primarily API design changes; the core connection/integration logic looks good.
agentai/conversation.py
Outdated
def get_history(self, max_tokens=100) -> List[Message]:
    """Function to get the conversation history based on the number of tokens"""
    self.trim_history_by_tokens(max_tokens=max_tokens)
    return self.history
This feels like a utility wrapper. Would prefer to remove this and rename trim_history_by_tokens to get_history instead?
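A minimal sketch of that rename (the trimming loop and the count_tokens helper are assumptions; the PR's real trim_history_by_tokens body isn't shown in this diff):

def get_history(self, max_tokens: int = 100) -> List[Message]:
    """Trim the history in place to the newest messages that fit
    within max_tokens, then return it (formerly trim_history_by_tokens)."""
    total = 0
    kept: List[Message] = []
    # Walk from newest to oldest, stopping once the budget is spent.
    for message in reversed(self.history):
        total += self.count_tokens(message)  # token counter assumed
        if total > max_tokens:
            break
        kept.append(message)
    self.history = list(reversed(kept))
    return self.history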
This is done.
docs/test_conversation_history.py
Outdated
@@ -0,0 +1,75 @@
import os
Looks identical to the notebook? Let's keep only one copy of the logic?
The notebook didn't work: it was picking up the installed agentai library instead of the local folder. So I had to use the Python file to test things out.
pip install -e . might be worth looking into for local development; it installs the local folder in editable mode, so imports resolve to your working copy instead of the released library.
# Cache the encoder on a thread-local so each thread builds it only once
local = threading.local()
try:
    enc = local.gpt2enc
except AttributeError:
    enc = tiktoken.get_encoding("gpt2")
    local.gpt2enc = enc
Check and confirm whether the GPT-4 tokeniser is the same as gpt2? From what I recall, this is wrong: the tokeniser depends on the LLM.
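For what it's worth, GPT-4 uses the cl100k_base encoding rather than gpt2, and tiktoken can resolve the right encoding from a model name. A possible model-aware version of the snippet above (the function name and caching layout are just a sketch):

import threading

import tiktoken

_local = threading.local()

def get_encoder(model: str = "gpt-4"):
    """Return a per-thread cached tiktoken encoder for the given model."""
    cache = getattr(_local, "encoders", None)
    if cache is None:
        cache = _local.encoders = {}
    if model not in cache:
        # encoding_for_model maps "gpt-4" -> cl100k_base,
        # "gpt2" -> gpt2, and so on, so counts match the target LLM.
        cache[model] = tiktoken.encoding_for_model(model)
    return cache[model]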
This closes #7
@@ -11,7 +13,7 @@ class Message(BaseModel):


 class Conversation:
-    def __init__(self, history: List[Message] = [], id: Optional[str] = None):
+    def __init__(self, history: List[Message] = [], id: Optional[str] = None, max_history_tokens: int = 200):
We should default to an infinite max_history_tokens to be backward compatible.
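A minimal sketch of a backward-compatible signature, using None to mean "no limit" (the add method and the trim call site are assumptions about how the class is wired up):

from typing import List, Optional

class Conversation:
    def __init__(
        self,
        history: Optional[List[Message]] = None,
        id: Optional[str] = None,
        max_history_tokens: Optional[int] = None,  # None = unlimited (old behaviour)
    ):
        # Also avoids the mutable-default pitfall of history=[].
        self.history = history if history is not None else []
        self.id = id
        self.max_history_tokens = max_history_tokens

    def add(self, message: Message) -> None:
        self.history.append(message)
        # Only trim when a finite budget was configured.
        if self.max_history_tokens is not None:
            self.trim_history_by_tokens(max_tokens=self.max_history_tokens)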
This PR has the PoC for conversation trimming based on a given token budget and default lengths.
Added a dummy conversation generated via ChatGPT, since I wanted content of differing lengths.
Took a bit of code from the gist you had shared, @NirantK. Hope this is what you were looking for in #7.
It would be helpful if you could guide me on the further steps.
Thanks,
Aditya Thoomati