
Respect the maximum number of tokens in interactive. #298

Merged · 2 commits · Mar 19, 2023

Conversation

tjohnman
Contributor

Even in interactive mode, the specified maximum number of tokens should be respected. Instead of ending the main loop when the limit is reached, fall back to user input mode.
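
A minimal sketch of the idea behind the change (not the actual patch): when the `n_predict` budget runs out in interactive mode, refill the budget and hand control back to the user instead of breaking out of the main loop. The names used here (`interactive`, `n_predict`, and the commented-out generation/input calls) are placeholders, not the real identifiers in `main.cpp`.

```cpp
// Illustrative only: refill the token budget and return to the user
// when n_predict is exhausted, rather than ending the loop.
void run_loop(bool interactive, int n_predict) {
    int  remaining_tokens = n_predict;
    bool is_interacting   = false;

    while (remaining_tokens > 0 || interactive) {
        if (!is_interacting) {
            // ... sample and print the next token here ...
            --remaining_tokens;
        }

        if (remaining_tokens <= 0) {
            if (!interactive) {
                break;                    // non-interactive: stop as before
            }
            remaining_tokens = n_predict; // refill the token budget
            is_interacting   = true;      // and wait for the user instead
        }

        if (is_interacting) {
            // ... read user input, tokenize it, append it to the context ...
            is_interacting = false;
        }
    }
}
```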

@ggerganov ggerganov merged commit 368d0c8 into ggerganov:master Mar 19, 2023
@tjohnman tjohnman deleted the interactive-mode-fix branch March 19, 2023 18:35
@rabidcopy
Contributor

rabidcopy commented Mar 19, 2023

Upon further testing of this, it seems to remember things immediately before running out of tokens and resetting. Though sometimes it goes a bit off the rails if it was in the middle of telling a long-winded story. Makes somewhat coherent conversations possible even with very low context/n_predict sizes. Kinda crazy it's that simple. Reminds me of Stable Diffusion, where the breakthrough with token limits constraining prompt size and complexity was to just add more tokens when it ran out.

@tjohnman
Contributor Author

> Upon further testing of this, it seems to remember things immediately before running out of tokens and resetting. Though sometimes it goes a bit off the rails if it was in the middle of telling a long-winded story. Makes somewhat coherent conversations possible even with very low context/n_predict sizes. Kinda crazy it's that simple. Reminds me of Stable Diffusion, where the breakthrough with token limits constraining prompt size and complexity was to just add more tokens when it ran out.

The "reset" that happens when it runs out of tokens is not much of a reset, really. It's just keeping track of how many tokens it can generate before letting the user intervene again.

@rabidcopy
Contributor

Ah, I get it now. Running out of n_predict is fine and recoverable, but if you run out of ctx_size it still comes to a hard stop. Since increasing ctx_size past 2048 is not recommended for practical use, I take it there will still need to be some sort of rolling context cache that pushes out past history when it runs out of room, while possibly keeping the initial prompt cached? #71 (comment)
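
For illustration, a hypothetical sketch of the rolling context cache described above, assuming the context is held as a flat vector of token ids. `n_keep`, `n_ctx`, and `llama_token` here are placeholders, not the project's actual API; a real implementation would also have to shift or re-evaluate the model's KV cache for the evicted region.

```cpp
#include <cstdint>
#include <vector>

using llama_token = int32_t; // placeholder token type

// Keep the first n_keep tokens (the initial prompt), evict the oldest half
// of what follows once the context is full, and keep the recent history.
void roll_context(std::vector<llama_token> &ctx_tokens, size_t n_ctx, size_t n_keep) {
    if (ctx_tokens.size() < n_ctx) {
        return; // still room, nothing to evict
    }
    const size_t n_discard = (ctx_tokens.size() - n_keep) / 2;
    ctx_tokens.erase(ctx_tokens.begin() + n_keep,
                     ctx_tokens.begin() + n_keep + n_discard);
}
```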
