Phi-2 model dies upon reaching the context size limit when using console llama.cpp app #4625
Upon reaching the context size limit (2048), the Phi-2 model suddenly goes silent or starts producing gibberish (random letters). Logs are included.

NOTE: The server app is not affected by this bug, only the console app.

llama.cpp versions: 1661 (a7aee47), 1698 (753be37)
Model: https://huggingface.co/TheBloke/dolphin-2_6-phi-2-GGUF
Logs: 1661, 1698
Reference: #4490

Comments
This is not an inference issue, but rather a limitation of the model itself. The same gibberish output occurs when running inference with LLaMA 1. LLaMA 2 is better designed to maintain coherence past its 4k context, but it still normally falls into incoherent loops after ~4.3k.
To my knowledge, upon reaching the context size limit llama.cpp clears the context, then fills half of it with the immediately preceding text and resumes generation. That is not happening here; the model completely breaks down. You can check the attached log. I don't know why everyone outright reads 'gibberish' as 'normal but somewhat incoherent text'.
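For reference, the context shift in the console app looks roughly like the following (a paraphrased sketch of the logic in `examples/main/main.cpp` around these builds, not the verbatim code; variable names and exact bounds may differ):

```cpp
// Sketch of the context shift in examples/main/main.cpp (paraphrased).
// When the next batch would overflow the context window, the oldest
// half of the non-kept tokens is dropped and the rest slides down.
if (n_past + (int) embd.size() > n_ctx) {
    const int n_left    = n_past - params.n_keep - 1; // tokens eligible for discarding
    const int n_discard = n_left / 2;                 // discard the older half

    // erase the discarded span from the KV cache ...
    llama_kv_cache_seq_rm   (ctx, 0, params.n_keep + 1, params.n_keep + 1 + n_discard);
    // ... and shift the remaining tokens down into the freed positions
    llama_kv_cache_seq_shift(ctx, 0, params.n_keep + 1 + n_discard, n_past, -n_discard);

    n_past -= n_discard;
}
```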
Try to keep the system prompt in the context using the `--keep` option.
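For example (a hypothetical invocation; the GGUF filename and prompt are placeholders, and `--keep -1` tells `main` to retain the entire initial prompt through context shifts):

```sh
# Keep the full initial (system) prompt when the 2048-token
# context fills up and gets shifted.
./main -m dolphin-2_6-phi-2.Q4_K_M.gguf -c 2048 --keep -1 \
       -p "System: You are Dolphin, a helpful assistant.\nUser: " -i
```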
The model completely breaks (as in it becomes silent to responses or produces gibberish: a bunch of random letters). I tried running Phi-2 on the llama.cpp server to see what happens when the context gets full, and it doesn't break like the console app: it proceeds by clearing half of the KV cache and refilling it with the immediately preceding text, as it's supposed to, and then continues working normally.
Fixed via #4889 |