
feat: Self-Extend support #349

Closed
hahuyhoang411 opened this issue Jan 16, 2024 · 0 comments · Fixed by #351
Labels
P3: nice to have · type: feature request

Comments

@hahuyhoang411 (Contributor)

Problem
Self-Extend is a new technique that lets an LLM extend its context window to 8k or 16k tokens without further training.

Success Criteria
We can also use it in Nitro, via:

--grp-attn-n 2
--grp-attn-w 128
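
For context on what those flags control, below is a minimal sketch of the grouped-attention position mapping described in the Self-Extend paper, assuming `n` is the group size (`--grp-attn-n`) and `w` is the neighbor window (`--grp-attn-w`). The function name and defaults are illustrative, not Nitro or llama.cpp internals.

```python
# Minimal sketch of Self-Extend's grouped-attention position mapping
# (assumption: follows the Self-Extend paper, not Nitro/llama.cpp code).
# n = group size (--grp-attn-n), w = neighbor window (--grp-attn-w).

def self_extend_rel_pos(query_pos: int, key_pos: int, n: int = 2, w: int = 128) -> int:
    """Relative position a query token uses when attending to a key token."""
    rel = query_pos - key_pos
    if rel < w:
        # Neighbor tokens: normal attention with exact relative positions.
        return rel
    # Distant tokens: grouped attention. Positions are floor-divided by n and
    # shifted so the grouped range lines up at the neighbor-window boundary,
    # keeping every relative distance within what the model saw in training.
    return query_pos // n - key_pos // n + (w - w // n)

if __name__ == "__main__":
    # With n=2, w=128, a key 4096 tokens back is seen at distance 2112, not 4096.
    print(self_extend_rel_pos(4096, 0))
```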

Additional context
llama.cpp already supports it: ggerganov/llama.cpp#4815

@hahuyhoang411 added the P3: nice to have and type: feature request labels on Jan 16, 2024
@tikikun linked a pull request on Jan 16, 2024 that will close this issue