Suggestion for Official Releases of LLMs: Include Quantized Versions #17

Open · Labels: wontfix (This will not be worked on)
epochaudio opened this issue May 28, 2024 · 1 comment

Comments

@epochaudio
For officially released LLMs, we suggest also publishing AWQ and GPTQ quantized versions in the future. This practice costs almost nothing, yet it would help many potential users who lack GPUs. It would also make the models more convenient to use, since officially released quantized versions are generally considered more authoritative.
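
For context, producing an AWQ release is typically a short script. Below is a minimal sketch using the AutoAWQ library; the model id, output path, and quantization settings are illustrative assumptions, not something specified in this issue:

```python
# Minimal AWQ quantization sketch (assumed model id and settings).
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "internlm/internlm2-math-plus-1_8b"   # assumed source model
quant_path = "internlm2-math-plus-1_8b-AWQ"        # local output directory
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

# Load the full-precision model and its tokenizer.
model = AutoAWQForCausalLM.from_pretrained(model_path, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# Run AWQ calibration and quantize the weights to 4 bit.
model.quantize(tokenizer, quant_config=quant_config)

# Save the quantized checkpoint for upload alongside the official release.
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```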

@objecti0n (Collaborator)

Please use the community's version for now: https://huggingface.co/legraphista/internlm2-math-plus-1_8b-IMat-GGUF
We will consider adding official quantized versions later.
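
For anyone trying that community GGUF, here is a minimal sketch of loading it with llama-cpp-python; the quant filename pattern is an assumption and should be matched against the files actually present in the repo:

```python
# Minimal sketch of running the community GGUF with llama-cpp-python.
from llama_cpp import Llama

# Downloads a matching GGUF file from the Hugging Face repo and loads it on CPU.
llm = Llama.from_pretrained(
    repo_id="legraphista/internlm2-math-plus-1_8b-IMat-GGUF",
    filename="*Q8_0.gguf",  # assumed quant level; pick a file that exists in the repo
    n_ctx=2048,
)

output = llm("Solve for x: 2x + 3 = 11.", max_tokens=64)
print(output["choices"][0]["text"])
```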

@objecti0n added the wontfix label on Jun 6, 2024