Skip to content

Releases: ggerganov/llama.cpp

b4067

11 Nov 18:28
54ef9cf
Compare
Choose a tag to compare
vulkan: Throttle the number of shader compiles during the build step.…

b4066

11 Nov 08:00
b0cefea
Compare
Choose a tag to compare
metal : more precise Q*K in FA vec kernel (#10247)

b4065

11 Nov 07:56
b141e5f
Compare
Choose a tag to compare
server : enable KV cache defrag by default (#10233)

ggml-ci

b4062

10 Nov 12:30
160687b
Compare
Choose a tag to compare
vulkan: Fix newly added tests for permuted mul_mat and 1D im2col (#10…

b4061

09 Nov 12:51
6423c65
Compare
Choose a tag to compare
metal : reorder write loop in mul mat kernel + style (#10231)

* metal : reorder write loop

* metal : int -> short, style

ggml-ci

b4060

09 Nov 12:51
39a334a
Compare
Choose a tag to compare
metal : fix build and some more comments (#10229)

b4059

09 Nov 12:51
bb38cdd
Compare
Choose a tag to compare
metal : fix F32 accumulation in FA vec kernel (#10232)

b4058

09 Nov 12:51
f018acb
Compare
Choose a tag to compare
llama : fix Qwen model type strings

b4057

09 Nov 12:50
46323fa
Compare
Choose a tag to compare
metal : hide debug messages from normal log

b4056

09 Nov 08:50
5b359bb
Compare
Choose a tag to compare
ggml: fix zero division in ‘dne’ calculation in CUDA COUNT_EQUAL oper…