Releases: ggerganov/llama.cpp
b4126
cuda : only use native when supported by cmake (#10389)
b4122
Vulkan: Fix device info output format specifiers (#10366)
* Vulkan: Fix device info output format specifiers
* Vulkan: Use zu printf specifier for size_t instead of ld
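For context on the b4122 specifier change, here is a minimal sketch of printing a `size_t` with `%zu`. The helper name `format_byte_count` and the message text are hypothetical and are not taken from the llama.cpp source; the sketch shows only the general C idiom, not the actual Vulkan backend code.

```c
#include <stddef.h>
#include <stdio.h>

/* Hypothetical helper (not from llama.cpp): writes a byte count into buf.
 * Returns the number of characters snprintf would have written,
 * excluding the terminating null byte. */
static int format_byte_count(char *buf, size_t buf_len, size_t n_bytes) {
    /* %zu is the standard conversion for size_t. %ld expects a long,
     * which is only 32 bits wide on 64-bit Windows, so passing a
     * size_t to %ld is a type mismatch there. */
    return snprintf(buf, buf_len, "%zu bytes", n_bytes);
}
```

Using `%zu` avoids the cast to `long` or `unsigned long` that `%ld` or `%lu` would otherwise need to be correct across platforms.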
b4120
CUDA: fix MMV kernel being used for FP16 src1 (#10357)
b4118
llama : only use default buffer types for the KV cache (#10358)
b4115
ggml : fix undefined reference to 'getcpu' (#10354)
https://github.com/ggerganov/llama.cpp/issues/10352
b4114
CUDA: remove DMMV, consolidate F16 mult mat vec (#10318)
b4113
CMake: default to -arch=native for CUDA build (#10320)
b4112
ggml : fix possible buffer use after free in sched reserve (#9930)
b4111
ggml : inttypes.h -> cinttypes (#0)
ggml-ci
b4103
llama/ex: remove --logdir argument (#10339)