Merge mainline llama.cpp #3

ikawrakow · 2024-07-26T14:56:16Z

Only quick testing so far.

AVX2 and CUDA appear to work. CUDA performance seems slightly (~1-2%) lower as it is so often the case with llama.cpp/ggml after some "improvements" have been made.

ikawrakow · 2024-07-27T05:54:36Z

Seems to be working -> merging

Kawrakow added 4 commits July 26, 2024 16:32

Merging mainline - WIP

6b2b52d

Merging mainline - WIP

a0849e4

AVX2 and CUDA appear to work. CUDA performance seems slightly (~1-2%) lower as it is so often the case with llama.cpp/ggml after some "improvements" have been made.

Merging mainline - fix Metal

ddd97dc

Remove check

573e500

ikawrakow merged commit 154e0d7 into main Jul 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge mainline llama.cpp #3

Merge mainline llama.cpp #3

ikawrakow commented Jul 26, 2024

ikawrakow commented Jul 27, 2024

Merge mainline llama.cpp #3

Merge mainline llama.cpp #3

Conversation

ikawrakow commented Jul 26, 2024

ikawrakow commented Jul 27, 2024