
feat: Switch vulkan images to the kompute backend #83

Open · wants to merge 2 commits into main from switch-to-kompute

Conversation

@slp commented Nov 28, 2024

The latest versions of MoltenVK (tested with 1.2.11) cannot properly handle llama.cpp's Vulkan (GGML_VULKAN) backend.

Switch our Vulkan images to the Kompute (GGML_KOMPUTE) backend instead, which works well with MoltenVK and is expected to eventually match Metal performance on bare-metal macOS.
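Concretely, the switch amounts to changing which GGML backend flag is passed when building llama.cpp inside the images. A minimal sketch, assuming the standard llama.cpp CMake workflow (flag names are the ones named in this PR; exact invocation may differ in the actual Containerfile):

```shell
# Before: Vulkan backend, which misbehaves under recent MoltenVK
#   cmake -B build -DGGML_VULKAN=ON

# After: Kompute backend, which works with MoltenVK
cmake -B build -DGGML_KOMPUTE=ON
cmake --build build --config Release
```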

@slp slp requested a review from a team as a code owner November 28, 2024 17:44
@slp slp requested review from dgolovin, jeffmaury and cdrage November 28, 2024 17:44
@jeffmaury jeffmaury changed the title Switch vulkan images to the kompute backend feat: Switch vulkan images to the kompute backend Nov 29, 2024
Containerfile lines under review:

dnf install -y git cmake ninja-build gcc gcc-c++ vulkan-loader-devel vulkan-tools && \
dnf install -y mesa-vulkan-drivers-24.1.2-101.el9.aarch64 && \
dnf versionlock mesa-vulkan-drivers-24.1.2-101.el9.aarch64 && \
dnf install -y git cmake ninja-build gcc gcc-c++ vulkan-loader-devel vulkan-tools fmt-devel && \
dnf copr enable -y jeffmaury/shaderc epel-9-aarch64 && \
Contributor:
This part was added recently because it was required by the llama.cpp Vulkan build. Is it still required for Kompute?

Author:

Yes, I updated the mesa package because el9 switched to a newer LLVM.

Contributor:

Sorry, I was not clear enough. I was referring to the shaderc/glslc part.

Author:

Ah, yes, Kompute also needs glslc.
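For context: the Kompute backend compiles its GLSL compute shaders to SPIR-V at build time, which is why glslc (shipped with shaderc, hence the copr repo above) must be on the PATH. A hedged sketch of what the build invokes under the hood (the shader filename here is hypothetical):

```shell
# Compile a GLSL compute shader to SPIR-V; op_example.comp is a
# placeholder name, not a file from this PR.
glslc -fshader-stage=compute op_example.comp -o op_example.spv
```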

slp added 2 commits December 11, 2024 16:44
Signed-off-by: Sergio Lopez <[email protected]>
@slp slp force-pushed the switch-to-kompute branch 3 times, most recently from 665759f to 188866a Compare December 13, 2024 18:32