roadmap: llamacpp-engine to align with llama.cpp upstream #1728

dan-homebrew · 2024-11-26T07:34:57Z

Goal

Goal: Can we have a minimalist fork of llama.cpp as llamacpp-engine
- cortex.cpp's desktop focus means Drogon's features are unused
- We should contribute our vision and multimodal work upstream as a form of llama.cpp server
- Very clear Engines abstraction (i.e. support OpenVino etc in the future)
Goal: Contribute upwards to llama.cpp
- Vision, multimodal
- May not be possible if the vision, audio encoders are Python-runtime based

Can we consider refactoring llamacpp-engine to use the server implementation, and maintain a fork with our improvements to speech, vision etc? This is especially if we do a C++ implementation of whisperVQ in the future.

The text was updated successfully, but these errors were encountered:

vansangpfiev · 2024-11-26T08:24:12Z

I agree that we should align with the llama.cpp upstream, but I have several concerns:

Drogon is part of cortex.cpp, we have already removed it from llama-cpp engine. If we remove Drogon from cortex.cpp, we need to find a replacement, which will be costly.
Repository Structure: Forking the server implementation will necessitate changes to our repository structure, since we currently use llama.cpp as a submodule.
Our current version differs significantly from the upstream version, which will require considerable time for refactoring.

vansangpfiev · 2024-12-23T02:21:58Z

dan-homebrew added the type: epic A major feature or initiative label Nov 26, 2024

dan-homebrew assigned vansangpfiev Nov 26, 2024

dan-homebrew added this to Jan & Cortex Nov 26, 2024

github-project-automation bot moved this to Investigating in Jan & Cortex Nov 26, 2024

gabrielle-ong added this to the v1.0.5 milestone Nov 28, 2024

gabrielle-ong mentioned this issue Nov 28, 2024

Sprint 26 Planning #1735

Closed

gabrielle-ong removed this from the v1.0.5 milestone Nov 28, 2024

dan-homebrew closed this as completed Dec 15, 2024

github-project-automation bot moved this from Investigating to QA in Jan & Cortex Dec 15, 2024

dan-homebrew reopened this Dec 15, 2024

github-project-automation bot moved this from QA to In Progress in Jan & Cortex Dec 15, 2024

dan-homebrew changed the title ~~epic: llamacpp-engine to align with llama.cpp upstream~~ roadmap: llamacpp-engine to align with llama.cpp upstream Dec 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

roadmap: llamacpp-engine to align with llama.cpp upstream #1728

roadmap: llamacpp-engine to align with llama.cpp upstream #1728

dan-homebrew commented Nov 26, 2024 •

edited

Loading

vansangpfiev commented Nov 26, 2024

vansangpfiev commented Dec 23, 2024 •

edited

Loading

roadmap: llamacpp-engine to align with llama.cpp upstream #1728

roadmap: llamacpp-engine to align with llama.cpp upstream #1728

Comments

dan-homebrew commented Nov 26, 2024 • edited Loading

Goal

vansangpfiev commented Nov 26, 2024

vansangpfiev commented Dec 23, 2024 • edited Loading

dan-homebrew commented Nov 26, 2024 •

edited

Loading

vansangpfiev commented Dec 23, 2024 •

edited

Loading