Releases: notsyncing/azarrot
Releases · notsyncing/azarrot
0.3.0
0.2.0
- Add IPEX-LLM backend
- Support InternVL2 on IPEX-LLM backend with OpenAI chat completion image input
- Support Qwen2 tool calling on IPEX-LLM and OpenVINO backend with OpenAI chat completion tools input
- Support embedding models on IPEX-LLM and OpenVINO backend with OpenAI embedding API
- Support parallel completion requests: concurrent completion requests can be submit on both OpenVINO and IPEX-LLM backends (not batching)
- Add README and changelog