Token streaming doesn't work #517
kobold_debug.json
For some reason token streaming just does not work. It's enabled, and the terminal output from the server updates with every token, but no messages are actually sent over the websocket to the UI, so nothing can be displayed until the response is complete. No idea what is going on.
I'm on the latest United commit 1e985ed.
Comments
What kind of model / API was it hooked up to?
Thought that would be in the debug json, but I've tried with both LLaMA 2 and Mixtral 8x7B in GGUF format, running on KoboldCPP (with cuBLAS and full offload to a 3090). I'm using the KoboldAI United UI (localhost:5000, not Lite).
United can't stream over the API; that's why streaming is missing.
What do you mean it can't stream over the API? So it can't stream at all?
It can stream when you use Hugging Face-based models in the main UI.
So I can't use my 3090 to run models? Or I can't use GGUF files?
You can't use GGUFs combined with United combined with streaming.
OK, so the solution is to not use GGUF then? The Lite UI is mostly unusable for me (it works fine, it just has an awful user experience).
Yes, the backends built into KoboldAI United should work (Huggingface, exllama2).
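A quick way to check whether token events are reaching the browser at all is to watch the websocket traffic (for example, the browser devtools Network → WS tab) and decode each frame. The helper below is a minimal, hypothetical sketch for that kind of inspection: the `streamtoken` command name and the payload shape are assumptions for illustration only, not a confirmed KoboldAI message schema.

```python
import json

def extract_stream_tokens(raw_message: str) -> list[str]:
    """Return the decoded token strings carried by one streaming message,
    or an empty list if the message is not a token-stream update.

    NOTE: the "streamtoken" command name and the {"data": [{"decoded": ...}]}
    payload shape are assumptions for illustration, not a confirmed schema.
    """
    msg = json.loads(raw_message)
    if msg.get("cmd") == "streamtoken":
        return [t.get("decoded", "") for t in msg.get("data", [])]
    return []

# Hypothetical payload, as a devtools "WS" frame might show it:
sample = json.dumps(
    {"cmd": "streamtoken",
     "data": [{"decoded": "Hello"}, {"decoded": " world"}]}
)
print(extract_stream_tokens(sample))  # -> ['Hello', ' world']
```

If frames like this never appear while the terminal is clearly printing tokens, the problem is on the server's emit path rather than in the UI's rendering.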