Add optional querier response streaming #7173
Conversation
can you add a few sentences to the description with some more details for how this works? It will make the review easier. Or if you have a design doc, that'd suffice too.
Thanks for the suggestion, and excuse the previously barren PR description @dimitarvdimitrov; I have added some context.
I had a high-level look and the implementation looks good. It fits fairly naturally in the current model. I will take a deeper look later on too.
Perhaps, if we want to take this one step further, we could do the preliminary work of adding a streaming proto file in httpgrpc instead of only in frontend.proto. I am less hopeful that we will actually do proper streaming httpgrpc, because it would require changing a lot of code, but I decided to bring it up.
Would be nice to sync with @aknuds1 on his plans for streaming to make sure we don't implement the same thing in two different ways.
From internal discussions I gathered that the general sentiment in the team is that streaming through the frontend is not very useful for most endpoints (e.g. because of caching done by frontends). Given that, I decided it would be better to keep the change as local as possible for now.
very solid work! 💪
My main comments are around making the tests a bit more elaborate; most of the others are nitpicks. Otherwise I couldn't find anything surprising.
final set of comments and then we can merge :)
What this PR does
Some Mimir API endpoints can generate responses with a very large response body. Today, responses are transmitted from queriers to query-frontends via the non-streaming gRPC call `QueryResult`. In the case of the `/series` and `/active_series` APIs, the (partial) responses generated by queriers can get so large that they cause the query-frontend handling the request to OOM on unmarshaling the `QueryResultRequest` containing the response. An example profile from an internal cell demonstrates this behaviour.

While queriers need to accumulate the entire response in memory to be able to deduplicate results from ingesters, query-frontends do not need to bring the full response into memory in all cases. The `/active_series` endpoint in particular never needs to do this and was implemented in a streaming fashion to avoid OOMs for large result sets. However, since all response bodies for all types of queries are fully loaded into query-frontend memory in today's implementation of the communication between queriers and query-frontends, this streaming is essentially useless.
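For concreteness, here is a minimal Go sketch of the existing non-streaming handoff. The `frontendClient` interface and `queryResultRequest` type are simplified, hypothetical stand-ins for the real definitions in frontend.proto, not Mimir's actual API:

```go
// Sketch of the existing non-streaming handoff (simplified). The querier
// buffers the whole response body, then ships it to the query-frontend in
// a single message, which the frontend must unmarshal fully into memory.
package sketch

import (
	"context"
	"io"
	"net/http"
)

// queryResultRequest is a hypothetical stand-in for the proto message.
type queryResultRequest struct {
	QueryID uint64
	Body    []byte // the entire response body, however large
}

// frontendClient is an illustrative stand-in for the querier's view of
// the query-frontend's gRPC service.
type frontendClient interface {
	QueryResult(ctx context.Context, req *queryResultRequest) error
}

func sendResult(ctx context.Context, fc frontendClient, queryID uint64, resp *http.Response) error {
	defer resp.Body.Close()
	body, err := io.ReadAll(resp.Body) // full response buffered in querier memory
	if err != nil {
		return err
	}
	// One large message: the frontend allocates the whole body on unmarshal,
	// which is what can OOM it for /series and /active_series responses.
	return fc.QueryResult(ctx, &queryResultRequest{QueryID: queryID, Body: body})
}
```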
To avoid this issue and enable a single query-frontend instance to process arbitrarily large responses, this PR proposes the introduction of a new (streaming) rpc that queriers can invoke instead of the `QueryResult` call to transmit responses to query-frontends in a streaming fashion. This essentially requires two changes:

1. The `frontendRequest` data-structure can no longer be canceled/cleaned up on return from `RoundTripGRPC`. Instead, the required cleanup is captured in a closure and passed to the caller, who must invoke it by closing the response body once the transfer is complete (see the sketch below this list). Because of that, we risk introducing a memory leak that could be triggered by client code not closing the response body (today, not closing the response body would not lead to a leak, because the body is an `io.NopCloser`).
2. The streaming behaviour is gated behind a feature flag, and only requests to the `/active_series` endpoint request it; all other responses will continue to be transmitted via the existing rpc even when the feature flag is enabled.
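A minimal sketch of the cleanup-on-Close mechanism from change 1, under assumed names (`cleanupReadCloser` and `wrapResponse` are illustrative, not Mimir's actual identifiers):

```go
// Sketch of change 1: instead of cleaning up the frontend's per-request
// state when RoundTripGRPC returns, the cleanup is captured in a closure
// and wired into the response body, so it runs when the caller closes the
// body after consuming the stream.
package sketch

import (
	"io"
	"net/http"
	"sync"
)

// cleanupReadCloser runs the cleanup closure exactly once, when the
// response body is closed.
type cleanupReadCloser struct {
	io.Reader
	once    sync.Once
	cleanup func()
}

func (c *cleanupReadCloser) Close() error {
	c.once.Do(c.cleanup) // e.g. cancel the request context, drop queue state
	return nil
}

// wrapResponse attaches the cleanup to the streamed body. If the caller
// never closes the body, the cleanup never runs -- the leak risk noted
// above. (Previously the body was an io.NopCloser, so a missing Close
// was harmless.)
func wrapResponse(resp *http.Response, cleanup func()) *http.Response {
	resp.Body = &cleanupReadCloser{Reader: resp.Body, cleanup: cleanup}
	return resp
}
```

Tying the cleanup to `Close` keeps the `RoundTripGRPC` call signature unchanged while shifting the cleanup responsibility to the caller, which is precisely where the leak risk described above comes from.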
One less-than-obvious caveat of this streaming behaviour is that queriers participate in the handling of the request for longer than they previously would have (namely until the response body is fully streamed into the query-frontend, which itself may block on streaming to the client). Since request concurrency in queriers is limited by the `-querier.max-concurrent` flag, this may lead to slow clients and/or big requests blocking querier workers. The following pair of example traces, with and without streaming, illustrates this difference.

Without response streaming

With response streaming
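To make this caveat concrete, here is a rough sketch of what a querier-side send loop could look like. `resultStream` is an assumed stand-in for the client side of the new streaming rpc, and the chunk size is arbitrary; the point is that the handler, and with it one `-querier.max-concurrent` worker slot, is held until the frontend has consumed every chunk:

```go
// Sketch of a querier-side streaming send loop. The worker goroutine
// running this does not return until the entire body has been streamed,
// so a slow consumer holds one -querier.max-concurrent slot for the
// whole transfer.
package sketch

import "io"

// resultStream is an illustrative stand-in for the client side of the
// new streaming rpc.
type resultStream interface {
	Send(chunk []byte) error // blocks when the receiver applies backpressure
	CloseAndRecv() error
}

func streamResult(stream resultStream, body io.Reader) error {
	buf := make([]byte, 1<<20) // stream in ~1 MiB chunks (illustrative size)
	for {
		n, err := body.Read(buf)
		if n > 0 {
			// Send blocks until the frontend accepts the chunk; if the
			// frontend is itself blocked streaming to a slow client, the
			// querier worker stays occupied here.
			if sendErr := stream.Send(buf[:n]); sendErr != nil {
				return sendErr
			}
		}
		if err == io.EOF {
			return stream.CloseAndRecv()
		}
		if err != nil {
			return err
		}
	}
}
```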
If this approach proves viable, other request types might also benefit from streaming in the future (e.g. remote read).
Which issue(s) this PR fixes or relates to
n/a
Checklist
- `CHANGELOG.md` updated - the order of entries should be `[CHANGE]`, `[FEATURE]`, `[ENHANCEMENT]`, `[BUGFIX]`.
- `about-versioning.md` updated with experimental features.