Add support for client and server message buffering #28
base: master
Conversation
It has been renamed to `ClientCallOptionsApi` so it's no longer needed.
One of the current problems with our implementation is that `request` is only called when the downstream consumer finishes processing a message received from the server. That is, if the server sends 10 messages on the same pipe while the client consumer is busy processing the first one, the other nine messages are not pushed into the observable and might not even reach the client's grpc netty buffer. Only when the client finishes processing the first message does it call `request`, process another message, and so on: every message is pulled right before it is processed and not earlier. This carries an obvious performance penalty, and it can change the way the consumer processes messages that could otherwise be sent or processed in parallel, resulting in unexpected user semantics.

This commit uses `asyncBoundary` to fix this problem and have monix-grpc populate an internal buffer with as many messages as fit in it, while still allowing consumers to process these messages individually without waiting for the buffer to fill completely. If the server sends more messages than fit in the buffer, `asyncBoundary` stops consuming messages from the grpc source and consequently stops issuing `request` calls, so the grpc runtime no longer calls `onMessage` until the messages in the buffer are processed downstream.

This behavior is great because the messages delivered by `onMessage` are added to a hot observable, and `asyncBoundary` gives us all the benefits of a backpressured source without actually using one. As a result, the hot observable never grows larger than the buffer, because `onMessage` is only called when `request` is called. So this simple solution gives us exactly the semantics we want (a sketch of the mechanism follows the list below). However, there are still some things to improve, namely:

- Users might want to specify a "low tide" (a low watermark) for this buffer: a number below which we always refill the buffer to capacity, but above which we do not refill it. This argument gives the client the flexibility to optimize for memory if it doesn't want the buffer filled up to its limit.
- We might want to `request` several messages at once instead of one at a time when repopulating the buffer. It's unclear whether this actually brings any benefit, as grpc's internal engine is likely already buffering data in Netty byte buffers, so calling `request` once per message might not pose any performance penalty.
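For illustration, here is a minimal sketch of the mechanism described above. This is not the actual monix-grpc internals: `incoming` stands in for the hot observable that the grpc listener's `onMessage` pushes into, and the buffer size of 32 is an arbitrary example.

```scala
import scala.concurrent.duration._
import monix.eval.Task
import monix.reactive.{Observable, OverflowStrategy}
import monix.reactive.subjects.PublishSubject

// Hot source: stands in for the observable fed by the grpc listener's
// `onMessage` callback (illustrative, not the real monix-grpc wiring).
val incoming = PublishSubject[String]()

// The async boundary keeps an internal buffer of up to 32 messages. While
// there is room, upstream acks return immediately, so `request` keeps being
// called and `onMessage` keeps feeding the buffer. Once the buffer is full,
// the upstream ack is delayed, no further `request` is issued, and the grpc
// runtime stops calling `onMessage` until the consumer drains the buffer.
val buffered: Observable[String] =
  incoming.asyncBoundary(OverflowStrategy.BackPressure(32))

// A slow consumer still processes messages one by one, while the buffer is
// refilled concurrently behind the boundary.
val done: Task[Unit] =
  buffered
    .mapEval(msg => Task(println(s"processing $msg")).delayResult(50.millis))
    .completedL
```

The key design point is that `OverflowStrategy.BackPressure` is the one overflow strategy that never drops or fails: it simply withholds the ack, which is exactly what translates into "no more `request` calls" here.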
A similar adjustment to the one made for the client call options API. I've taken the liberty to redesign how server call options are passed in, regardless of binary compatibility, since we don't promise backwards compatibility until we make a stable release >= 1.0. I have also added more documentation to the server call to make it clear that we are trying to replicate the API of the underlying grpc-java library.
* @param call is the instance of the wrapped grpc server call.
* @param options is a field for custom user-defined server call options.
*/
final class ServerCall[Request, Response] private (
One thing that is not clear to me yet is how you set this `ServerCallOptions`. Is it meant to be set while registering the Api to the `GrpcServer`?
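For illustration, one plausible shape of what this question describes; every name here (`ServerCallOptions.default`, `withBufferSize`, `TestServiceApi.service`, `TestServiceImpl`) is hypothetical and not confirmed by this PR.

```scala
import io.grpc

// Purely hypothetical sketch: passing the options when the generated Api
// is bound into a service definition and registered with the server.
val options = ServerCallOptions.default.withBufferSize(32)

val service: grpc.ServerServiceDefinition =
  TestServiceApi.service(new TestServiceImpl, options)

val server = grpc.ServerBuilder
  .forPort(9000)
  .addService(service)
  .build()
  .start()
```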
* expected behavior of a configuration, check the grpc-java defaults.
*/
final class ServerCallOptions private (
  val compressor: Option[grpc.Compressor],
I think using default values instead of `Option` would make the code easier to follow. For instance, `bufferSize = None` can be expressed as `bufferSize = 0` (no buffer); "no compression" is `Compressor.Identity`, and the default could be `Compressor.Gzip`. Likewise, `enabledMessageCompression` should be either `true` or `false`; having `true`, `false`, and `None` is a bit weird.
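A sketch of this suggestion, assuming the field names from the diff above; the defaults themselves are illustrative. grpc-java's identity codec (`Codec.Identity.NONE`) is its no-op compressor, and `new Codec.Gzip()` would be the gzip alternative mentioned above.

```scala
import io.grpc

// Sketch of the reviewer's suggestion: concrete defaults instead of Option
// fields. Field names follow the diff above; defaults are illustrative.
final case class ServerCallOptions(
  compressor: grpc.Compressor = grpc.Codec.Identity.NONE, // no compression
  bufferSize: Int = 0, // 0 == no buffer
  enabledMessageCompression: Boolean = false
)
```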