Auto-resolve schema by confluent magic bytes #302
Unanswered
Yerachmiel-Feltzman
asked this question in
Q&A
Replies: 1 comment
-
Hello,
Yes, but how do you represent it in Spark? Since each row now has different data type they cannot be in one DataFrame. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
HI,
Is there a way to automatically resolve the schema by the leading magic byte for each message, which contains the schema id for that message?
As we all know, confluent AVRO prepends the schema id to the message. So, each message has its own schema id embedded to it.
ABRIS adds this id (the magic byte) when encoding the schema to support the Confluent format.
However, when decoding Confluent encoded messages, we must manually pass the schema configuration beforehand, making it hard to support cases where we can receive messages with different schemas (think of record strategy or even schema evolution).
I could implement a solution parsing mannualy each message magic byte and dynamically constructing the schema configurations for each message (or group of messages to save and avoid creating million of config objects).
Is there such a thing out of the box that I miss?
Thank you very much.
Beta Was this translation helpful? Give feedback.
All reactions