
Exposes JMX for brokers, and exemplifies a key cluster-level metric #93

Closed
solsson wants to merge 5 commits

Conversation

solsson (Contributor) commented Nov 9, 2017

Already included in #49, but I would like to keep metrics opt-in, while that PR adds quite a heavy container to the pod.

The exposed port can be utilized by kafka-manager (#83) - just tick the JMX box when adding a cluster - to see bytes in/out rates.

solsson commented Nov 9, 2017

Given the countless options for consuming Kafka metrics, I'd like to avoid making a specific implementation like #49 "core" by adding it to the kafka and zookeeper manifests. Instead I'd like this repo to encourage experimentation with different methods. Also, since v3.0.0 there's an ongoing transition from the old addons concept to a feature folder, and I haven't found a way to keep the addition of extra containers to core pods separated into opt-in manifest files.

We do have to set the JMX_PORT env var by default, but that's rather standard for Kafka.
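
For illustration, setting JMX_PORT on the broker container amounts to something like the fragment below. This is a sketch of a StatefulSet container spec, not the PR's exact diff; the container name and the port number 5555 are assumptions.

  # Fragment of the kafka StatefulSet's broker container (sketch; name and port are assumptions)
  - name: broker
    env:
    - name: JMX_PORT
      value: "5555"
    ports:
    - name: jmx
      containerPort: 5555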

The brokers-prometheus deployment in this PR (a sketch of such a deployment follows the list) has IMO these advantages:

  • Easier to experiment with memory and cpu limits, because pod stats are easily available.
  • We don't run cluster-level metrics against an unready broker pod.
  • Broker-level metrics can have a whitelist optimized for actual broker-level metrics (see the config sketch further down).
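
For context, the kind of standalone exporter Deployment this describes could look roughly like the sketch below. It is illustrative only: the image, ConfigMap name, namespace and HTTP port 5556 are assumptions, not taken from the PR.

  # Illustrative sketch of a standalone JMX exporter Deployment (not the PR's actual manifest).
  # Image, ConfigMap name, namespace and ports are assumptions.
  apiVersion: apps/v1beta2
  kind: Deployment
  metadata:
    name: brokers-prometheus
    namespace: kafka
  spec:
    replicas: 1
    selector:
      matchLabels:
        app: brokers-prometheus
    template:
      metadata:
        labels:
          app: brokers-prometheus
      spec:
        containers:
        - name: jmx-exporter
          # Placeholder image; anything bundling jmx_prometheus_httpserver works.
          image: jmx-exporter:latest
          command:
          - java
          - -jar
          - jmx_prometheus_httpserver.jar
          - "5556"
          - /etc/jmx/jmx-exporter.yml
          ports:
          - containerPort: 5556
          volumeMounts:
          - name: config
            mountPath: /etc/jmx
        volumes:
        - name: config
          configMap:
            name: jmx-exporter-config  # placeholder; carries the exporter's config file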

Scrape times on minikube for this single metric are 5-15 seconds for me. Not very good.

solsson commented Nov 9, 2017

A feature that worked in #49 too, but was less of an advantage there because it used the same configmap as kafka: you can simply apply 10-metrics-config.yml and the exporter will show jmx_config_reload_success_total 1.0.

... though I've seen PartitionCount toggle between including the partitions in __consumer_offsets and not doing so.
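
For illustration, a whitelist-style exporter config of the kind 10-metrics-config.yml provides could look roughly like the sketch below. The broker address and JMX port are assumptions; only the resulting metric names match the ones quoted in this thread.

  # Sketch of a standalone JMX exporter config (not the PR's actual file).
  # The broker address and JMX port are assumptions.
  hostPort: "kafka-0.broker.kafka.svc.cluster.local:5555"
  lowercaseOutputName: false
  whitelistObjectNames:
  - "kafka.server:type=ReplicaManager,name=*"
  rules:
  - pattern: "kafka.server<type=ReplicaManager, name=(PartitionCount|UnderReplicatedPartitions)><>Value"
    name: kafka_server_ReplicaManager_Value
    labels:
      name: "$1"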
solsson commented Nov 10, 2017

This PR is a poor replacement for #49. If I kill one broker (after editing the init script so it won't come up again), my /metrics output, with the two test clients running, alternates between:

kafka_server_ReplicaManager_Value{name="PartitionCount",} 51.0
kafka_server_ReplicaManager_Value{name="UnderReplicatedPartitions",} 0.0

and

kafka_server_ReplicaManager_Value{name="PartitionCount",} 2.0
kafka_server_ReplicaManager_Value{name="UnderReplicatedPartitions",} 1.0

This means UnderReplicatedPartitions is per broker, unlike with the test in #95.

Got the scrape times down to 0.2 seconds again; that's a consolation :)

I'll go ahead and explore more monitoring options. The addition of JMX_PORT in e2ae2bf is OK to merge, I think.

solsson commented Feb 2, 2018

#128 replaced this PR. With it you get, for example, kafka_server_replicamanager_value{name="UnderReplicatedPartitions"}.

solsson closed this Feb 2, 2018