Implement ContainerPilot telemetry #16

misterbisson · 2016-04-18T04:02:48Z

ContainerPilot 2.0 introduced a telemetry feature that would be very useful for monitoring this application.

TritonDataCenter/containerpilot#27 proposed the following gauge:

The count of MySQL Query entries from SHOW PROCESSLIST that are in any Waiting state. 0 is great. 1 or above can be trouble. 10 or more is probably critical.

There are other MySQL-specific stats that would be very useful in scaling decisions. How would we write those sensors?

The text was updated successfully, but these errors were encountered:

tgross · 2016-04-18T14:11:12Z

Looks like we can get replication lag for the replicas via pt-heartbeat

misterbisson · 2016-09-12T03:10:26Z

@Smithx10 asked how to autoscale MySQL in #54. With telemetry implemented per this ticket (though the sensors still need to be defined), scaling will require two more pieces:

configured thresholds at which to scale up or down
a scheduler/supervisor that can apply those scaling rules

It's incredibly minimalistic, but I've been experimenting for the past few months with running docker-compose scale <service>=<count> via a recurring task (Jenkins or cron both work fine). I have to name all the services and their counts in that line, but that's pretty much all there is to supervision. If an instance of a service fails, that will bring it back up to healthy. If you log the activity and set alarms on the logging....

What I haven't done yet is to make the <count> dynamic based on telemetry data and scaling thresholds, but that would seem to be the next step. Of course, I plan to set some min and max values, but....

Smithx10 · 2016-09-12T17:10:30Z

After watching a few promcon presentations, would it make sense to use prometheus exporters and use a separate http call?

tgross · 2016-09-23T13:06:23Z

@neuroserve wrote in #58:

To enhance the setup, it might be a good idea to add Percona monitoring and management:
https://www.percona.com/doc/percona-monitoring-and-management/index.html

It consists basically of two Docker containers and the pmm-client package, that needs to be installed and activated on the mysql servers. The pmm-server IP/name could be transferred via its cns name (similar to the consul name).

It delivers query analysis and a grafana based metrics monitor. The backend is prometheus.

tgross · 2016-09-23T13:08:13Z

@Smithx10 and @neuroserve we've provided the Prometheus endpoint in ContainerPilot so that we can use the same interface to capture metrics from arbitrary applications. What the end user does with those metrics afterwards (put graphana in front of Prometheus or pipe them out via an exporter to a different storage engine) is left intentionally agnostic.

misterbisson · 2017-06-05T07:00:35Z

With ContainerPilot 3's first-class support for multi-process containers, it probably makes more sense to implement the "official" MySQL exporter for Prometeheus.

Related: a fancy dashboard for Grafana for that data.

tgross added the enhancement label Apr 20, 2016

misterbisson added the help wanted label Apr 21, 2016

tgross mentioned this issue Sep 6, 2016

How to AutoScale #54

Closed

tgross mentioned this issue Sep 23, 2016

Add Percona Monitoring and Management to the cluster #58

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement ContainerPilot telemetry #16

Implement ContainerPilot telemetry #16

misterbisson commented Apr 18, 2016

tgross commented Apr 18, 2016

misterbisson commented Sep 12, 2016 •

edited

Loading

Smithx10 commented Sep 12, 2016

tgross commented Sep 23, 2016 •

edited

Loading

tgross commented Sep 23, 2016

misterbisson commented Jun 5, 2017 •

edited

Loading

Implement ContainerPilot telemetry #16

Implement ContainerPilot telemetry #16

Comments

misterbisson commented Apr 18, 2016

tgross commented Apr 18, 2016

misterbisson commented Sep 12, 2016 • edited Loading

Smithx10 commented Sep 12, 2016

tgross commented Sep 23, 2016 • edited Loading

tgross commented Sep 23, 2016

misterbisson commented Jun 5, 2017 • edited Loading

misterbisson commented Sep 12, 2016 •

edited

Loading

tgross commented Sep 23, 2016 •

edited

Loading

misterbisson commented Jun 5, 2017 •

edited

Loading