Per-route historical metrics #1737

grampelberg · 2018-10-03T18:44:18Z

Linkerd has metrics at the service level. These are valuable for understanding the general health of a service. Unfortunately, it is usually a specific route that is having problems. While it is possible to run top and get some metrics today, these are gathered in real time and are not stored in Prometheus.

It should be possible to see per-route metrics alongside the existing per-service metrics (success rate, latency, throughput). These would shorten the time it takes to fix issues and provide visibility into what is really happening.

User Stories

As a service owner, I would like to see per-route metrics in my dashboards so that I can quickly see any endpoints that are operating outside my SLO.
As a service owner, I would like to see a list of all the routes in my service and sort that list by success rate, so that I can quickly see what is currently failing.
As a service owner, I would like to have per-route metrics aggregated by URL parameters such as user id, so that I can quickly see what code path is being taken.
As a service owner, I would like to persist per-route metrics so that I can use them to debug historical issues.
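
A minimal sketch of what the second and third stories could look like in practice, assuming requests are grouped under a route template before success rates are computed. This is one reading of "aggregated by URL parameters". The route patterns, the `[unmatched]` bucket, and every function name here are hypothetical and are not part of Linkerd:

```python
import re
from dataclasses import dataclass


@dataclass
class RouteStats:
    """Success/failure counters for one route template."""
    route: str
    success: int = 0
    failure: int = 0

    @property
    def success_rate(self) -> float:
        total = self.success + self.failure
        # A route with no traffic is treated as healthy.
        return self.success / total if total else 1.0


# Hypothetical route templates; the URL parameter (user id) is collapsed
# into a single route so cardinality stays bounded.
ROUTE_PATTERNS = [
    ("GET /users/{id}", re.compile(r"^GET /users/[^/]+$")),
    ("GET /books", re.compile(r"^GET /books$")),
]


def route_for(method: str, path: str) -> str:
    """Map a concrete request to its route template."""
    key = f"{method} {path}"
    for name, pattern in ROUTE_PATTERNS:
        if pattern.match(key):
            return name
    return "[unmatched]"  # assumed catch-all bucket


def aggregate(requests):
    """Return per-route stats sorted by success rate, worst first."""
    stats = {}
    for method, path, ok in requests:
        name = route_for(method, path)
        entry = stats.setdefault(name, RouteStats(name))
        if ok:
            entry.success += 1
        else:
            entry.failure += 1
    return sorted(stats.values(), key=lambda s: s.success_rate)
```

Sorting worst-first puts the routes that are currently failing at the top of the list, which is the view the second story asks for.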

UX

CLI

GUI

Dashboards

Open Questions


klingerf · 2018-10-03T19:14:17Z

Related to #187.

grampelberg · 2018-10-03T19:31:00Z

Also related to #1418.

ctaggart · 2018-11-27T23:40:35Z

Any update on per-route metrics? Is it stubbed out in the UI and not implemented on the backend as of linkerd edge 18.11.2 as it appears here?

grampelberg · 2018-11-28T16:21:31Z

@ctaggart it is on master (so it'll be in the next edge release). I believe most of the stuff you need is in 18.11.2 but the CLI pieces to actually see what is going on didn't land until earlier this week.


grampelberg added area/proxy area/controller area/web area/cli area/telemetry stage/proposal labels Oct 3, 2018

grampelberg added this to the Hotspur milestone Oct 3, 2018

grampelberg added the priority/P0 Release Blocker label Oct 3, 2018

grampelberg added the needs/more label Oct 3, 2018

klingerf mentioned this issue Oct 9, 2018

Adjust telemetry reporting of paths to handle higher cardinality #187

Closed

klingerf mentioned this issue Nov 27, 2018

No stats on grafana Linkerd Services dashboard for our namespace #1451

Closed

rmars mentioned this issue Nov 27, 2018

Add the top routes feature to the dashboard UI #1868

Merged

grampelberg removed this from the Hotspur milestone Nov 28, 2018

grampelberg closed this as completed Feb 6, 2019

lundbird pushed a commit to lundbird/linkerd2 that referenced this issue Mar 9, 2020

add a dashboard for routes to the linkerd grafana instance

b947acd

Fixes linkerd#1737 Signed-off-by: alex lundberg <[email protected]>

lundbird pushed a commit to lundbird/linkerd2 that referenced this issue Mar 9, 2020

add a dashboard for routes to the linkerd grafana instance

a0d1c74

Fixes linkerd#1737 Signed-off-by: alex lundberg <[email protected]>

lundbird pushed a commit to lundbird/linkerd2 that referenced this issue Mar 9, 2020

Add route dashboard to grafana instance

6b820c9

Fixes linkerd#1737 Signed-off-by: alex lundberg <[email protected]>

lundbird mentioned this issue Mar 9, 2020

Add route dashboard to grafana instance #4155

Merged

alpeb pushed a commit that referenced this issue Mar 27, 2020

Add route dashboard to grafana instance (#4155)

0d4d2dc

* Add route dashboard to grafana instance Fixes #1737 Signed-off-by: alex lundberg <[email protected]>

github-actions bot locked as resolved and limited conversation to collaborators Jul 18, 2021
