How to handle hw counter multiplexing case? #2422

tbosch · 2019-12-10T18:06:51Z

Hello,
We are currently trying to run rr on machines that execute different kinds of jobs. We are seeing failures for rr because some of the other jobs are also using hw counters, and the kernel is multiplexing them, so rr does not get correct values (e.g. it often fails already during startup with "Got 0 branch events, expected at least 500").

I filed #2421 to improve detecting this case, also when this happens during the runtime of rr (and not just during startup).

In general, do you have any ideas how to work around this problem?

Thanks!
Tobi

The text was updated successfully, but these errors were encountered:

khuey · 2019-12-10T18:56:24Z

I'm not familiar with the way the kernel PMU multiplexes the perf counters. Is it "overcommitting" them by allowing perf_event_open to succeed even if there's no hardware perf counter available? That would be very bad IMO.

Are your other jobs usage of hardware perf counters not limited to specific processes?

khuey · 2019-12-10T19:07:39Z

Ok I read perf_event_open's man page again. In general there's no way for rr to survive a situation where the performance counters are multiplexed.

tbosch · 2019-12-10T20:00:25Z

Could rr use instructions-retired counter to measure/replay each timeslice instead of retired-conditional-branches? AFAIK, that would be a fixed-function counter, so it should always be available regardless of multiplexing.

khuey · 2019-12-10T21:52:22Z

Instructions-retired itself is not deterministic (because an instruction that triggers a page fault is counted as executed twice, and page faults are non deterministic) and the page fault performance counter is not available as a fixed function counter so I don't think that will help.

tbosch · 2019-12-12T00:39:27Z

Thanks for the answers!

rocallahan · 2019-12-14T03:31:40Z

We are seeing failures for rr because some of the other jobs are also using hw counters, and the kernel is multiplexing them,

So FWIW it's not clear to how other jobs could force multiplexing to happen for rr's recorded processes, unless those jobs install cpu-global counters (or specifically attach counters to rr's recorded processes, which would be crazy).

Turns out, however, that perf_event_open does let us disable multiplexing of our counters, by setting the pinned flag. So I added that in 3edf7ed.

My understanding is that with that change, our counters will take priority of cpu-global non-pinned counters, which might help in your situation. They should only be pushed off the PMU by cpu-global pinned counters. Hopefully your jobs aren't using those.

tbosch mentioned this issue Dec 10, 2019

Add docs about the perf event ids. #2420

Merged

tbosch closed this as completed Dec 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to handle hw counter multiplexing case? #2422

How to handle hw counter multiplexing case? #2422

tbosch commented Dec 10, 2019

khuey commented Dec 10, 2019

khuey commented Dec 10, 2019

tbosch commented Dec 10, 2019

khuey commented Dec 10, 2019 •

edited

Loading

tbosch commented Dec 12, 2019

rocallahan commented Dec 14, 2019

How to handle hw counter multiplexing case? #2422

How to handle hw counter multiplexing case? #2422

Comments

tbosch commented Dec 10, 2019

khuey commented Dec 10, 2019

khuey commented Dec 10, 2019

tbosch commented Dec 10, 2019

khuey commented Dec 10, 2019 • edited Loading

tbosch commented Dec 12, 2019

rocallahan commented Dec 14, 2019

khuey commented Dec 10, 2019 •

edited

Loading