memory leak on status update #1245

trws · 2024-07-15T15:39:17Z

The json_t * variables R_all, R_down and R_alloc in status_request_cb are only freed by json_decref on the error path. The caches in the ctx (m_r_{all,down,alloc}) are never freed at all. Found this thanks to a 10k leak heaptrack found on one of our drained resource tests, tiny in the test, but could get larger over time if there are a lot of requests to this RPC, or a lot of resource updates, or especially both.

Plan:

add a lifecycle management type for json_t so this can't happen by accident anymore, either a smart pointer wrapper with json_decref as the free function or an actual functional wrapper like the janssoncpp headers provide
add the same for flux-core with __attribute__((cleanup)) so we can be safer about all of this

This is a priority bugfix, if I can get it together today I will, and we should try to get it deployed ASAP.

The text was updated successfully, but these errors were encountered:

milroy · 2024-07-15T20:42:15Z

I don't think the sched-fluxion-resource.status RPC is used anymore, since I believe all the functionality is handled by core now.

The caches in the ctx (m_r_{all,down,alloc}) are never freed at all.

The member data is updated in status_request_cb whenever the values change or a configured amount of time has elapsed, but yeah, they aren't freed when Fluxion is stopped.

trws · 2024-07-15T22:41:08Z

Mark mentioned the same thing, and I agree it shouldn't be (at least not often) but it shows up in my recent heaptrack leak report. Will check again, try to get more detail here.

garlick · 2024-07-15T22:50:26Z

It's not used by default anymore, but we can check the scheduler's view of resources by running e.g.

FLUX_RESOURCE_LIST_RPC=sched.resource-status flux resource list

which can be handy. I'd say don't get rid of it :-)

trws · 2024-07-15T22:53:18Z

Yup, not planning to get rid of it, do think we need to fix the leak though, and rather wondering what's poking it.

trws self-assigned this Jul 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

memory leak on status update #1245

memory leak on status update #1245

trws commented Jul 15, 2024

milroy commented Jul 15, 2024

trws commented Jul 15, 2024

garlick commented Jul 15, 2024

trws commented Jul 15, 2024

memory leak on status update #1245

memory leak on status update #1245

Comments

trws commented Jul 15, 2024

milroy commented Jul 15, 2024

trws commented Jul 15, 2024

garlick commented Jul 15, 2024

trws commented Jul 15, 2024