[sled-agent] WIP pargs/pstack of oxide processes #7117

papertigers · 2024-11-20T19:47:55Z

No description provided.

leftwo · 2024-11-21T14:38:25Z

I know this is still a WIP, and I appreciate all the work you are doing here, but I want to suggest we move this work to a separate stand alone service instead of making it part of sled-agent.

For details, see https://rfd.shared.oxide.computer/rfd/0495, but here is my summary:

All the checks and work here require that sled-agent is running and not what is broken. For support-bundles themselves, this makes sense and without a running nexus, none of that framework can operate. My concern is that we also need to support the situation where:

Future us, where we don't have ssh access to sleds, or it becomes more difficult to do so.
Sled agent itself is what is broken.

If we take the good work here, and instead of putting it inside sled-agent, we put it in a stand alone health check service, we get the following:

sled-agent becomes a client who makes requests and still gets all the benefits of the service.
Another client tool, omdb, or something similar, can be run from the switch zone and also gather debugging data and not require sled-agent itself to be running.
If we are in a pre-rack-setup situation, we could still have a health check service that could be used for triage but not require sled-agent to be online.

My concern is that there is a bunch of code that we may end up wanting to move to another place, and if we wait too long, it becomes more entrenched and could be difficult to dislodge.

If we decide that we should keep this in sled-agent, then we need to update RFD 495 with the determination that we are not going to build a stand alone service and why we made that choice.

papertigers · 2024-12-13T20:15:23Z

Superseded by #7194

Taking @leftwo advice above we decided to put this stuff in a standalone sled-diagnostics crate.

[sled-agent] WIP pargs/pstack of oxide processes

85fb330

papertigers closed this Dec 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[sled-agent] WIP pargs/pstack of oxide processes #7117

[sled-agent] WIP pargs/pstack of oxide processes #7117

papertigers commented Nov 20, 2024

leftwo commented Nov 21, 2024

papertigers commented Dec 13, 2024

[sled-agent] WIP pargs/pstack of oxide processes #7117

[sled-agent] WIP pargs/pstack of oxide processes #7117

Conversation

papertigers commented Nov 20, 2024

leftwo commented Nov 21, 2024

papertigers commented Dec 13, 2024