-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Set BUILDKITE_CANCEL_SIGNAL to SIGQUIT for build and test #366
base: main
Are you sure you want to change the base?
Conversation
5131f92
to
8f01134
Compare
Observe here that I intentionally canceled this build, so that it would print SIGQUIT (which it successfully did). Though looks like no upload task then was able to get the core files (the timeout for uploading the core files after cancellation occurs is set to 10 seconds by default, unless changed in the worker configuration) |
f0e4bc8
to
94cdb9d
Compare
This comment was marked as outdated.
This comment was marked as outdated.
e6713e5
to
67d5efc
Compare
Looks to me like it's still not working. |
Does it need an updated copy of the sandbox binary with JuliaContainerization/Sandbox.jl@269e91f? |
This seems to indicate the running version is quite old? |
e671766
to
3f8451c
Compare
Indeed, there were more layers of sandboxing than I remembered :P |
well that is entertaining on musl:
|
Aha, my extra debugging information coming in handy! Here's a fix for that minor issue: JuliaCI/rootfs-images#252 |
I tried canceling one of the test jobs, and it looks to me like the new sandbox executable segfaulted: https://buildkite.com/julialang/julia-buildkite/builds/1577#01904c26-78c2-4120-bdcc-d01a439b5bfa/1373-1378 |
It seems more likely to me that the child segfaulted and then the sandbox killed itself with the same exit code. Seems like the SIGQUIT got propagated though, which looked pretty good. Not sure why it then segfaulted and did not coredump, but maybe that is a quirk of rr? Or is the ulimit -c potentially disabled? |
julia-buildkite/utilities/test_julia.sh Line 141 in dea1b33
|
But this is https://github.com/JuliaCI/julia-buildkite/blob/dea1b33fba992d460ffae69bce54f598330fb0d3/utilities/build_julia.sh, so is that yes? |
The segfault I linked was a test job. |
Right, but it was an rr job too |
Right, so for |
Would it hurt to just upload core dumps unconditionally (for both |
|
It's just wasted compute time and storage space. There's really no information in a coredump that an |
@staticfloat can you change DROPME: Use sandbox#main to use |
53b4920
to
ee38857
Compare
Looks like sandbox needs that branch pushed to correspond to the release tags? And a rebase here |
b2a61b6
to
8423537
Compare
8423537
to
8e93d18
Compare
8e93d18
to
e78acba
Compare
Okay, looks very close, but didn't get enough time to upload the coredump reports for the exact failing test we hope this PR will help with. But it did generate them when canceling (though didn't generate them when killing due to timeout). This will likely require JuliaCI/coreupload-buildkite-plugin#14 and fixing the JuliaCI/coreupload branch name in this PR |
4c09738
to
a5c6431
Compare
a5c6431
to
2948146
Compare
Needs resigning, and merge? |
This should get us `zstd`, for core uploading
2948146
to
b1e8881
Compare
No description provided.