-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"GC error (probable corruption)" with Enzyme on Julia 1.11 #2070
Comments
Apparently it's non deterministic: I re-ran CI with the same commit and it didn't crash |
can you post a mwe? |
As stated:
Feel free to close the issue, but at least it's there if someone else encounters it |
Sure but even if you can’t reproduce it, the input code which triggered it is useful (perhaps reducing to the one in question on CI). |
The code that triggered the error in CI is here: https://github.com/JuliaDiff/DifferentiationInterface.jl/blob/gd/en11/DifferentiationInterface/test/Back/Enzyme/test.jl But I don't know how to narrow it down further. I tried running that whole file on my computer and it worked fine. It also worked on the second CI run (there are test failures but no crashes). Besides, the stack trace doesn't help: it says
but that line (31) doesn't contain anything error-worthy at all, it's not even autodiff testing: I'm very puzzled by all this |
yeah I don't trust debug information during GC errors. And fwiw the last several GC issues were found in julia proper so its very possible it's a bug in julia itself. In any case, I'd recommend taking your code in CI and removing code until it certainly succeeds. GC errors are often non deterministic so you may need to run a couple of times to see them. See if you can make a standalone version which fails [probably starting by either removing code on CI until it doesn't fail 10 times in a row], or perhaps seeing if running local tests with Pkg.test() triggers it. |
any luck here? |
Sorry, I didn't spend any time chasing this because there is no telling how many CI cycles I would need to run to find a nondeterministic bug that disappeared the second time. It's not on top of my priority list but if the bug shows up again organically I'll let you know. |
At the moment I'm not running the Enzyme tests on 1.11 at all because of #2071, so once that is fixed there will be more iterations of CI with that version |
okay going to close for now then since there's no code that can trigger this atm. Feel free to reopen when you have an erring code |
Congrats on the huge work for making Enzyme compatible with Julia 1.11!
DI's test suite manages to run much further, but it now hits a weird GC error and the process is aborted. This does not happen on Julia 1.10, using the same Enzyme version (v0.13.14).
Unfortunately I wasn't able to reproduce it locally. The tests do run on my computer, they fail but at least they don't crash my REPL.
CI log: https://github.com/JuliaDiff/DifferentiationInterface.jl/actions/runs/11720210493/job/32645063624 from the PR JuliaDiff/DifferentiationInterface.jl#615.
Stack trace:
Any idea what could be the cause?
Related:
The text was updated successfully, but these errors were encountered: