-
Notifications
You must be signed in to change notification settings - Fork 589
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug: stuck in recovery simulation test #15296
Comments
Note that re-run doesn't help and the recovery simulation test is always stuck when running with |
Previously fixed issue: #14104. The fix should be already in 1.7. But this bug could be due to similar reasons. |
The log indeed stops, indicating that the cluster is stuck or very slow. I cannot reproduce it locally though. |
We can use this branch for repro: https://github.com/risingwavelabs/risingwave/commits/patrick/recovery-stuck-repro/ |
It's a deadlock somewhere in
I gave it 15 mins more to execute here: https://github.com/risingwavelabs/risingwave/pull/15512/files. Notice the
Something is blocking it from completion. |
Another issue that could be related: #14910 |
Nope I just logged it, since I suspect deadlock due to drop 🤣 Any tools to share, perhaps we should add some to our toolbox. |
Describe the bug
In the CI run of #15262 (cherry-pick #15260), the recovery simulation test is stuck when running batch e2e with
MADSIM_TEST_SEED=5
.Because #15260 can pass all CI tests and can be successfully while the release-1.7 cherry-pick PR #15262 is stuck in recovery simulation test, after a discussion with @wangrunji0408 , we believe that the PR triggers a bug that is not related to the PR changes themselves.
Error message/log
To Reproduce
No response
Expected behavior
No response
How did you deploy RisingWave?
No response
The version of RisingWave
git commit: 14531cd
Additional context
No response
The text was updated successfully, but these errors were encountered: