Actions stuck in scheduled for 15 minutes #6165
Replies: 5 comments
-
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Seems my theory of it being the schedulers/action runners fell in inactive after long periods of no jobs is incorrect. I created a job that runs every 30 minutes and twice in the last 24 hours this job that usually takes 1 second, took 15minutes |
Beta Was this translation helpful? Give feedback.
-
Upon trying a newer version of RMQ cluster the issue has disappeared. The new cluster that has run 4 days without issues is RMQ: 3.11.13 and Erlang: 25.3 |
Beta Was this translation helpful? Give feedback.
-
Good to know you found the root cause @skiedude and thank you for sharing the solution to resolve the issue 👍 It may help others in the future who encounter the same problem. |
Beta Was this translation helpful? Give feedback.
-
We are running Stackstorm on a Rocky8 VM.
The VM has 126gb of Memory, multiple cores. Server load never sits above .2
We are using a remote MongoDB cluster, and a remote RMQ cluster, and local Redis. Using the default of 10 action runners.
We have on a few occasions observed how actions are stuck in a
scheduled
state. (not always)Looking at the RMQ cluster, I can see the one item in queue
st2.actionrunner.work 1
Then exactly 15 minutes later it picks it up and runs it
Another instance of this from a few days ago shows another exact 15 minute break before running the action
Is there something that runs every 15 minutes to check for stuck jobs?
Beta Was this translation helpful? Give feedback.
All reactions