-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Automated retries? #502
Comments
I've added requested changes to the associated PR ( #503 ), but I'll add some thoughts here for good measure. @dylanmtaylor is very much correct that we sometimes have spurious failures (various causes which likely include network issues), and those result in the team needing to manually retry some runs of the workflow. As an example, a spurious failure will usually result in success for most of the matrix options, but one or two will fail. I do agree that automatically retrying certain steps of the workflow will be helpful. I identified the two most useful in my requested changes:
I've specifically requested that we do NOT auto-retry the most complex step, Build Image. The most common causes of failure here are legitimate, usually due to an upstream RPM dependency issue. The one spurious issue I do know of in Build Image is related to the In addition to all this, I'd really like to see these improvements in Hope that provides some context to any reader regarding my views on this topic. |
A helpful issue was filed with PR which will help address some spurious issues with the github actions workflows. That inspired me to improve the github-release-install.sh script such that it will more properly fail(retry) when http errors occur. Relates: ublue-os/main#502
A helpful issue was filed with PR which will help address some spurious issues with the github actions workflows. That inspired me to improve the github-release-install.sh script such that it will more properly fail(retry) when http errors occur. In addition, this includes an improvement to the script which allows installing specific tags, not just the latest release. Relates: #502
The addresses spurious failures of pulling our (very large) base images by pre-pulling them to the build runner before using the buildah action. Relates: #502
These steps are known to potentially fail due to environmental/infrastructure reasons. Retries helps builds succeed despite that. Relates: ublue-os/main#502
Actually, i think we should close this as "done" since we merged the PR at the top and have continued to add appropriate retry logic in various places throughout the project. |
I see that build actions sometimes fail. I think we should leverage the retry action on ublue builds with an attempt limit of 3.
https://github.com/marketplace/actions/retry-action
That way if it's a weird network issue it something we won't have a day without a new image.
The text was updated successfully, but these errors were encountered: