This repository has been archived by the owner on Jun 9, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 44
Timeout blocking sync #97
Comments
OK - looks like sigkill kills the rsync parent. Here's the processes after restarting:
Then after 20 mins:
Watching strace for those 3 procs I see 10478 get killed:
If I kill 10584 with SIGTERM, things get moving again (then it tries rsyncing cc.rg.net again and fails 👎) This is reproducible on a test machine, so am happy to do debugging and such. For prod I think I'll need a script to kill off these rg.net rsyncs.. |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Hi,
We're running octorpki on a couple of servers, and have found they are both getting hung up on a particular repo with network timeouts, and it stops processing of future requests.
We've got.. some sort of network issue reaching this repo, as it works from other places. For some reason our requests just time out every now and then. Regardless I would want octorpki to gracefully handle that sort of thing - there seems to be a lot of half baked rpki repos so I guess we want to be pretty robust dealing with those.
Below are some logs and relevant processes running. Notable is the rsync zombie process, I wonder if there's something odd going on here where rsync is forking unexpectedly so rsync timeout doesn't work like we'd hope, or something.
If I kill the rsync process (not the zombie) it moves on.
We're on centos 7.9, and I've included the rsync and octorpki versions below.
It looks like we should be upgrading to 1.3.0 anyway, so I'll get to doing that, but I don't see any changes which would be related to this issue.
The text was updated successfully, but these errors were encountered: