Problem
Here's another concern I have about v1.17.

The accounts hash interval was used to determine the health of a node, which in turn is used by `solana-validator wait-for-restart-window`, a command operators rely on heavily during updates.

A node is considered unhealthy when the difference between the highest accounts hash slot from known validators and the one we've sent in gossip is greater than the health check slot distance, which is just 150 by default.

So, considering that the accounts hash interval is now essentially equal to the incremental snapshot interval, it may be worth setting the health check slot distance (if it's not specified) to accounts hash interval + 50. The magic 50 is there only because the default health check slot distance is currently 150 and the default incremental snapshot archive slots value is 100.

Alternatively, it might be worth simply removing the health check from `wait-for-restart-window`.

Otherwise, whenever the incremental snapshot interval is raised above the health check slot distance, the node will almost always be unhealthy, and `wait-for-restart-window` will not complete. Of course, I understand that if an operator overrides the incremental snapshot interval, they can also override the health check slot distance. However, nearly every operator who changes the interval will have to do this, so it seems more convenient to make it the default behaviour and reduce the number of misconfigurations.
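
To make the proposal concrete, here is a minimal sketch of the check as described in this issue and of the suggested default. It is not the validator's actual code: the function names, parameter names, and constants are hypothetical and only mirror the numbers quoted above (150, 100, and the margin of 50).

```python
from typing import Optional

# Values quoted in the issue; the constant names are made up for this sketch.
DEFAULT_HEALTH_CHECK_SLOT_DISTANCE = 150
DEFAULT_INCREMENTAL_SNAPSHOT_ARCHIVE_SLOTS = 100
MARGIN_SLOTS = 50  # the "magic 50"


def is_healthy(known_validators_slot: int, own_slot: int, slot_distance: int) -> bool:
    """Unhealthy only when the node lags known validators by more than slot_distance."""
    return known_validators_slot - own_slot <= slot_distance


def effective_slot_distance(
    accounts_hash_interval_slots: int, configured: Optional[int] = None
) -> int:
    """Proposed default: derive the distance from the interval unless the operator set one."""
    if configured is not None:
        return configured
    return accounts_hash_interval_slots + MARGIN_SLOTS


if __name__ == "__main__":
    # With the default interval the derived distance matches today's default of 150.
    print(effective_slot_distance(DEFAULT_INCREMENTAL_SNAPSHOT_ARCHIVE_SLOTS))

    # Hypothetical operator override: interval of 500 slots, node lagging by 400.
    print(is_healthy(1000, 600, DEFAULT_HEALTH_CHECK_SLOT_DISTANCE))  # fixed distance
    print(is_healthy(1000, 600, effective_slot_distance(500)))        # derived distance
```

With a fixed distance of 150, a node whose accounts hash is published only every 500 slots can lag by far more than 150 slots for most of each interval, which is why the derived default keeps it healthy while the fixed default does not.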