feat: retry delay strategy #871

krpeacock · 2024-04-02T18:39:29Z

Description

Developers have noticed the agent is more frequently erroring with the new watermark protections against replay attacks / stale data. This feature adds a delay strategy for retries that will allow for more time for nodes to catch up, with exponential increases to the rate

Fixes SDK-1562

How Has This Been Tested?

new e2e tests

Checklist:

My changes follow the guidelines in CONTRIBUTING.md.
The title of this PR complies with Conventional Commits.
I have edited the CHANGELOG accordingly.
I have made corresponding changes to the documentation.

github-actions · 2024-04-02T18:43:31Z

size-limit report 📦

Path	Size
@dfinity/agent	85.62 KB (+0.93% 🔺)
@dfinity/candid	13.58 KB (0%)
@dfinity/principal	4.97 KB (0%)
@dfinity/auth-client	60.71 KB (+1.4% 🔺)
@dfinity/assets	80.24 KB (+0.88% 🔺)
@dfinity/identity	57.92 KB (+1.34% 🔺)
@dfinity/identity-secp256k1	265.65 KB (+0.2% 🔺)

packages/agent/src/agent/http/index.ts

packages/agent/src/polling/strategy.ts

ByronBecker

Exponential backoff with jitter should work fine for most cases, but assuming no jitter in the case of 3 retries that's a call at 0, 150ms, 600ms, and 1200ms (am I correct here?)

It would be nice to see if this covers every case, or if there would still be a super small chance that this error might get hit (even with the exponential backoff).

Timestamp failed to pass the watermark after retrying the configured 3 times. We cannot guarantee the integrity of the response since it could be a replay attack.

If this error got hit with the exponential backoff in place, what would that mean for the FE app? Is the subnet down or behind? Did we just get super unlucky? Does the same error message make sense now that the exponential backoff is in place (i.e. does this error have a different meaning)?

Ideally, this error would never happen by accident, or even if the FE client user has a poor internet connection. Then if they receive the error it would actually suggest something is not healthy with the nodes/subnet they've communicated with.

Putting a higher threshold in by using exponential backoff and hoping that this is a rare case is just going to confuse developers, and especially confuse end users in the even this error is hit and a toast component pops up with "Timestamp failed to pass the watermark after retrying the configured 3 times. We cannot guarantee the integrity of the response since it could be a replay attack" (in fact, this sort of sounds like security issue).

packages/agent/src/agent/http/index.ts

dfx-json

LGTM

I adopted Eric's suggestions

feat: retry delay strategy

2e5ba5a

krpeacock requested a review from a team as a code owner April 2, 2024 18:39

ericswanson-dfinity previously requested changes Apr 2, 2024

View reviewed changes

packages/agent/src/agent/http/index.ts Outdated Show resolved Hide resolved

packages/agent/src/polling/strategy.ts Outdated Show resolved Hide resolved

krpeacock added 2 commits April 2, 2024 14:43

renaming backoffStrategy

5f44d41

Merge branch 'main' into kai/SDK-1562-watermark-retry-delay

3e8e5ad

ByronBecker reviewed Apr 2, 2024

View reviewed changes

peterpeterparker mentioned this pull request Apr 3, 2024

build: agent-js v1.2.1 dfinity/ic-js#593

Merged

krpeacock added 8 commits April 5, 2024 16:06

wip

5d04768

unit tests passing

55c0134

e2e tests aren't finishing

643a650

wip

d06103d

Merge branch 'main' into kai/SDK-1562-watermark-retry-delay

58163b2

fixes to retry strategy (no delay on first try)

bf094c2

same null backoff strategy for queries as calls, clean up

62badfc

more cleanup

c149638

krpeacock requested review from ericswanson-dfinity and ByronBecker April 30, 2024 19:52

Merge branch 'main' into kai/SDK-1562-watermark-retry-delay

167667d

dfx-json reviewed Apr 30, 2024

View reviewed changes

packages/agent/src/agent/http/index.ts Outdated Show resolved Hide resolved

dfx-json approved these changes Apr 30, 2024

View reviewed changes

Update packages/agent/src/agent/http/index.ts

36a51ec

krpeacock merged commit a38f3d5 into main May 1, 2024
16 checks passed

krpeacock deleted the kai/SDK-1562-watermark-retry-delay branch May 1, 2024 21:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: retry delay strategy #871

feat: retry delay strategy #871

krpeacock commented Apr 2, 2024 •

edited

Loading

github-actions bot commented Apr 2, 2024 •

edited

Loading

ByronBecker left a comment •

edited

Loading

dfx-json left a comment

feat: retry delay strategy #871

feat: retry delay strategy #871

Conversation

krpeacock commented Apr 2, 2024 • edited Loading

Description

How Has This Been Tested?

Checklist:

github-actions bot commented Apr 2, 2024 • edited Loading

size-limit report 📦

ByronBecker left a comment • edited Loading

Choose a reason for hiding this comment

dfx-json left a comment

Choose a reason for hiding this comment

krpeacock commented Apr 2, 2024 •

edited

Loading

github-actions bot commented Apr 2, 2024 •

edited

Loading

ByronBecker left a comment •

edited

Loading