feat: improve datanode snapshot creation #11396

jeremyletang · 2024-06-19T14:35:18Z

As of now, the snapshot are created in a sequential and blocking way in the datanode. This means that while a snapshot is being taken, no block can be processed.

The following approach is made:

the database is locked with a transaction
queries are generated
one by one the query are:
executed
the result piped into the file system
finally the lock is released, and later the files are added to ipfs.

The bottle neck here is that the results are being save on the fs as they arrive, which is unecessary and amount for 95% of the time spent snapshoting (and so blocking anything else).

To prevent this, we keep those results from the database in buffers, and only save them to file via a worker go routine.

datanode/networkhistory/service.go

As of now, the snapshot are created in a sequential and blocking way in the datanode. This means that while a snapshot is being taken, no block can be processed. The following approach is made: - the database is locked with a transaction - queries are generated - one by one the query are: - executed - the result piped into the file system - finally the lock is released, and later the files are added to ipfs. The bottle neck here is that the results are being save on the fs as they arrive, which is unecessary and amount for 95% of the time spent snapshoting (and so blocking anything else). To prevent this, we keep those results from the database in buffers, and only save them to file via a worker go routine. Cache buffer size in datanode snapshot. Signed-off-by: Jeremy Letang <[email protected]>

Signed-off-by: Jeremy Letang <[email protected]>

jeremyletang added no-changelog no-issue labels Jun 19, 2024

EVODelavega reviewed Jun 19, 2024

View reviewed changes

datanode/networkhistory/service.go Outdated Show resolved Hide resolved

jeremyletang force-pushed the feature/improve-datanode-snapshot-creation-time branch from 08f9a31 to 7b3aa5f Compare June 19, 2024 16:42

jeremyletang self-assigned this Jun 24, 2024

jeremyletang force-pushed the feature/improve-datanode-snapshot-creation-time branch 7 times, most recently from 9098e67 to 59b88d6 Compare June 25, 2024 10:03

jeremyletang force-pushed the feature/improve-datanode-snapshot-creation-time branch 4 times, most recently from 3978470 to 4e2961c Compare July 2, 2024 22:28

jeremyletang added 3 commits July 8, 2024 18:22

chore: debug and ssuch

9593fa4

Signed-off-by: Jeremy Letang <[email protected]>

chore wip

882fadd

Signed-off-by: Jeremy Letang <[email protected]>

jeremyletang force-pushed the feature/improve-datanode-snapshot-creation-time branch from 128f1b9 to 882fadd Compare July 9, 2024 09:51

JonRay15 modified the milestones: 🏛️ Colosseo, 🏛️🏛️Colosseo II Jul 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: improve datanode snapshot creation #11396

feat: improve datanode snapshot creation #11396

jeremyletang commented Jun 19, 2024

feat: improve datanode snapshot creation #11396

Are you sure you want to change the base?

feat: improve datanode snapshot creation #11396

Conversation

jeremyletang commented Jun 19, 2024