Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

1014 - Optimizing the Sync Command #1018

Merged
merged 8 commits into from
Dec 9, 2024

Conversation

avrohomgottlieb
Copy link
Contributor

@avrohomgottlieb avrohomgottlieb commented Dec 8, 2024

Issue Number

Closes #1014

Purpose/Implementation Notes

This PR optimizes our usage of the aws s3 sync command, which had previously been a bottleneck with some projects during computed file generation. With projects that have many files, requesting a large subset of those files via the --include option causes the execution to slow and appear to hang, while it iteratively searches the entire file space for each individual file mentioned within an --include statement. Our solution was to make searches more granular by chopping up the file space, resulting in significantly quicker searches.

Types of changes

What types of changes does your code introduce?

  • Bugfix (non-breaking change which fixes an issue)

Functional tests

N/A

Checklist

  • Lint and unit tests pass locally with my changes

Screenshots

N/A

Copy link
Contributor

@davidsmejia davidsmejia left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Member

@nozomione nozomione left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@avrohomgottlieb avrohomgottlieb merged commit 770fb4b into dev Dec 9, 2024
5 checks passed
@avrohomgottlieb avrohomgottlieb deleted the avrohom/1014-optimizing-sync-command branch December 9, 2024 17:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Optimizing the sync command to fix spatial
3 participants