Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make download more robust to ensure we get max dataset coverage #12

Open
maryamariyan opened this issue Nov 4, 2020 · 0 comments
Open
Milestone

Comments

@maryamariyan
Copy link
Member

maryamariyan commented Nov 4, 2020

Previously we used to sacrifice download time in order to get as many issue/PR data as possible.

Later we refactored the download code, so that the queries become faster while sacrificing a cut on the number of issues/PRs that get downloaded. e.g. when doing bulk queries, if the query contains a faulted PR or issue to download then the whole query fails.
For example: dotnet/runtime#203 is one example of a faulty PR.

To accommodate this, let's provide an option: to have a choice to either get a faster bulk download or a larger coverage of all possible issues.

Currently the default is set to do a fast download.
To change that would need to change this flag:

@maryamariyan maryamariyan added this to the Future milestone Apr 13, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant