Replies: 1 comment 1 reply
-
Have you been tracking what data source teams use at all? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
After chatting with a few prospective modeling teams, I wanted to clarify what data we are anticipating teams will use to generate their nowcasts for this hub.
After some discussion, we have decided not to host the datasets needed in the hub repo itself because then we’re in the business of data curation, and people will start to rely on our pipelines in ways that we are not prepared to support. There already are some good pipelines for these data, and we don't want to create duplicative data stores.
That said, here are the options, current as of Monday, October 7, 2024:
We note that the NextStrain files above may use slightly different methods for filtering which sequences are aggregated and creating the counts, so it is possible that these data may not be exactly the same as the ones that our pipeline will eventually create for evaluation, but we expect differences (if any) to be minimal.
Beta Was this translation helpful? Give feedback.
All reactions