This repository has been archived by the owner on Jun 30, 2023. It is now read-only.
Replies: 1 comment
-
I like the idea of adding the S3 date pulled or commit date of the input data versions. It'd be great to add this discussion to the agenda for one of our weekly meetings as we get a better sense of the data version info we'll need. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Code version control is tracked by git.
Data version control is tracked within S3
We need to document which data versions are used with which model versions.
I'm thinking of tracking the following in a GitHub-shared spreadsheet that we fill out for each model run:
Date Model Run, Model Run Name, Description, Git Commit Date, Git Commit ID,
target
Name for Model, Path to a copy of_targets
folder after model run (R-readable inputs and outputs)Can also include S3 date pulled and any other info needed to know which data versions were used (I'm not yet sure what's needed).
Is this a good idea? Any other variables we should track?
Beta Was this translation helpful? Give feedback.
All reactions