-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Static #3
base: main
Are you sure you want to change the base?
Conversation
add first tests
Unit Tests Summary11 tests 11 ✅ 0s ⏱️ Results for commit 47c5f30. ♻️ This comment has been updated with latest results. |
refactor shared script for simulations
add internal function to fetch benchmark data
@danielinteractive @wlandau I am still adding a few more unit tests for the api, but the simulation scripts are ready to be looked at if you have time. I set up the target pipelines in a way that scenarios to generate various flavors of the data can be easily added by either us or other users. The data from {mmrm} is currently being stored in inst/benchmark |
refactor pkgdown.yml
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks @yonicd ! cool stuff
DESCRIPTION
Outdated
Version: 0.0.1 | ||
Date: 2023-11-09 | ||
Authors@R: person("Jonathan","Sidi" , , "[email protected]", role = c("aut", "cre")) | ||
Description: Open clinical data for openpharma packages. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Description: Open clinical data for openpharma packages. | |
Description: Open clinical data for use in R packages. |
make it more general?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How about lean more into the idea that the package is a vehicle to transport reproducible simulations of clinical trial data? That has more potential to gain traction i think
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What about non-simulated datasets from real trials? From our original discussion that led to this package, I think that would also be valuable for scope because it removes the burden from packages like mmrm
and brms.mmrm
from having to prove that the data are publicly available. Also, it is helpful to test on datasets which were not explicitly generated from any model we have. We would get a better sense of robustness and flexibility, especially under slight misspecification.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
From the work on the package I have come to understand those weren't "real" data sets. They were realizations of simulated script. But I am to keep the option for real data if we come across it. Those datasets are being stored in the package (inst/benchmark) to allow for mmrm/brms.mmrm to depend on it.
But I think in our last conversation in the group we talked about brms.mmrm can just use a seed on the simulation script from clindata simulated set and detach from the mmrm static dataset.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The main issue of storing real data iirc was the scalability of the size of the package. This led to the caching api.
@@ -1,188 +1,48 @@ | |||
# r.pkg.template |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
recommend to have a source README.Rmd
and then knit that that README.md
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Part of the documentation that I need to add for usage. I will convert to Rmd once there is something to render :)
expected <- "Hello, Tim" | ||
expect_identical(result, expected) | ||
}) | ||
testthat::describe("caching logic", { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
would recommend to load the testthat
package and then avoid the many prefixes with its name in the tests
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
old habits... adding ns to every fn as a reflex
@@ -0,0 +1,35 @@ | |||
testthat::describe("data api logic", { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
interesting why not use test_that()
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it is just a convention i am used to. i like it more for creating hierarchies in reporting. It lets me get a better reporting structure when tests fail
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks, good to know! looks like a nice idea
Interesting approach to generating fixed datasets with It's neat that you cover MCAR and MAR in |
@wlandau thanks. My initial intention was to make an example pipeline for people to "fork" and then add/extend their own simulation sets/designs. I thought that embedding it inside the package would create another hurdle for non pkg dev to join. Maybe the data agnostic fns can be put in the pkg? The adv that i can see working through targets is that it enforces structure and is scalable and efficient (but you know that ;)) |
Yeah, I think if there are functions that generate datasets and do not require any fixed clinical data to start with, then I think those would make sense to document and export as usual. |
move _targets into data-raw add _targets.yaml to accomodate non root deployment
clean up license update targets pipelines to depend on package
Pull Request
initial version of clindata package.
todo: