Simulations and benchmarking for Debelius et al. Simulation notebooks are designed to run in a series: simulating data, calculating emperical power, and then calculating power based on a fit effect size. The analysis notebook demonstrates the application of the method.
The recommended way to install this repository is using the conda pacakge manager through miniconda.
Dependencies can be installed using the conda_environment.txt and pip_requirements.txt file.
$ conda create --name power --file conda_enviroment.txt
$ source activate power
$ pip install -r requirements.txt
$ pip install git+https://github.com/jwdebelius/monte_carlo_power --no-deps
The method is validated and benchmarked using simulated data. The method has been analyzed using four common types of parametric data, and two permutative tests for distance matrices.
All simulated data can be downloaded from [FTP] address. This includes all simulated data, emperically calculated power, and emperical summary files.
The simulated data should be place in the ipynb
directory. From the parent repository directory, it can be downloaded as
wget [ftp]
tar -czf simulations.tgz
1-Build Simulations: builds simulated data parametric tests and distance matrices for permuatitive tests
2-Power for Parametric Distributions: calculates emperical power and distribution-based power for parametric distributions
3-Power for distance permutations: calculates power for emperical for permutative tests
4-Comparisons of Power Calculations: fits the emperical power curves, and compares the performance of the distribution-based method, emperical method, and fit method
power.py
and test_power.py
are modified from scikit-bio 0.5.0. The code is relased under a BSD-2.0 license; copyright (c) 2013 scikit-bio development team.
These were written and modified by Justine Debelius (@jwdebelius), Greg Caporaso (@gregcaporaso), Jai Ram Rideout (@jairideout), Evan Bolyen (@eboylen), and Vivek Rai (@vivekitkgp).