Name	Name	Last commit message	Last commit date
parent directory ..
scripts	scripts
README.md	README.md
environment.yml	environment.yml

Conda environment and data download

Before you start

If you want to run scripts/notebooks from PhenoPLIER, you need to have a conda environment. For that, you basically have two options: 1) create a local conda environment in your computer (explained here), or 2) use our Docker image (where you don't need to create a conda environment). We strongly recommend using our Docker image (see main README.md), which will greatly simplify running the code and make sure you use the same environment for the analyses (for example, if you are willing to reproduce results in our manuscript).

Environment to reproduce manuscript analyses

Keep in mind that the conda environment specification for PhenoPLIER changes over time as new analyzes are performed. Therefore, if you want to reproduce the analyses of the PhenoPLIER manuscript, you need to check out the latest v1.x.x version. If you are using Docker, then use the tag latest when referencing the image or do not specify it (latest is assumed by default, such as docker pull miltondp/phenoplier).

Steps to prepare environment

Below we explain how to create a local conda environment and download the necessary data.

Install Miniconda or Anaconda.
Open a terminal, run cd environment from the phenoplier folder repo.

(optional) Adjust your environment variables:

# (optional, will default to subfolder 'phenoplier' under the system's temporary directory)
# Root directory where all data will be downloaded to
export PHENOPLIER_ROOT_DIR=/tmp/phenoplier

# (optional, will default to 1 core)
# Adjust the number of cores available for general tasks
export PHENOPLIER_N_JOBS=2

# (optional)
# Export this variable if you downloaded the manuscript sources and want to
# generate the figures for it
export PHENOPLIER_MANUSCRIPT_DIR=/tmp/manuscript_dir

(optional) Adjust other settings (i.e. root directory, available computational resources, etc.) by modifying the file ../libs/settings.py
Adjust your PYTHONPATH variable to include the libs directory:
```
export PYTHONPATH=`readlink -f ../libs/`:$PYTHONPATH
```
readlink might not work on macOS. In that case, simply replace it with the absolute path to the ../libs/ folder.

Create a conda environment and install main packages:

conda config --set channel_priority strict
conda env create --name phenoplier --file environment.yml
conda run -n phenoplier --no-capture-output bash scripts/install_other_packages.sh

Download the data:

conda run -n phenoplier --no-capture-output python scripts/setup_data.py

This will download ~130 GB of data and software needed to run the analyses.

That's it! Now you should be able to continue and run the code. Check out the main README.md file for instructions on how to run the code.

Instructions for developers

You very likely do not need to follow these steps, unless you are a developer working on PhenoPLIER.

All steps are run from the root directory (not within environment/).

It is a good idea to try to build the environment locally first and, when all issues have been solved, then create the Docker image. A usual problem is to use a too recent Python version that produces several conflicts in conda. In that case, a previous Python version should be used instead.

Modify environment/scripts/environment_base.yml accordingly (if needed). Usually, this involves updating to the latest Python and R versions.

(if creating a local environment) Run:

conda config --set channel_priority strict
conda env create --name phenoplier --file environment/scripts/environment_base.yml
conda run -n phenoplier --no-capture-output bash environment/scripts/install_other_packages.sh

(if creating a new Docker image) Run:

# override environment.yml temporarily to install the latest packages
cp environment/scripts/environment_base.yml environment/environment.yml

Now open scripts/create_docker_image.sh and change settings according to instructions in the file. Then run:

# IMPORTANT: the script below will build two images: base and final.
#  The base image will only be rebuilt if the version in settings (see
#  the script) is changed. If for some reason you want to force building the
#  the image (for example, you fix something in the Dockerfile), you have to
#  pass the following argument: -f

bash scripts/create_docker_image.sh

Make sure the image works (it should produce no output):

# change version below accordingly
export VERSION="2.0.0"

docker run --rm miltondp/phenoplier:${VERSION} python -c "import conf; assert hasattr(conf, 'GENERAL')"

Export conda environment:

# if creating a local environment:
conda env export --name phenoplier --file environment/environment.yml

# if creating a new Docker image:
bash scripts/run_docker_dev.sh conda env export --name phenoplier --file environment/environment.yml

Modify environment/environment.yml and leave only manually installed packages (not their dependencies).
(if creating a new Docker image) Push the new Docker images. See at the end of scripts/create_docker_image.sh for examples.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

environment

environment

README.md

Conda environment and data download

Before you start

Environment to reproduce manuscript analyses

Steps to prepare environment

Instructions for developers

Files

environment

Directory actions

More options

Directory actions

More options

Latest commit

History

environment

Folders and files

parent directory

README.md

Conda environment and data download

Before you start

Environment to reproduce manuscript analyses

Steps to prepare environment

Instructions for developers