Jupyter Notebook and data for the publication Bayesian Inference for Integrating Yarrowia lipolytica Multi-omics Datasets with Metabolic Modeling
This repository contains all the necessary Jupyter Notebooks and code to replicate the results obtained in our paper. This also serves as a jumping off point for expanding the work done here to your own models if necessary.
A conda environment is provided in this repository for the sake of easier reproducability. There are not many major packages required to run the code, but the code requires Ensemble Modeling with Linear-Logarithmic Kinetics (emll) source which you can either install independently or download through the given environment.
To install the working environment type the following commands
conda env create -f environment.yml
followed by
source activate BMI
You can find the specific packages required in the environment.yml file. This will get you up and running with the notebooks provided.
There is one primary Jupyter notebook designed to walk through the entire process of our Bayesian workflow. You will need to add the BMI environment to Jupyter in order to use it with either Jupyter Notebook/Lab
python -m ipykernel install --user --name=BMI
if this doesn't work, try reinstalling the ipykernel package like this:
conda install -c anaconda ipykernel
Once done, make sure to set the kernel to BMI before running any of the code.
In this repository you will find the results of the multi-omics analysis used to complete this project. The *Omics_Data folder is laid out in this fashion:
Omics_Data
├── 9_strain_data
│ ├── Fluxomics
│ ├── Metabolomics
│ ├── Proteomics
│ └── Transcriptomics
├── 23_strain_data
│ ├── Fluxomics
│ ├── Metabolomics
│ ├── Proteomics
│ └── Transcriptomics
Both sample sets 9 and 24 are used in this work, but not all files in these folders are used every run of the code pipeline. Therefore, some files are unneccesary to reproduce the results.
In order to carry out our Bayesian analysis, we need a genome scale model of the host organism of choice and also a model for the central metabolism derived from the larger model. These pieces of information can be found under the General_Information folder.
General_Information
├── Pathways
└── YLita649