bayesian-inference

Jupyter notebooks and data for the publication "Bayesian Inference for Integrating Yarrowia lipolytica Multi-omics Datasets with Metabolic Modeling".

What is in this Repository?

This repository contains all the Jupyter notebooks and code needed to replicate the results in our paper. It also serves as a starting point for extending this work to your own models.

What you will need

A conda environment is provided in this repository for easier reproducibility. Few major packages are required, but the code depends on the Ensemble Modeling with Linear-Logarithmic Kinetics (emll) source, which you can either install independently or obtain through the provided environment.

Conda Environment

To install the working environment, run the following command:

conda env create -f environment.yml

followed by

source activate BMI

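The specific packages required are listed in the environment.yml file. On conda 4.4 and later, conda activate BMI is the recommended equivalent of source activate BMI. Once the environment is active, you are ready to run the provided notebooks.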

Jupyter Notebook

There is one primary Jupyter notebook that walks through our entire Bayesian workflow. To use it with Jupyter Notebook or JupyterLab, first register the BMI environment as a Jupyter kernel:

python -m ipykernel install --user --name=BMI

If this doesn't work, try reinstalling the ipykernel package:

conda install -c anaconda ipykernel

Once done, make sure to set the kernel to BMI before running any of the code.
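A quick way to confirm that a notebook is really running on the BMI kernel is to inspect the interpreter prefix and check whether emll can be found. The following is a minimal sketch, not part of this repository: the helper name check_environment is hypothetical, and it assumes the emll source is importable under the module name emll.

```python
import importlib.util
import sys


def check_environment(env_name="BMI", required=("emll",)):
    """Report whether the interpreter looks like `env_name` and which modules are found.

    Hypothetical helper for illustration; not part of this repository.
    """
    # Conda environments normally live in a directory named after the
    # environment, so the name usually appears at the end of sys.prefix.
    prefix = sys.prefix.replace("\\", "/").rstrip("/")
    in_env = prefix.endswith("/" + env_name)
    # find_spec returns None when a top-level module cannot be located.
    found = {name: importlib.util.find_spec(name) is not None for name in required}
    return {"prefix": sys.prefix, "in_env": in_env, "modules": found}
```

Running check_environment() in the first notebook cell returns a small dictionary. If in_env is False or emll is reported as missing, the kernel is probably not set to BMI.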

Multi-Omics Data

This repository includes the results of the multi-omics analysis used for this project. The Omics_Data folder is laid out as follows:

Omics_Data
├── 9_strain_data
│   ├── Fluxomics
│   ├── Metabolomics
│   ├── Proteomics
│   └── Transcriptomics
└── 23_strain_data
    ├── Fluxomics
    ├── Metabolomics
    ├── Proteomics
    └── Transcriptomics

Both sample sets (9_strain_data and 23_strain_data) are used in this work, but not every file in these folders is used in every run of the code pipeline. Some files are therefore unnecessary for reproducing the results.
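To see which files are present for each sample set and omics type, the folder layout above can be walked with pathlib. This is a minimal sketch under the assumption that it is run from the repository root; the function name list_omics_files is hypothetical and does not appear in the notebooks.

```python
from pathlib import Path

# Omics subfolders as shown in the layout above.
OMICS_TYPES = ("Fluxomics", "Metabolomics", "Proteomics", "Transcriptomics")


def list_omics_files(root="Omics_Data"):
    """Map each (sample set, omics type) pair to the sorted file names it contains.

    Hypothetical helper for illustration; not part of this repository.
    """
    root = Path(root)
    inventory = {}
    # Each visible subfolder of the root is one sample set, e.g. 9_strain_data.
    strain_dirs = sorted(
        p for p in root.iterdir() if p.is_dir() and not p.name.startswith(".")
    )
    for strain_dir in strain_dirs:
        for omics in OMICS_TYPES:
            folder = strain_dir / omics
            # A missing folder is recorded as an empty list rather than raising.
            if folder.is_dir():
                files = sorted(f.name for f in folder.iterdir() if f.is_file())
            else:
                files = []
            inventory[(strain_dir.name, omics)] = files
    return inventory
```

For example, list_omics_files()[("9_strain_data", "Proteomics")] gives the proteomics file names for the 9-strain set, which makes it easy to compare against the files a given notebook run actually reads.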

Genome Scale Model

To carry out our Bayesian analysis, we need a genome-scale model of the host organism and a model of central metabolism derived from that larger model. Both can be found in the General_Information folder:

General_Information
├── Pathways
└── YLita649
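This README does not state which file format the model in YLita649 uses or which library the notebook loads it with. As a hedged sketch only, the snippet below searches the folder for files with common metabolic-model extensions and pairs each one with the COBRApy reader that handles that format. Using COBRApy here is an assumption, and find_model_files is a hypothetical helper, not the authors' code.

```python
from pathlib import Path

# Assumption: common model file suffixes and the cobra.io function that reads each.
COBRA_READERS = {
    ".xml": "read_sbml_model",
    ".sbml": "read_sbml_model",
    ".json": "load_json_model",
    ".mat": "load_matlab_model",
    ".yml": "load_yaml_model",
    ".yaml": "load_yaml_model",
}


def find_model_files(folder="General_Information/YLita649"):
    """Return (path, reader_name) pairs for files that look like metabolic models.

    Hypothetical helper for illustration; not part of this repository.
    """
    hits = []
    # rglob("*") visits every file and folder beneath the starting folder.
    for path in sorted(Path(folder).rglob("*")):
        reader = COBRA_READERS.get(path.suffix.lower())
        if path.is_file() and reader is not None:
            hits.append((path, reader))
    return hits
```

With COBRApy installed, a matching file could then be loaded with getattr(cobra.io, reader)(str(path)). The notebook itself remains the authoritative reference for how the models are actually read.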
