Skip to content

IPDGC x GP2 Hackathon 2021 Project - Pathway and cell type enrichment pipeline

Notifications You must be signed in to change notification settings

ipdgc/GP2-pathway-enrichment-pipeline

Repository files navigation


Logo

Pathway and cell type enrichment pipeline

One of the projects from the 2021 GP2/IPDGC Hackathon. The related manuscript can be found on [biorxiv](https://www.biorxiv.org/content/10.1101/2022.05.04.490670v1)
Contributers: Shameemah Abrahams, Inas Elsayed, Vassilene Iankova, Amica Muller, Zunej, Manuela Tan

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Acknowledgments

About The Project

Project Screen Shot

Quick Description

The goal for this project was to build a pipeline on Terra that uses public PD summary statistics to investigate biological pathway enrichment as well as cell type and tissue enrichment in GWAS loci.

Background/motivation

Investigating GWAS loci enrichment in pathways, cell types, and tissues is an important part of linking genetic risk to biological mechanisms. Peparing GWAS summary statistics and performing enrichment analyses can be time-consuming. This workflow serves as an example for how to use and format summary statistics for popular software FUMA and WebGestaltR.

Workflow Summary

  1. Setting up and importing summary statistics
  2. Formatting summary statistics for FUMA
  3. Using WebGestaltR

(back to top)

Getting Started

Prerequisites

Installation

  1. Clone the repo
    git clone https://github.com/ipdgc/GP2-pathway-enrichment-pipeline.git

(back to top)

Usage

These notebooks can be directly uploaded to your workspace on Terra. They are designed to be used on Terra with AMP-PD data, but by changing the file paths you can use these notebooks anywhere other than Terra and for any data.

1. First import PD GWAS summary statistics into Terra workspace

Using import_GWAS_sumstats_Manuela.ipynb. Summary statistics (excluding 23andMe data) were downloaded from https://pdgenetics.org/resources and are publicly available.

2. Format for FUMA

Using format_for_FUMA_Manuela.ipynb. This saves the GWAS summary statistics in FUMA format, then you can just upload this straight to FUMA. So far we can't find a way to link/embed FUMA within Terra.

3. Run WebGestaltR in Terra

Using run_WebGestaltR_Manuela.ipynb. For this we used the PD GWAS significant loci (p < 5 x 10-8) annotated with the nearest genes, available from https://www.nature.com/articles/ng.3043 (Supplementary Table 2).

For more examples, please refer to FUMA documentation and WebGestaltR documentation

(back to top)

Acknowledgments

  • We used some of AMP-PD's really helpful setup code

(back to top)

About

IPDGC x GP2 Hackathon 2021 Project - Pathway and cell type enrichment pipeline

Resources

Stars

Watchers

Forks

Packages

No packages published