Skip to content
forked from npaulat/teava

A workflow for local SNV and indel enrichment analysis based on polymorphic transposable element insertions identified by MELT.

License

Notifications You must be signed in to change notification settings

davidaray/teava

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TEAVA - TE-Associated Variant Analysis

A pipeline for a local single nucleotide variant (SNV) and indel enrichment analysis near polymorphic transposable element (TE) sites.

This workflow is based on 1) TE annotation via RepeatMasker, 2) BWA-mapped Illumina paired-end reads, 3) polymorphic TE site identification via MELT, and 4) indel and SNV calls via FreeBayes. It uses a combination of Shell scripts, Python, and R.

General workflow

  1. TE annotation of reference genome with RepeatMasker and selection of desired TEs
  2. BWA read mapping and FreeBayes variant calling
  3. Polymorphic TE identification with MELT
  4. Polymorphic TE quality filtering and reformatting
  5. SNV/Indel quality filtering and enrichment analysis

The workflow for MELT runs using simulated TE insertion data is also available in the simulations folder.

Necessary programs

NOTE: The program versions listed are those used in the original pipeline, but newer versions should be compatible.

Please note that these scripts often have hard-coded directory and queue variables, and shell scripts are set up for a UGE cluster.

About

A workflow for local SNV and indel enrichment analysis based on polymorphic transposable element insertions identified by MELT.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 65.7%
  • Shell 27.6%
  • R 6.7%