De Bruijn Genome Assembly

Creates random reads from a genome sequence and tries to put it back together.

Getting Started

git clone https://github.com/molnxx/genome_assembly.git

Prerequisites

You need numpy and a preferably circular DNA sequence.

Usage

Create reads

Create random reads with a coverage n and variable length of around l:

python de_bruijn_assembly.py create -i genome.fasta -n 5 -l 200 -o outfile

outfile determines the fasta file with all the reads.

Assemble genome

Assemble the genome sequence with de Bruijn graphs:

python de_bruijn_assembly.py assemble -i reads.fasta -k 43 -p orig_genome.fasta -o result_file

k is the length of the kmers. orig_genome.fasta is the genome sequence to proof the results (which are stored in result_file).

Examples

Included is the genome of pUC19, a well-known vector. Also the example from the paper referenced below.

python de_bruijn_assembly.py create -i pUC19.fasta -n 6 -l 150 -o pUC19_6_150.fasta
python de_bruijn_assembly.py assemble -i pUC19_6_150.fasta -k 43 -p pUC19.fasta -o results.txt

License

Public Domain

Acknowledgments

Phillip E C Compeau et al. for How to apply de Bruijn graphs to genome assembly doi
Atom Community (https://github.com/atom/atom)
NumPy devs (https://www.numpy.org/)

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
de_bruijn_assembly.py		de_bruijn_assembly.py
pUC19.fasta		pUC19.fasta
plasmid.txt		plasmid.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

De Bruijn Genome Assembly

Getting Started

Prerequisites

Usage

Create reads

Assemble genome

Examples

License

Acknowledgments

About

Releases

Packages

Languages

juvilius/genome_assembly

Folders and files

Latest commit

History

Repository files navigation

De Bruijn Genome Assembly

Getting Started

Prerequisites

Usage

Create reads

Assemble genome

Examples

License

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages