####0.14a

TBD

check if something need recompilation and try to recompile
instead of processing 100M reads for 4 libs with similar IS, split it by 25M each
bioconda
speed-up gap closing - so far the slowest step! (5/7 of the run!)

####0.13c

preprocessing
more accurate libs stats estimation (using 5% of the genome in the longest contigs)
reduction:
improved speed and accuracy
scaffolding:
using combination of snap+lastal - very fast and super sensitive!
no gap closing during scaffolding?

####0.13b

####0.13a

added long reads (EXPERIMENTAL) & reference-based scaffolding support (through pyScaf)
added all dependencies to github
INSTALL.sh downloads & compiles everything
all necessary paths are defined by redundans.py
README.md updated
- docs/README.md reduce and updated
contigs FastA is sorted by descending contig size & contigs below --minLength are removed (this speeds up reduction greatly)
code was polished & optimised, especially in reduction step (fasta2homozygous.py)
- subprocess is closed when not needed to lower memory footprint
- speed-optimised: avoided LASTal results sorting by sorting input (contigs FastA)
- memory-optimised ie. generator instead of list (thanks to sorted contigs FastA) (RAM usage: 150G -> 1G)
- contigs FastA file has to be ordered by descending contig size!
prints how many iterations in total ie. iteration 1.1 of 2.2 ...
libraries stats are estimated after reduction (avoiding double estimations for crappy libs)
- insert size estimated only on the major read orientation
reduction:
- plotting identity histogram of heterozygous contigs as contigs.reduced.fa.hist.png
- reporting heterozygous contigs statistics in contigs.reduced.fa.hetero.tsv
scaffolding:
- added SNAP aligner, faster mapping alternative to BWA MEM

####0.12c

####0.12-beta

LASTal version checked on runtime
added FastaIndex.py
- generate stats into .fai file - samtools faidx compatible
simplified dependencies
- Biopython, scipy, numpy & SQLite no longer needed

####0.12-alpha

####0.11-beta

####0.11b

two similarity search algorithms: BLAT for --identity 0.85+ and LAST for --identity < 0.85
corrected error in fastq2sspace.py, so now libraries are merged based on mean insert size, not median

####0.11a

iterative insert size estimation refining for mate pairs
fasta2diverged.py deprecated
-o/--output now contains also sspace intermediate files
reduction (fasta2homozygous.py) uses LAST instead of BLAT
- LAST multi-threaded (only in Python 2.7+)
cleaning-up output directory from intermediate files

####0.10b

-l/--limit now takes number of reads limit as fraction of homozygous genome size. To process all reads set -l to 0.
Gap2Seq is default gap closing software
BLAT is default reduction software
BWA MEM is default scaffolding software

Provide feedback

Saved searches