VizBin provides a user-friendly interface for the efficient visualisation of (assembled) metagenomic data. Furthermore, it can be used for the visualisation of single genome assembly data in order to inspect if any significant amounts of contamination has occured. This could manifest in a set of clusters apparent in the VizBin map rather than a single cluster.
Sometimes, independent of whether the application is for metagenomes or single genomes, it might be interesting to increase the density of the apparent clusters by normalizing the sequence length.
In particular for metagenomic assemblies, assembled sequence length may vary greatly between the consituent populations.
Some population might have long and few contigs, whereas the opposite might be true for other populations.
For these cases, the chunkify.py
script can be used.
This script creates "chunks" out of the provided sequences with a defaul chunk-length of 3,000 nt.
The script seq_stats.pl
can be used to generate per-sequence length information and to generate an overall summary.
Simply pipe the stdout to /dev/null
to get the overall summary only.