All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog,
- Make genbank file with pfam annotation: desc field has format "tax:viral||anno:XXXX"
- Check "suspicious" words in each contig: methyltransferase, epimerase, endonuclease
- Mark short jobs as local_rules to optimaize for
virsorter run
--cluster mode - Add "all" in --include-groups
- Change initial slide window to gene number based (4 genes) from bp size; apply the minimal prophage size at the end.
- Fix issues with numpy v1.24 by limiting numpy version <1.24
- Change "singularity" to "apptainer" in
README
- Add "PROVIRUS_MIN_HALLMARK_CNT" in
template-config-original.yaml
for tuning prophage search inprovirus.py
- Add new cols: "final_max_score_group" and "final_max_score" in
final-viral-boundary.tsv
to be consistent with "max_score_group" and "max_score" infinal-viral-score.tsv
- Remove records in
final-viral-boundary.tsv
but not infinal-viral-score.tsv
- Fix
--seqname-suffix-off
regex issues - Fix issues with pandas v1.3.0
- Add email option (virsorter2 at gmail) for bug report
- Fix bug with --seqname-suffix-off causing error with DRAMv
- Fix minor bugs
- Add --viral-gene-enrich-off option
- Add --seqname-suffix-off option
- Change default --include-groups from all groups to dsDNAphage and ssDNA
- Add FAQ, viral identification SOP link in README
- Add --keep-original-seq option
- Add --high-confidence-only option
- Add --exclude-lt2gene option
- Remove more itermediate files by default
- Change HMMER3.3.1 to HMMER3.3; HMMER3.3.1 throws seg fault with low complexity sequences
- Add --viral-gene-required
- Add --hallmark-required
- Add --hallmark-required-on-short
- Add --prep-for-dramv
- Add --label in virsorter run to allow adding prefix to output files
- Add shape to boundry file
- Apply length prefilter to provirus.py; skip all seqs length < min(3000, MIN_GENOME_SIZE)
- Limit max file splits to 1000
- Make score table for trimmed viral seqs after provirus extraction
- Add length to score table
- Add subcommand
virsorter train-feature
andvirsorter train-model
for training customized classifiers of new viral groups,virsorter config
for change configurations
- First release