Michael Byars ([email protected]) Simone Gable ([email protected]) Tollis Lab
BUSCO (version 3.0, not compatible with version 4.0+) MAFFT: https://mafft.cbrc.jp/alignment/software/ RAxML-HPC: https://cme.h-its.org/exelixis/web/software/raxml/index.html TrimAL: https://github.com/scapella/trimal ASTRAL: https://github.com/smirarab/ASTRAL
MAFFT: https://mafft.cbrc.jp/alignment/software/ RAxML-HPC: https://cme.h-its.org/exelixis/web/software/raxml/index.html TrimAL: https://github.com/scapella/trimal
3.) Export .yaml files for each environment into the folder /YOUR_WORKING_DIR/busco_phylo_pipeline/conda_yamls
amas.yml mafft.yml raxml.yml trimal.yml
Note: If you are using a different version of Astral you will have to correct the name in the Snakemake pipeline
Helper Scripts have been placed in the scripts/ directory to help automate this process. Run busco_multiseq_generator.sh in the base directory of your BUSCO output directory to extract and collate the sequences into one file containing the sequences from every species for each BUSCO. If you have transcriptome mode output from BUSCO there is a script called trans_extract_sequences.sh that you will need to run on each transcriptome mode run before running busco_multiseq_generator.sh.
snakemake --cores 4 --use-conda --keep-going