Skip to content
This repository has been archived by the owner on Jan 21, 2024. It is now read-only.

Commit

Permalink
prepare genbank DBs
Browse files Browse the repository at this point in the history
  • Loading branch information
luizirber committed Sep 30, 2023
1 parent cd385e2 commit 43de483
Showing 1 changed file with 17 additions and 1 deletion.
18 changes: 17 additions & 1 deletion experiments/Snakefile
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,8 @@ EXEC="cargo run -p mastiff-index --release -- "

rule all:
# input: "outputs/rs207"
input: "outputs/rs207-2k"
# input: "outputs/rs207-2k"
input: expand("outputs/genbank-{domain}", domain=["archaea", "bacteria", "fungi", "protozoa", "viral"])

rule rs207_1k:
output: directory("outputs/rs207-1k")
Expand Down Expand Up @@ -148,6 +149,21 @@ rule catalog_metagenomes:
sig_path = path / "sigs" / f"{sra_id}.sig"
if sig_path.exists():
out.write(f"{sig_path}\n")
out.flush()

#######################################################################

rule genbank:
output: directory("outputs/genbank-{domain}")
input: "/data/wort/databases/genbank-2022.03-{domain}-k21.zip"

shell: """
{EXEC} index -k 21 -s 1000 \
--output {output} \
{input}
"""

#######################################################################

"""
{EXEC} index -k 21 -s 1000 --output /scratch/analysis/rocksdb_metagenomes catalog_metagenomes
Expand Down

0 comments on commit 43de483

Please sign in to comment.