From 46575c2701fda714ecde693d0153f9ab72d349fc Mon Sep 17 00:00:00 2001 From: HackMD <37423+hackmd-hub[bot]@users.noreply.github.com> Date: Sun, 18 Feb 2024 01:28:13 +0000 Subject: [PATCH] last changed at Feb 17, 2024 5:28 PM, pushed by C. Titus Brown --- doc/README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/doc/README.md b/doc/README.md index 903a6786..bf601c1a 100644 --- a/doc/README.md +++ b/doc/README.md @@ -9,7 +9,7 @@ | `multisearch` | Multithreaded comparison of multiple sketches, in memory | [link](#Running-multisearch) | `pairwise` | Multithreaded pairwise comparison of multiple sketches, in memory | [link](#Running-multisearch) -This repository implements multithreaded plugins for [sourmash](https://sourmash.readthedocs.io/) that provide very fast implementations of `sketch`, `search`, and `gather`. These commands are typically hundreds to thousands of times faster, and 10-50x lower memory, than the current sourmash code. +This repository implements multithreaded plugins for [sourmash](https://sourmash.readthedocs.io/) that provide very fast implementations of `sketch`, `search`, and `gather`. These commands are typically hundreds to thousands of times faster, and 10-50x lower memory, than the current sourmash code. For example, a `gather` of SRR606249 with sourmash v4.8.6 against GTDB rs214 takes 40 minutes and 14 GB of RAM, while `fastgather` takes only 2 minutes and 2 GB of RAM. The main *drawback* to these plugin commands is that their inputs and outputs are not as rich as the native sourmash commands. This means that your input files may need to be prepared differently, and the output may in some cases be most useful as a prefilter in conjunction with regular sourmash commands - see the instructions below for using `fastgather` to create picklists for sourmash. @@ -49,7 +49,7 @@ sourmash sig cat -o sigs.zip There are various places where we recommend using manifests instead of zip files. Why? Well, first, if you are using a zip file created by sourmash, you are already using a manifest! And you will get all of the benefits described above! - + But if you want to use a collection of multiple very large metagenomes (as search targets in `manysearch`, or as queries in `fastmultigather`), then standalone manifests might be a good solution for you. This is for two specific reasons: