From 8fc754ecc9471dddb5ade752480ea2a5c9ab440a Mon Sep 17 00:00:00 2001
From: Victor Lin <13424970+victorlin@users.noreply.github.com>
Date: Thu, 15 Aug 2024 14:55:08 -0700
Subject: [PATCH] Clarify --sequences-per-group example

---
 src/guides/bioinformatics/filtering-and-subsampling.rst | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/src/guides/bioinformatics/filtering-and-subsampling.rst b/src/guides/bioinformatics/filtering-and-subsampling.rst
index c86b903d..b72cd1cc 100644
--- a/src/guides/bioinformatics/filtering-and-subsampling.rst
+++ b/src/guides/bioinformatics/filtering-and-subsampling.rst
@@ -157,9 +157,9 @@ Subsampling
 
 Another common filtering operation is subsetting of data to achieve a more
 even spatio-temporal distribution or to cut-down data set size to more
-manageable numbers. The filter command allows you to select a specific number of
-sequences from specific groups, for example one sequence per month from each
-country:
+manageable numbers. The filter command allows you to partition the data into
+groups based on column values and sample uniformly. For example, target one
+sequence per month from each country:
 
 .. code-block:: bash
 