diff --git a/src/guides/bioinformatics/filtering-and-subsampling.rst b/src/guides/bioinformatics/filtering-and-subsampling.rst index c86b903d..b72cd1cc 100644 --- a/src/guides/bioinformatics/filtering-and-subsampling.rst +++ b/src/guides/bioinformatics/filtering-and-subsampling.rst @@ -157,9 +157,9 @@ Subsampling Another common filtering operation is subsetting of data to achieve a more even spatio-temporal distribution or to cut-down data set size to more -manageable numbers. The filter command allows you to select a specific number of -sequences from specific groups, for example one sequence per month from each -country: +manageable numbers. The filter command allows you to partition the data into +groups based on column values and sample uniformly. For example, target one +sequence per month from each country: .. code-block:: bash