TST: Remove groupby/test_function.py #56338

rhshadrach · 2023-12-05T02:45:02Z

closes #xxxx (Replace xxxx with the GitHub issue number)
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

This finishes up the removal of test_function.py from the groupby tests. With the exception of test_groupby_non_arithmetic_agg_int_like_precision (which is moved to other files here), the tests that remained fall into two partially overlapping camps:

Tests for numeric_only
Tests that use the fixture groupby_func, which goes over every groupby operation
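For readers unfamiliar with the second camp, here is a minimal sketch of what a test driven by such a fixture can look like. The fixture name `groupby_method`, the `METHODS` list, and the test itself are hypothetical and cover only a hand-picked subset; the actual `groupby_func` fixture goes over every groupby operation.

```python
import pandas as pd
import pytest

# Hypothetical, hand-picked subset used only for illustration.
METHODS = ["count", "first", "last", "max", "min", "sum"]


@pytest.fixture(params=METHODS)
def groupby_method(request):
    # pytest runs each test that requests this fixture once per entry in METHODS.
    return request.param


def test_result_is_indexed_by_group_keys(groupby_method):
    df = pd.DataFrame({"key": [1, 1, 2], "value": [10, 20, 30]})
    # Look the method up by name so one test body covers every operation.
    result = getattr(df.groupby("key"), groupby_method)()
    assert list(result.index) == [1, 2]
```

Splitting a test like this across files would mean repeating the body once per file, which is the duplication concern discussed below.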

For the first, the options I considered were creating test_numeric_only.py or splitting the tests up into various other files such as test_reductions.py, test_cumulative.py, and some others. Similarly, for tests that use groupby_func and don't fit in other files, I considered creating test_all_methods.py (which can be renamed to something better) or splitting the tests up into several different files.
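As a reminder of what the first camp exercises, here is a small sketch of the `numeric_only` argument on a reduction. The frame and column names are made up for illustration and are not taken from the tests in this PR.

```python
import pandas as pd

# Illustrative data: one numeric column and one string column.
df = pd.DataFrame(
    {
        "key": ["a", "a", "b"],
        "num": [1, 2, 3],
        "text": ["x", "y", "z"],
    }
)

# With numeric_only=True the string column is left out of the result.
result = df.groupby("key").sum(numeric_only=True)
remaining_columns = list(result.columns)
```

Because the same argument exists on many reductions and cumulative operations, tests for it naturally cut across files such as test_reductions.py.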

I'm completely on the fence here. Breaking up a parameterized test and duplicating code across multiple files seems pretty bad. While I don't love having test_all_methods (it's sort of a catch-all), at first it seemed to me that it is less bad than breaking up the parameterized tests. But then grepping around, it appears that there are only three tests across all of groupby that would go in test_all_methods. Perhaps this is something we can expand on later - but maybe since there are so few tests it's okay to break these up?

Any ideas here are very much welcome.

cc @jbrockmendel @mroeschke

rhshadrach · 2023-12-05T02:46:49Z

pandas/tests/groupby/methods/test_nth.py

+        (24650000000000001, 24650000000000002),
+    ],
+)
+def test_groupby_nth_int_like_precision(data):


This test has been cleaned up, not just moved.

rhshadrach · 2023-12-05T02:47:03Z

pandas/tests/groupby/test_reductions.py

+    ],
+)
+@pytest.mark.parametrize("method", ["count", "min", "max", "first", "last"])
+def test_groupby_non_arithmetic_agg_int_like_precision(method, data):


This test has been cleaned up, not just moved.
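As background on why these tests use integers of this size, here is a minimal sketch, assuming the concern is an intermediate cast to float64 inside the groupby operation. It uses plain Python and the pair of values from the test_nth.py parametrization above; it is not the test itself.

```python
# Values taken from the parametrization shown in the diff above.
a, b = 24650000000000001, 24650000000000002

# float64 has a 53-bit significand, so integers above 2**53 are not all
# exactly representable.
largest_exact = 2**53
both_above_limit = a > largest_exact and b > largest_exact

# Casting to float collapses the two distinct integers into one value.
collapsed = float(a) == float(b)

# A round trip through float therefore no longer returns the input.
round_trip_changed = int(float(a)) != a
```

A method such as min, max, first, last, or nth that went through float internally would silently return a different integer, which is what an int-like precision test guards against.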

WillAyd

looks reasonable to me. nice work

mroeschke

Breaking up a parameterized test and duplicating code across multiple files seems pretty bad. While I don't love having test_all_methods (it's sort of a catch-all), at first it seemed to me that it is less bad than breaking up the parameterized tests.

IMO I think in the future this could be rolled into a potential refactor of test_groupby.py as a file testing "groupby scenarios that should work with all methods"

mroeschke · 2023-12-05T17:48:41Z

Thanks @rhshadrach

rhshadrach · 2023-12-05T21:27:06Z

Breaking up a parameterized test and duplicating code across multiple files seems pretty bad. While I don't love having test_all_methods (it's sort of a catch-all), at first it seemed to me that it is less bad than breaking up the parameterized tests.

IMO I think in the future this could be rolled into a potential refactor of test_groupby.py as a file testing "groupby scenarios that should work with all methods"

Sounds good - test_groupby is my next target; I've always been under the impression that tests here should be for something like "All groupby attributes that aren't ops and don't have a more specific location". E.g. len(df.groupby(...)) or df.groupby(...).groups. I think it makes sense to roll test_all_methods into here.
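To make the distinction concrete, here is a short sketch of the two non-op attributes mentioned above. The frame is illustrative only.

```python
import pandas as pd

df = pd.DataFrame({"key": ["a", "b", "a"], "value": [1, 2, 3]})
gb = df.groupby("key")

# Neither of these computes an aggregation or a transform.
n_groups = len(gb)   # number of distinct groups
groups = gb.groups   # mapping from group key to the row labels in that group

group_keys = sorted(groups)
rows_in_a = list(groups["a"])
```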

rhshadrach added 2 commits December 4, 2023 21:30

TST: Finish removal of groupby/test_function.py

3bb978f

Move one more

84128b9

rhshadrach added the Testing, Groupby, and Clean labels Dec 5, 2023

WillAyd approved these changes Dec 5, 2023

mroeschke approved these changes Dec 5, 2023

mroeschke added this to the 2.2 milestone Dec 5, 2023

mroeschke merged commit dc4c474 into pandas-dev:main Dec 5, 2023
50 checks passed

rhshadrach deleted the clean_gb_tests_7 branch December 5, 2023 21:28
