BUG: Set check_exact to true if dtype is int #55934

parthi-siva · 2023-11-13T08:57:26Z

closes BUG: assert_series_equal not raising on unequal series? #55882
[Tests added and passed]
All [code checks passed]
Added [type annotations]
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

parthi-siva · 2023-11-13T09:01:59Z

@MarcoGorelli not sure where to add test cases for the change.

Edit: Got it (pandas/tests/util/test_assert_frame_equal.py)

- If int dtype is different we are ignoring the difference - so added check to set check_exact to true only when dtype is same

pandas/_testing/asserters.py

MarcoGorelli · 2023-11-17T17:15:10Z

Well I'm relieved that CI is still green 😄 Can you update the docs for this method to note that rtol, atol and check_exact don't take effect for int dtype?

Regarding the check - not totally sure about the check, will take a closer look next week

parthi-siva · 2023-11-19T04:53:14Z

Sure. Updated the documentation. Added note under check_exact. Since we are setting it to True in case of int dtype. Also, documentation for rtol and atol states it is applicable only when check_exact is False so I assumed that it is sufficient to add note under check_exact

MarcoGorelli · 2023-11-23T15:17:58Z

I think it should always be exact, except for floats?

You should also check for extensions arrays, and include a whatsnew

I've added a commit anyway, hope it's ok - do the changes look alright to you?

parthi-siva · 2023-11-23T16:29:29Z

Yeah, make sense.

Some testcases are failing in some builds. Is that alright?

MarcoGorelli · 2023-11-23T16:43:54Z

Looks like another bug was hiding because of the check_exact default 😄 #56136

For now let's xfail the relevant test

MarcoGorelli

Looks good to me - although I did add a commit, so leaving open a bit in case others have objections

parthi-siva · 2023-11-27T16:38:28Z

Sure Marco

pandas/tests/extension/base/methods.py

Co-authored-by: Matthew Roeschke <[email protected]>

mroeschke · 2023-11-27T18:41:13Z

Thanks @parthi-siva and @MarcoGorelli

jorisvandenbossche · 2023-12-05T08:36:31Z

pandas/tests/series/test_constructors.py

        data[1] = 1
        result = Series(data, index=index)
        expected = Series([0, 1, 2], index=index, dtype=int)
-        tm.assert_series_equal(result, expected)
+        with pytest.raises(AssertionError, match="Series classes are different"):
+            # TODO should this be raising at all?
+            # https://github.com/pandas-dev/pandas/issues/56131
+            tm.assert_series_equal(result, expected)


This is a different issue than reported in #56131 (there is also no check_dtype=False here)

This doesn't involve different data types (it's all numpy dtypes), and the two objects created here seemingly are the same:

data = np.ma.masked_all((3,), dtype=int) data[0] = 0 data[1] = 1 data[2] = 2 index = ["a", "b", "c"] result = Series(data, index=index) expected = Series([0, 1, 2], index=index, dtype=int)

In [29]: result Out[29]: a 0 b 1 c 2 dtype: int64 In [30]: expected Out[30]: a 0 b 1 c 2 dtype: int64 In [31]: result.dtype == expected.dtype Out[31]: True

But apparently we have a bug in the Series constructor that preserves the masked array as underlying value if it has no masked elements:

In [32]: result.values Out[32]: masked_array(data=[0, 1, 2], mask=[False, False, False], fill_value=999999) In [33]: expected.values Out[33]: array([0, 1, 2])

That seems like a separate, actual bug we should solve, regardless of the behaviour of check_dtype in assert_series_equal (although being more strict here actually uncovered this bug ..)

parthi-siva · 2023-12-05T08:45:47Z

Looks like another bug was hiding because of the check_exact default 😄 #56136

For now let's xfail the relevant test

@jorisvandenbossche yeah.. there was another issues because of this

BUG: Set check_exact to true if dtype is int

4eabcd6

MarcoGorelli self-requested a review November 13, 2023 09:07

parthi-siva marked this pull request as draft November 13, 2023 09:30

parthi-siva added 2 commits November 13, 2023 15:11

BUG: Add check to ignore dytype difference

bce3087

- If int dtype is different we are ignoring the difference - so added check to set check_exact to true only when dtype is same

TST: Added test cases

8fd5e61

parthi-siva marked this pull request as ready for review November 13, 2023 09:46

TST: Fix failing test cases

c3ecb84

MarcoGorelli reviewed Nov 13, 2023

View reviewed changes

pandas/_testing/asserters.py Outdated Show resolved Hide resolved

mroeschke added the Testing pandas testing functions or related to the test suite label Nov 13, 2023

DOC: Update function documentation

6d74ec3

parthi-siva force-pushed the BUG-GH#55882 branch from 0231547 to 6d74ec3 Compare November 20, 2023 05:10

MarcoGorelli added 2 commits November 23, 2023 14:46

Merge remote-tracking branch 'upstream/main' into BUG-GH#55882

8f608a0

check_exact only takes effect for floating dtypes

c46b2d6

xfail failing test

a4eabea

MarcoGorelli added this to the 2.2 milestone Nov 27, 2023

MarcoGorelli approved these changes Nov 27, 2023

View reviewed changes

mroeschke reviewed Nov 27, 2023

View reviewed changes

pandas/tests/extension/base/methods.py Outdated Show resolved Hide resolved

Update pandas/tests/extension/base/methods.py

00b7c91

Co-authored-by: Matthew Roeschke <[email protected]>

mroeschke approved these changes Nov 27, 2023

View reviewed changes

mroeschke merged commit 17eec96 into pandas-dev:main Nov 27, 2023
42 checks passed

jorisvandenbossche mentioned this pull request Dec 5, 2023

REGR: assert_frame_equal check_dtype=False no longer works for equal values but numpy vs extension dtype #56340

Closed

jorisvandenbossche reviewed Dec 5, 2023

View reviewed changes

asishm mentioned this pull request Dec 28, 2023

BUG: pd.testing.assert_series_equal break in version 2.2.0rc0 #56646

Closed

3 tasks

crusaderky mentioned this pull request Jan 25, 2024

BUG: testing.assert_series_equal: inferred check_exact should not be passed down to index check #57067

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Set check_exact to true if dtype is int #55934

BUG: Set check_exact to true if dtype is int #55934

parthi-siva commented Nov 13, 2023 •

edited

Loading

parthi-siva commented Nov 13, 2023 •

edited

Loading

MarcoGorelli commented Nov 17, 2023

parthi-siva commented Nov 19, 2023 •

edited

Loading

MarcoGorelli commented Nov 23, 2023

parthi-siva commented Nov 23, 2023

MarcoGorelli commented Nov 23, 2023

MarcoGorelli left a comment

parthi-siva commented Nov 27, 2023

mroeschke commented Nov 27, 2023

jorisvandenbossche Dec 5, 2023 •

edited

Loading

parthi-siva commented Dec 5, 2023

BUG: Set check_exact to true if dtype is int #55934

BUG: Set check_exact to true if dtype is int #55934

Conversation

parthi-siva commented Nov 13, 2023 • edited Loading

parthi-siva commented Nov 13, 2023 • edited Loading

MarcoGorelli commented Nov 17, 2023

parthi-siva commented Nov 19, 2023 • edited Loading

MarcoGorelli commented Nov 23, 2023

parthi-siva commented Nov 23, 2023

MarcoGorelli commented Nov 23, 2023

MarcoGorelli left a comment

Choose a reason for hiding this comment

parthi-siva commented Nov 27, 2023

mroeschke commented Nov 27, 2023

jorisvandenbossche Dec 5, 2023 • edited Loading

Choose a reason for hiding this comment

parthi-siva commented Dec 5, 2023

parthi-siva commented Nov 13, 2023 •

edited

Loading

parthi-siva commented Nov 13, 2023 •

edited

Loading

parthi-siva commented Nov 19, 2023 •

edited

Loading

jorisvandenbossche Dec 5, 2023 •

edited

Loading