-
-
Notifications
You must be signed in to change notification settings - Fork 18.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BUG: Index.join different sorting behavior in libjoin faspath #54646
Comments
Any idea if #54611 helps here? |
Based on a quick look I’m optimistic |
@lukemanley was this closed by #54765? |
#54765 addressed the |
Is this related to this comment? pandas/pandas/core/indexes/base.py Lines 4662 to 4665 in 7b8c6f6
Bumped into the comment above and will link to this issue if it is. |
definitely looks related, yes |
In Index we have a _can_use_libjoin check that determined if we can use the libjoin fastpaths (AFAICT these are just fastpaths and the behavior is supposed to be identical to the non-fastpaths). There are a few cases where _can_use_libjoin is not strict enough and we end up making copies in order to use the fastpaths, which negates the benefits (havent actually done any measurements, just assuming). In particular MultiIndex and RangeIndex cases.
Patching can_use_libjoin to return False in MultiIndex and RangeIndex cases breaks a bunch of tests bc the join results cease to be ordered. Needs further investigation.
The text was updated successfully, but these errors were encountered: