Fix: Ensure Cache Key pk is Converted to INT to Prevent Dataframe Series Null Issues #161

arthur-verta · 2024-12-23T22:00:13Z

-> Ensure that the cache key pk (if used) is always converted to an INT format.

This addresses a bug that occurs when a queryset is loaded into a dataframe. Specifically, if the queryset includes a foreign key with nullable fields and a mix of instances with null and non-null related fields, pandas assigns the dtype of the primary key (pk) column as object. Consequently, pk values are automatically converted to floats because a pandas integer Series cannot contain None.

To avoid this, we must explicitly reconvert the pk column to INT before using it as a cache key.

Without this step, as of now, the dataframe ends up with None for every row in such cases.

[Using pandas 2.2.2]

Make sure that the cache key pk used, if any available, is converted to INT format.

Update utils.py

Add a try / except, in case the orginial pk is not an integer

arthur-verta added 3 commits December 23, 2024 16:53

Update utils.py

e0bb5a5

Make sure that the cache key pk used, if any available, is converted to INT format.

Merge pull request #1 from arthur-verta/arthur-verta-patch-1

42d75e7

Update utils.py

Update utils.py

fdfe404

Add a try / except, in case the orginial pk is not an integer

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: Ensure Cache Key pk is Converted to INT to Prevent Dataframe Series Null Issues #161

Fix: Ensure Cache Key pk is Converted to INT to Prevent Dataframe Series Null Issues #161

arthur-verta commented Dec 23, 2024 •

edited

Loading

Fix: Ensure Cache Key pk is Converted to INT to Prevent Dataframe Series Null Issues #161

Are you sure you want to change the base?

Fix: Ensure Cache Key pk is Converted to INT to Prevent Dataframe Series Null Issues #161

Conversation

arthur-verta commented Dec 23, 2024 • edited Loading

arthur-verta commented Dec 23, 2024 •

edited

Loading