DOC: Update docstring for read_excel #56543

phofl · 2023-12-17T23:27:08Z

The initial motivation for this change was the typo in xlrd

rhshadrach · 2023-12-18T14:45:54Z

pandas/io/excel/_base.py

-        When ``engine=None``, the following logic will be
-        used to determine the engine:


I do think it's good to have this logic documented, can you just move it out of the versionchanged instead?

IIRC xlrd used to not only support xlsx files but at one point was even the default so we had to go through some lengths to document that transition as clearly as possible. We are a few years removed from that and since then all default read libraries have specialized in a given extension(s), so I think we can do without going into this detail in the docstring

and since then all default read libraries have specialized in a given extension(s)

Where are the default read libraries documented? E.g. both openpyxl and calamine can read xlsx, and both pyxlsb and calamine can read xlsb.

Calamine is never the default - you'd have to explicitly use that as an engine. Otherwise this is documented in the Excel section of the IO manual

https://pandas.pydata.org/pandas-docs/stable/user_guide/io.html#excel-files

Granted that section could be rewritten to be a little clearer, but I think that is out of scope for what @phofl is doing here

Calamine is never the default - you'd have to explicitly use that as an engine.

This is true today, but there is an issue to make it the default for xlsb files. In any case, I don't believe it's documented that Calamine is never the default.

Updated and added back in without the version changed

rhshadrach · 2023-12-18T14:46:34Z

pandas/io/excel/_base.py

@@ -165,31 +165,12 @@
    Supported engines: "xlrd", "openpyxl", "odf", "pyxlsb", "calamine".


Out of scope, but it appears to me this line is duplicative.

I removed this line

phofl · 2024-01-03T22:17:51Z

/preview

github-actions · 2024-01-03T22:18:02Z

Website preview of this PR available at: https://pandas.pydata.org/preview/56543/

phofl · 2024-01-03T22:25:13Z

I've also clarified the user guide

rhshadrach

lgtm

rhshadrach · 2024-01-04T03:30:28Z

Thanks @phofl

…cel) (#56730) Backport PR #56543: DOC: Update docstring for read_excel Co-authored-by: Patrick Hoefler <[email protected]>

DOC: Update docstring for read_excel

548c16e

phofl added Docs IO Excel read_excel, to_excel labels Dec 17, 2023

phofl added this to the 2.2 milestone Dec 17, 2023

phofl requested a review from rhshadrach as a code owner December 17, 2023 23:27

rhshadrach requested changes Dec 18, 2023

View reviewed changes

WillAyd approved these changes Jan 2, 2024

View reviewed changes

phofl added 5 commits January 3, 2024 23:19

Update

0d17d62

Merge remote-tracking branch 'upstream/main' into doc_excel

7a3f09e

Update

411eda7

Update user guide

21d88bf

Update user guide

b72d32b

Fixup

c39f154

rhshadrach approved these changes Jan 4, 2024

View reviewed changes

rhshadrach merged commit 70fc174 into pandas-dev:main Jan 4, 2024
50 checks passed

meeseeksmachine mentioned this pull request Jan 4, 2024

Backport PR #56543 on branch 2.2.x (DOC: Update docstring for read_excel) #56730

Merged

meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Jan 4, 2024

Backport PR pandas-dev#56543: DOC: Update docstring for read_excel

db20e04

phofl deleted the doc_excel branch January 4, 2024 08:21

phofl added a commit that referenced this pull request Jan 4, 2024

Backport PR #56543 on branch 2.2.x (DOC: Update docstring for read_ex…

97eb331

…cel) (#56730) Backport PR #56543: DOC: Update docstring for read_excel Co-authored-by: Patrick Hoefler <[email protected]>

pmhatre1 pushed a commit to pmhatre1/pandas-pmhatre1 that referenced this pull request May 7, 2024

DOC: Update docstring for read_excel (pandas-dev#56543)

18c12a3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC: Update docstring for read_excel #56543

DOC: Update docstring for read_excel #56543

phofl commented Dec 17, 2023

rhshadrach Dec 18, 2023

WillAyd Jan 2, 2024

rhshadrach Jan 3, 2024 •

edited

Loading

WillAyd Jan 3, 2024

WillAyd Jan 3, 2024

rhshadrach Jan 3, 2024

phofl Jan 3, 2024

rhshadrach Dec 18, 2023

phofl Jan 3, 2024

phofl commented Jan 3, 2024

github-actions bot commented Jan 3, 2024

phofl commented Jan 3, 2024

rhshadrach left a comment

rhshadrach commented Jan 4, 2024

		When ``engine=None``, the following logic will be
		used to determine the engine:

		@@ -165,31 +165,12 @@
		Supported engines: "xlrd", "openpyxl", "odf", "pyxlsb", "calamine".

DOC: Update docstring for read_excel #56543

DOC: Update docstring for read_excel #56543

Conversation

phofl commented Dec 17, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rhshadrach Jan 3, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

phofl commented Jan 3, 2024

github-actions bot commented Jan 3, 2024

phofl commented Jan 3, 2024

rhshadrach left a comment

Choose a reason for hiding this comment

rhshadrach commented Jan 4, 2024

rhshadrach Jan 3, 2024 •

edited

Loading