Skip to content

Commit

Permalink
[SPARK-49869][INFRA] Add NumPy in Python 3.13 image
Browse files Browse the repository at this point in the history
### What changes were proposed in this pull request?

This PR add NumPy in Python 3.13 image. Note that this is different from SPARK-49862 because NumPy is a required dependency for ML in Python.

### Why are the changes needed?

To fix Python 3.13 (https://github.com/apache/spark/actions/runs/11168860784/job/31048343334).

### Does this PR introduce _any_ user-facing change?

No, dev-only.

### How was this patch tested?

Will monitor the build.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #48342 from HyukjinKwon/SPARK-49869.

Authored-by: Hyukjin Kwon <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
  • Loading branch information
HyukjinKwon authored and dongjoon-hyun committed Oct 4, 2024
1 parent 96666d4 commit a38505c
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion dev/infra/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -148,7 +148,7 @@ RUN apt-get update && apt-get install -y \
RUN curl -sS https://bootstrap.pypa.io/get-pip.py | python3.13
# TODO(SPARK-49862) Add BASIC_PIP_PKGS and CONNECT_PIP_PKGS to Python 3.13 image when it supports Python 3.13
RUN python3.13 -m pip install --ignore-installed blinker>=1.6.2 # mlflow needs this
RUN python3.13 -m pip install lxml && \
RUN python3.13 -m pip install lxml numpy>=2.1 && \
python3.13 -m pip cache purge

# Remove unused installation packages to free up disk space
Expand Down

0 comments on commit a38505c

Please sign in to comment.