Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

connect: expose a single executable for spark connect. #3551

Open
universalmind303 opened this issue Dec 11, 2024 · 2 comments
Open

connect: expose a single executable for spark connect. #3551

universalmind303 opened this issue Dec 11, 2024 · 2 comments
Assignees
Labels
enhancement New feature or request needs triage

Comments

@universalmind303
Copy link
Contributor

Is your feature request related to a problem?

I'd like to be able to run spark connect without needing to write any python code. If I want to run spark via Java or Scala, I still need to write python code to start the connect server.

Describe the solution you'd like

# download the binary
# this is just an example. Most of the times there's an install script that selects the correct binary to install based off the arch. 
# it could also just be hosted on github artifacts.
curl https://eventualcomputing.com/install-spark-connect.sh | sh

# start the server
./spark-connect --port 55555

Then I can connect using any language I want.

import org.apache.spark.sql.SparkSession
val spark = SparkSession.builder.remote("sc://localhost:55555").build()

Describe alternatives you've considered

No response

Additional Context

this follows pretty closely with the official spark connect overview

Would you like to implement a fix?

No

@universalmind303 universalmind303 added enhancement New feature or request needs triage labels Dec 11, 2024
@andrewgazelka
Copy link
Contributor

hmmm would we bundling python in? how do you perceive this working? or do you want to remove all python dependencies? the issue that I was given for removing all python dependencies is it would not work with ray.

@universalmind303
Copy link
Contributor Author

universalmind303 commented Dec 12, 2024

hmmm would we bundling python in? how do you perceive this working? or do you want to remove all python dependencies? the issue that I was given for removing all python dependencies is it would not work with ray.

The user will still need python installed on their system, I'm pyo3 supports dynamically linking to python at runtime. Right now we use the extension-module feature that essentially treats it as a python module (.so). If we compile a normal rust binary without that feature enabled, it should link to the system's python at runtime.

@andrewgazelka andrewgazelka self-assigned this Dec 12, 2024
@raunakab raunakab self-assigned this Dec 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request needs triage
Projects
None yet
Development

No branches or pull requests

3 participants