-
Notifications
You must be signed in to change notification settings - Fork 169
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: newFileScanRDD should not take constructor from custom Spark versions #412
Conversation
workaround for Amazon EMR version: emr-6.15.0 and Spark 3.4.1 custom implementation
spark/src/main/spark-3.x/org/apache/comet/shims/ShimCometScanExec.scala
Outdated
Show resolved
Hide resolved
spark/src/main/spark-3.x/org/apache/comet/shims/ShimCometScanExec.scala
Outdated
Show resolved
Hide resolved
I take the liberty to commit some suggestions on code comment and style as it is not responded for days. I will merge this once CI passes. |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #412 +/- ##
============================================
- Coverage 34.02% 34.02% -0.01%
- Complexity 857 858 +1
============================================
Files 116 116
Lines 38565 38583 +18
Branches 8517 8521 +4
============================================
+ Hits 13120 13126 +6
- Misses 22691 22702 +11
- Partials 2754 2755 +1 ☔ View full report in Codecov by Sentry. |
Merged. Thanks @ceppelli @kazuyukitanimura @andygrove |
…sions (apache#412) * [FIX] - workaround for aws emr spark 3.4 workaround for Amazon EMR version: emr-6.15.0 and Spark 3.4.1 custom implementation * Update spark/src/main/spark-3.x/org/apache/comet/shims/ShimCometScanExec.scala * Update spark/src/main/spark-3.x/org/apache/comet/shims/ShimCometScanExec.scala --------- Co-authored-by: Liang-Chi Hsieh <[email protected]> (cherry picked from commit 1f23c18)
Which issue does this PR close?
Closes #411 .
Rationale for this change
the file spark-sql_2.12-3.4.1-amzn-2.jar is a custom version of spark and contains the class org.apache.spark.sql.execution.datasources.FileScanRDD with 2 constructs, one with 6 parameters and the second with 8 parameters. The suggested workaround filters out the custom constructor.
What changes are included in this PR?
How are these changes tested?