CometExec's outputPartitioning might not be same as Spark expects after AQE interferes #298

viirya · 2024-04-22T02:53:18Z

Describe the bug

Currently, CometExec has a default outputPartitioning implementation that reuses the outputPartitioning of the original Spark plan. In most cases this is correct. But Spark AQE has a special node, AQEShuffleRead, which can change the output partitioning when it coalesces partitions, e.g.:

*(2) ColumnarToRow
+- CometSort [id#3311, data#3312, day#3313], [data#3312 DESC NULLS FIRST, id#3311 ASC NULLS FIRST]
   +- AQEShuffleRead coalesced
      +- ShuffleQueryStage 0
         +- CometColumnarExchange hashpartitioning(data#3312, 5), REPARTITION_BY_COL, CometColumnarShuffle, [plan_id=10211]
            +- RowToColumnar
               +- *(1) Project [_1#3304 AS id#3311, _2#3305 AS data#3312, _3#3306 AS day#3313]
                  +- *(1) SerializeFromObject [knownnotnull(assertnotnull(input[0, scala.Tuple3, true]))._1 AS _1#3304, staticinvoke(class org.apache.spark.unsafe.types.UTF8String, StringType, fromString, knownnotnull(assertnotnull(input[0, scala.Tuple3, true]))._2, true, false, true) AS _2#3305, staticinvoke(class org.apache.spark.sql.catalyst.util.DateTimeUtils$, DateType, fromJavaDate, knownnotnull(assertnotnull(input[0, scala.Tuple3, true]))._3, true, false, true) AS _3#3306]
                     +- Scan[obj#3303]

This causes many test failures in org.apache.spark.sql.connector.WriteDistributionAndOrderingSuite on #250. For example, CometSort reports the output partitioning of the original plan, i.e. the hashpartitioning(data#3312, 5) of the original exchange. But after AQE runs, AQEShuffleRead is added and changes the output partitioning to coalescedhashpartitioning(hashpartitioning('data, 5), CoalescedBoundary(0,5)). Because we replace Spark's SortExec with CometSort before AQE modifies the query plan, CometSort still uses SortExec's output partitioning, hashpartitioning(data#3312, 5), which no longer matches what Spark expects.
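
To make the mismatch concrete, here is a minimal, dependency-free Scala sketch. Every type in it (Exchange, CoalescedRead, StaleSort, ChildAwareSort and the two partitioning classes) is a hypothetical stand-in invented for illustration, not a Spark or Comet class. It only models the idea that a partitioning captured at replacement time goes stale once a coalescing read is inserted below the operator. ChildAwareSort shows one possible direction, deferring to the current child, and is not necessarily how #299 resolves the issue.

```scala
// Toy model only: these types imitate, but are NOT, Spark's or Comet's classes.
sealed trait Partitioning
final case class HashPartitioning(column: String, numPartitions: Int) extends Partitioning
final case class CoalescedHashPartitioning(
    from: HashPartitioning,
    startReducer: Int,
    endReducer: Int) extends Partitioning

sealed trait Plan { def outputPartitioning: Partitioning }

// Stand-in for the shuffle exchange at the bottom of the plan.
final case class Exchange(partitioning: HashPartitioning) extends Plan {
  def outputPartitioning: Partitioning = partitioning
}

// Stand-in for AQEShuffleRead: coalesces all reducers into a single range.
final case class CoalescedRead(child: Exchange) extends Plan {
  def outputPartitioning: Partitioning =
    CoalescedHashPartitioning(child.partitioning, 0, child.partitioning.numPartitions)
}

// Behaviour described in this issue: the partitioning is captured from the
// original operator when it is replaced, before the coalescing read exists.
final case class StaleSort(child: Plan, captured: Partitioning) extends Plan {
  def outputPartitioning: Partitioning = captured
}

// Assumed alternative: a sort does not repartition its input, so it can
// report whatever its current child reports.
final case class ChildAwareSort(child: Plan) extends Plan {
  def outputPartitioning: Partitioning = child.outputPartitioning
}

object PartitioningDemo {
  val exchange: Exchange = Exchange(HashPartitioning("data", 5))
  // The coalescing read is inserted after the sort has already been replaced.
  val afterAqe: CoalescedRead = CoalescedRead(exchange)
  val stale: StaleSort = StaleSort(afterAqe, exchange.outputPartitioning)
  val fresh: ChildAwareSort = ChildAwareSort(afterAqe)

  def main(args: Array[String]): Unit = {
    println(s"stale sort reports:       ${stale.outputPartitioning}")
    println(s"child-aware sort reports: ${fresh.outputPartitioning}")
  }
}
```

In this model, StaleSort keeps reporting the plain hash partitioning while its actual child now reports the coalesced one, which mirrors the CometSort situation in the plan above.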

Steps to reproduce

No response

Expected behavior

No response

Additional context

No response

viirya added the bug Something isn't working label Apr 22, 2024

viirya self-assigned this Apr 22, 2024

viirya mentioned this issue Apr 22, 2024

fix: CometExec's outputPartitioning might not be same as Spark expects after AQE interferes #299

Merged

viirya closed this as completed in #299 Apr 23, 2024
