feat: Add COMET_SHUFFLE_MODE config to control Comet shuffle mode #460

viirya · 2024-05-22T22:09:45Z

Which issue does this PR close?

Closes #459.

Rationale for this change

What changes are included in this PR?

How are these changes tested?

andygrove · 2024-05-23T04:35:30Z

docs/source/user-guide/tuning.md

-
-
-
+`spark.comet.exec.shuffle.mode` to `auto` will let Comet choose the best shuffle mode based on the query plan.


viirya · 2024-05-23T15:21:56Z

cc @sunchao

sunchao

LGTM in general. How do we pick shuffle mode when it is auto? I don't seem to find the logic in this PR.

sunchao · 2024-05-23T16:21:25Z

common/src/main/scala/org/apache/comet/CometConf.scala

+        "By default, this config is 'jvm'.")
+    .stringConf
+    .transform(_.toLowerCase(Locale.ROOT))
+    .checkValues(Set("native", "jvm", "auto"))


sunchao · 2024-05-23T16:21:52Z

docs/source/user-guide/configs.md

@@ -39,6 +38,7 @@ Comet provides the following configuration settings.
 | spark.comet.exec.memoryFraction | The fraction of memory from Comet memory overhead that the native memory manager can use for execution. The purpose of this config is to set aside memory for untracked data structures, as well as imprecise size estimation during memory acquisition. Default value is 0.7. | 0.7 |
 | spark.comet.exec.shuffle.codec | The codec of Comet native shuffle used to compress shuffle data. Only zstd is supported. | zstd |
 | spark.comet.exec.shuffle.enabled | Whether to enable Comet native shuffle. By default, this config is false. Note that this requires setting 'spark.shuffle.manager' to 'org.apache.spark.sql.comet.execution.shuffle.CometShuffleManager'. 'spark.shuffle.manager' must be set before starting the Spark application and cannot be changed during the application. | false |
+| spark.comet.exec.shuffle.mode | The mode of Comet shuffle. This config is only effective only if Comet shuffle is enabled. Available modes are 'native', 'jvm', and 'auto'. 'native' is for native shuffle which has best performance in general.'jvm' is for jvm-based columnar shuffle which has higher coverage than native shuffle.'auto' is for Comet to choose the best shuffle mode based on the query plan.By default, this config is 'jvm'. | jvm |


is only effective only -> is only effective

sunchao · 2024-05-23T16:22:23Z

docs/source/user-guide/configs.md

@@ -39,6 +38,7 @@ Comet provides the following configuration settings.
 | spark.comet.exec.memoryFraction | The fraction of memory from Comet memory overhead that the native memory manager can use for execution. The purpose of this config is to set aside memory for untracked data structures, as well as imprecise size estimation during memory acquisition. Default value is 0.7. | 0.7 |
 | spark.comet.exec.shuffle.codec | The codec of Comet native shuffle used to compress shuffle data. Only zstd is supported. | zstd |
 | spark.comet.exec.shuffle.enabled | Whether to enable Comet native shuffle. By default, this config is false. Note that this requires setting 'spark.shuffle.manager' to 'org.apache.spark.sql.comet.execution.shuffle.CometShuffleManager'. 'spark.shuffle.manager' must be set before starting the Spark application and cannot be changed during the application. | false |
+| spark.comet.exec.shuffle.mode | The mode of Comet shuffle. This config is only effective only if Comet shuffle is enabled. Available modes are 'native', 'jvm', and 'auto'. 'native' is for native shuffle which has best performance in general.'jvm' is for jvm-based columnar shuffle which has higher coverage than native shuffle.'auto' is for Comet to choose the best shuffle mode based on the query plan.By default, this config is 'jvm'. | jvm |


also spaces before jvm and auto

sunchao · 2024-05-23T16:23:25Z

spark/src/test/scala/org/apache/comet/exec/CometAggregateSuite.scala

-      Seq(true, false).foreach { cometColumnShuffleEnabled =>
-        withSQLConf(
-          CometConf.COMET_COLUMNAR_SHUFFLE_ENABLED.key -> cometColumnShuffleEnabled.toString) {
+      Seq("native", "jvm").foreach { cometColumnShuffleEnabled =>


nit: maybe update the variable name too

sunchao · 2024-05-23T16:23:55Z

spark/src/test/scala/org/apache/comet/exec/CometExecSuite.scala

@@ -134,14 +134,14 @@ class CometExecSuite extends CometTestBase {
        .toDF("c1", "c2")
        .createOrReplaceTempView("v")

-      Seq(true, false).foreach { columnarShuffle =>
+      Seq("native", "jvm").foreach { columnarShuffle =>


nit: shuffleMode?

viirya · 2024-05-23T17:55:45Z

LGTM in general. How do we pick shuffle mode when it is auto? I don't seem to find the logic in this PR.

When it is auto, Comet chooses native shuffle if possible as it shows better performance. If it is not available (unsupported cases), Comet uses jvm-based columnar shuffle instead.

sunchao

LGTM

viirya · 2024-05-23T18:47:46Z

Thank you @sunchao

viirya · 2024-05-23T19:09:11Z

Error:  /Users/runner/work/datafusion-comet/datafusion-comet/spark/src/test/scala/org/apache/comet/exec/CometAggregateSuite.scala:1215: value COMET_COLUMNAR_SHUFFLE_ENABLED is not a member of object org.apache.comet.CometConf

Weird. CI reports the above compilation error, but I don't see COMET_COLUMNAR_SHUFFLE_ENABLED in CometAggregateSuite locally...

viirya · 2024-05-23T19:53:41Z

Oh, it is from one patch just merged.

kazuyukitanimura

LGTM pending CI

viirya · 2024-05-23T21:08:10Z

Merged. Thanks @sunchao @kazuyukitanimura @andygrove

…ache#460) (cherry picked from commit 507e475)

viirya force-pushed the auto_shuffle_config branch 8 times, most recently from 8660c0c to 2bb0efd Compare May 23, 2024 04:20

andygrove reviewed May 23, 2024

View reviewed changes

viirya force-pushed the auto_shuffle_config branch 2 times, most recently from 07c9ba2 to f3f46bd Compare May 23, 2024 14:43

sunchao reviewed May 23, 2024

View reviewed changes

sunchao approved these changes May 23, 2024

View reviewed changes

viirya force-pushed the auto_shuffle_config branch from 678cd8c to 0c8f0f9 Compare May 23, 2024 18:50

viirya closed this May 23, 2024

viirya reopened this May 23, 2024

feat: Add COMET_SHUFFLE_MODE config to control Comet shuffle mode

586e1a7

viirya force-pushed the auto_shuffle_config branch from 0c8f0f9 to 586e1a7 Compare May 23, 2024 20:03

kazuyukitanimura approved these changes May 23, 2024

View reviewed changes

viirya merged commit 507e475 into apache:main May 23, 2024
40 checks passed

viirya deleted the auto_shuffle_config branch May 23, 2024 21:07

wForget mentioned this pull request Jul 30, 2024

[Doc] Update outdated spark.comet.columnar.shuffle.enabled configuration doc #737

Closed

himadripal pushed a commit to himadripal/datafusion-comet that referenced this pull request Sep 7, 2024

feat: Add COMET_SHUFFLE_MODE config to control Comet shuffle mode (ap…

c78c8c0

…ache#460) (cherry picked from commit 507e475)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add COMET_SHUFFLE_MODE config to control Comet shuffle mode #460

feat: Add COMET_SHUFFLE_MODE config to control Comet shuffle mode #460

viirya commented May 22, 2024 •

edited

Loading

andygrove May 23, 2024

viirya commented May 23, 2024

sunchao left a comment

sunchao May 23, 2024

sunchao May 23, 2024

sunchao May 23, 2024

sunchao May 23, 2024

viirya May 23, 2024

sunchao May 23, 2024

viirya commented May 23, 2024

sunchao left a comment

viirya commented May 23, 2024

viirya commented May 23, 2024

viirya commented May 23, 2024

kazuyukitanimura left a comment

viirya commented May 23, 2024




		`spark.comet.exec.shuffle.mode` to `auto` will let Comet choose the best shuffle mode based on the query plan.

feat: Add COMET_SHUFFLE_MODE config to control Comet shuffle mode #460

feat: Add COMET_SHUFFLE_MODE config to control Comet shuffle mode #460

Conversation

viirya commented May 22, 2024 • edited Loading

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

How are these changes tested?

andygrove May 23, 2024

Choose a reason for hiding this comment

viirya commented May 23, 2024

sunchao left a comment

Choose a reason for hiding this comment

sunchao May 23, 2024

Choose a reason for hiding this comment

sunchao May 23, 2024

Choose a reason for hiding this comment

sunchao May 23, 2024

Choose a reason for hiding this comment

sunchao May 23, 2024

Choose a reason for hiding this comment

viirya May 23, 2024

Choose a reason for hiding this comment

sunchao May 23, 2024

Choose a reason for hiding this comment

viirya commented May 23, 2024

sunchao left a comment

Choose a reason for hiding this comment

viirya commented May 23, 2024

viirya commented May 23, 2024

viirya commented May 23, 2024

kazuyukitanimura left a comment

Choose a reason for hiding this comment

viirya commented May 23, 2024

viirya commented May 22, 2024 •

edited

Loading