feat: Improve shuffle metrics (second attempt) #1175

andygrove · 2024-12-17T16:53:19Z

Which issue does this PR close?

N/A

This PR replaces #1173

Rationale for this change

Changes:

Fix: Report write time accurately (this is now just the time for writing to disk but it previously included some of the IPC encoding time)
New: Number of input batches
New: Encoding and compression time
New: Total time spent in native shuffle code

Before

After

What changes are included in this PR?

How are these changes tested?

andygrove · 2024-12-17T17:40:12Z

@mbutrovich @parthchandra fyi

codecov-commenter · 2024-12-17T17:59:06Z

Codecov Report

Attention: Patch coverage is 76.92308% with 3 lines in your changes missing coverage. Please review.

Project coverage is 34.32%. Comparing base (95727aa) to head (6adb04c).
Report is 14 commits behind head on main.

Files with missing lines	Patch %	Lines
...apache/spark/sql/comet/CometCollectLimitExec.scala	50.00%	0 Missing and 1 partial ⚠️
...ark/sql/comet/CometTakeOrderedAndProjectExec.scala	50.00%	0 Missing and 1 partial ⚠️
...t/execution/shuffle/CometShuffleExchangeExec.scala	80.00%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##               main    #1175      +/-   ##
============================================
- Coverage     34.32%   34.32%   -0.01%     
  Complexity      899      899              
============================================
  Files           115      115              
  Lines         43500    43506       +6     
  Branches       9496     9498       +2     
============================================
+ Hits          14931    14932       +1     
- Misses        25659    25661       +2     
- Partials       2910     2913       +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

parthchandra · 2024-12-17T18:37:32Z

Just for clarification - what's the relation between shuffle write time, encoding and compression total time, and native shuffle total time?
I would think shuffle write time + encoding and compression total time = native shuffle total time, but that does not seem to be the case?

andygrove · 2024-12-17T19:06:57Z

Just for clarification - what's the relation between shuffle write time, encoding and compression total time, and native shuffle total time? I would think shuffle write time + encoding and compression total time = native shuffle total time, but that does not seem to be the case?

There is also evaluating the partition expressions (typically very fast if they are just column references) and then the time to actually split the batches into partitions.

parthchandra · 2024-12-17T21:43:49Z

Just for clarification - what's the relation between shuffle write time, encoding and compression total time, and native shuffle total time? I would think shuffle write time + encoding and compression total time = native shuffle total time, but that does not seem to be the case?

There is also evaluating the partition expressions (typically very fast if they are just column references) and then the time to actually split the batches into partitions.

From the above screenshot, shuffle write time + encoding and compression total time = 18.4s and native shuffle total time=28.8s, so there is a difference of 10.4s which is substantial. Wondering if we are missing something.

Nonetheless, the PR certainly improves on the current.

andygrove · 2024-12-17T22:13:39Z

There is also interaction with the memory pool, which makes JNI calls into synchronized code in the JVM.

I will see if I can make the metrics more complete in this PR.

andygrove · 2024-12-17T23:11:00Z

@parthchandra The numbers almost add up now.

andygrove · 2024-12-17T23:15:37Z

Here is Gluten's equivalent for comparison:

parthchandra · 2024-12-17T23:46:37Z

@parthchandra The numbers almost add up now.

Brilliant!

andygrove added 2 commits December 17, 2024 09:46

improve shuffle metrics

7150c87

docs

6adb04c

andygrove changed the title ~~[ignore] improve shuffle metrics second attempt~~ feat: Improve shuffle metrics (second attempt) Dec 17, 2024

andygrove mentioned this pull request Dec 17, 2024

feat: Add additional metrics for shuffle write #1173

Closed

andygrove marked this pull request as ready for review December 17, 2024 17:39

andygrove requested review from viirya and kazuyukitanimura December 17, 2024 17:39

andygrove marked this pull request as draft December 17, 2024 22:43

andygrove added 2 commits December 17, 2024 16:05

more metrics

2782a9f

refactor

e37d039

andygrove marked this pull request as ready for review December 17, 2024 23:11

parthchandra approved these changes Dec 18, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Improve shuffle metrics (second attempt) #1175

feat: Improve shuffle metrics (second attempt) #1175

andygrove commented Dec 17, 2024 •

edited

Loading

andygrove commented Dec 17, 2024

codecov-commenter commented Dec 17, 2024

parthchandra commented Dec 17, 2024

andygrove commented Dec 17, 2024

parthchandra commented Dec 17, 2024

andygrove commented Dec 17, 2024

andygrove commented Dec 17, 2024

andygrove commented Dec 17, 2024

parthchandra commented Dec 17, 2024

feat: Improve shuffle metrics (second attempt) #1175

Are you sure you want to change the base?

feat: Improve shuffle metrics (second attempt) #1175

Conversation

andygrove commented Dec 17, 2024 • edited Loading

Which issue does this PR close?

Rationale for this change

Before

After

What changes are included in this PR?

How are these changes tested?

andygrove commented Dec 17, 2024

codecov-commenter commented Dec 17, 2024

Codecov Report

parthchandra commented Dec 17, 2024

andygrove commented Dec 17, 2024

parthchandra commented Dec 17, 2024

andygrove commented Dec 17, 2024

andygrove commented Dec 17, 2024

andygrove commented Dec 17, 2024

parthchandra commented Dec 17, 2024

andygrove commented Dec 17, 2024 •

edited

Loading