[Layer] Improve forwarding logic of ConcatLayer @open sesame 08/08 18:22 #2702
Conversation
This PR updates the current ConcatLayer forwarding for faster computation.

**Changes proposed in this PR:**
- Utilize the Tensor::concat() operation to perform forwarding and replace manual mapping and copying.

**Self-evaluation:**
1. Build test: [X] Passed [ ] Failed [ ] Skipped
2. Run test: [X] Passed [ ] Failed [ ] Skipped

Signed-off-by: Donghyeon Jeong <[email protected]>
📝 TAOS-CI Version: 1.5.20200925. Thank you for submitting PR #2702. Please follow the 1 commit/1 PR (one commit per PR) policy to get comments quickly from reviewers. Your PR must pass all verification processes of cibot before starting a review process from reviewers. If you are a new member joining this project, please read the manuals in the documentation folder and wiki page. In order to monitor the progress status of your PR in more detail, visit http://ci.nnstreamer.ai/.
```cpp
if (out_dim[axis] != in_dim[axis]) {
  /// @todo Currently a new output tensor is created. This can be optimized.
  Tensor result = Tensor::cat(input_tensors, axis);
  output.copy(result);
```
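For context, here is a minimal sketch of how this cat-based path could sit inside the layer's forwarding. Only the Tensor::cat() and output.copy() lines come from the hunk above; the RunLayerContext accessors, the axis handling, and the reshape bookkeeping (shown in a later hunk) are assumptions sketched from nntrainer's usual layer API, not part of this PR.

```cpp
// Sketch only, inside namespace nntrainer as in concat_layer.cpp
// (includes from the layer sources are assumed).
void ConcatLayer::forwarding(RunLayerContext &context, bool training) {
  Tensor &output = context.getOutput(SINGLE_INOUT_IDX);

  const int axis = 3; // assumption: the real layer derives the axis from its properties

  std::vector<Tensor> input_tensors;
  input_tensors.reserve(context.getNumInputs());
  for (unsigned int idx = 0; idx < context.getNumInputs(); ++idx) {
    // push_back copies the Tensor handle; the underlying buffer is shared
    input_tensors.push_back(context.getInput(idx));
  }

  /// @todo Currently a new output tensor is created. This can be optimized.
  Tensor result = Tensor::cat(input_tensors, axis);
  output.copy(result);
}
```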
copy() is unnecessary here. This will be replaced with in-place ops in a later PR.
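For readers outside the codebase, a generic (non-nntrainer) illustration of the difference that follow-up would make; the function names and float buffers here are purely illustrative, not this layer's code.

```cpp
#include <algorithm>
#include <vector>

// Generic illustration only. `out` is pre-sized to a.size() + b.size() by the caller.

// Pattern used in this PR: build a temporary result, then copy it into `out`.
void concat_then_copy(const std::vector<float> &a, const std::vector<float> &b,
                      std::vector<float> &out) {
  std::vector<float> result; // extra allocation
  result.reserve(a.size() + b.size());
  result.insert(result.end(), a.begin(), a.end());
  result.insert(result.end(), b.begin(), b.end());
  std::copy(result.begin(), result.end(), out.begin()); // extra copy
}

// In-place direction hinted at above: write straight into `out`.
void concat_in_place(const std::vector<float> &a, const std::vector<float> &b,
                     std::vector<float> &out) {
  std::copy(a.begin(), a.end(), out.begin());
  std::copy(b.begin(), b.end(), out.begin() + a.size());
}
```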
@djeong20, 💯 All CI checkers are successfully verified. Thanks.
```cpp
input.reshape(irh);
original_input_dims.push_back(input.getDim());
input.reshape(input_reshape_helper[idx]);
input_tensors.push_back(input);
```
Just to be sure: when we do this push_back, there is no deep copy happening, thanks to

nntrainer/nntrainer/tensor/tensor.cpp
Line 102 in 32d901c
Tensor &Tensor::operator=(const Tensor &rhs) {

Also, could you check whether we are no longer using leading_helper_dim at #L80?
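For reference, a generic, self-contained illustration of why such a push_back does not deep-copy the underlying buffer when the element type only holds a shared handle to its data; MiniTensor is purely hypothetical and only mimics the shared-buffer behaviour of the operator= linked above.

```cpp
#include <cassert>
#include <memory>
#include <vector>

// Hypothetical stand-in for a tensor whose copies share the same buffer.
struct MiniTensor {
  std::shared_ptr<std::vector<float>> data;
};

int main() {
  MiniTensor t{std::make_shared<std::vector<float>>(196 * 256, 0.0f)};

  std::vector<MiniTensor> inputs;
  inputs.push_back(t); // copies the handle, not the 196*256 floats

  // Both handles point at the same underlying buffer: no deep copy happened.
  assert(inputs[0].data.get() == t.data.get());
  return 0;
}
```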
It seems leading_helper_dim is used in setBatch()!
LGTM!
Great work!
Great Work!!!!
Awesome!👍
Description

This pull request aims to optimize the performance of forwarding() in ConcatLayer. This is accomplished by improving the current concatenation logic, utilizing the tensor operation concat() instead of Map(). Note that this optimization is only for forwarding(); calcDeriv() would be improved in a later PR.

Result
Before
After
Note
input 0: 1:1:196:256 [ FP16 : NCHW ]
input 1: 1:1:196:1024 [ FP16 : NCHW ]
output: 1:1:196:1280 [ FP16 : NCHW ]
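As a sanity check on these shapes, concatenating along the innermost (width) axis simply sums that dimension: 256 + 1024 = 1280, which matches the output. A minimal sketch of the corresponding call follows; the TensorDim(batch, channel, height, width) constructor, the include path, and the default allocation are assumptions, while the static Tensor::cat(tensors, axis) form is taken from the diff above.

```cpp
#include <vector>

#include <tensor.h> // nntrainer Tensor / TensorDim (include path assumed)

using nntrainer::Tensor;
using nntrainer::TensorDim;

int main() {
  Tensor in0(TensorDim(1, 1, 196, 256));  // input 0: 1:1:196:256
  Tensor in1(TensorDim(1, 1, 196, 1024)); // input 1: 1:1:196:1024

  std::vector<Tensor> inputs = {in0, in1};
  Tensor out = Tensor::cat(inputs, 3); // concatenate along width (axis 3)

  // Expected output shape: 1:1:196:1280 (256 + 1024 = 1280)
  return 0;
}
```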