This repository has been archived by the owner on Oct 16, 2023. It is now read-only.

Optimize LayerNorm #161

Merged
merged 1 commit into main from layer_norm on Oct 6, 2023

Conversation

@daemyung commented Oct 6, 2023

πŸ™ Describe the pull request

LayerNorm is optimized by performing only the necessary boundary checks.
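
As a minimal self-contained sketch of the pattern (a hypothetical row-mean kernel, not this PR's LayerNorm kernel): with Triton block pointers, tl.load takes boundary_check and padding_option="zero", so only the dimension that can actually run out of bounds is checked, and out-of-range lanes come back as zero, which keeps later reductions correct without a tl.where mask.

import torch
import triton
import triton.language as tl


@triton.jit
def row_mean_kernel(output_ptr, input_ptr, y_size, x_size, x_block_size: tl.constexpr):
    row = tl.program_id(0)
    input_block_ptr = tl.make_block_ptr(
        input_ptr,
        shape=(y_size, x_size),
        strides=(x_size, 1),
        offsets=(row, 0),
        block_shape=(1, x_block_size),
        order=(1, 0),
    )
    # Only dim 1 can run past the row end, so it is the only dim checked;
    # out-of-range lanes are padded with zero instead of masked later.
    input = tl.load(input_block_ptr, boundary_check=(1,), padding_option="zero")
    mean = tl.sum(input, 1) / x_size  # zero padding leaves the sum unchanged
    output_block_ptr = tl.make_block_ptr(
        output_ptr,
        shape=(y_size,),
        strides=(1,),
        offsets=(row,),
        block_shape=(1,),
        order=(0,),
    )
    tl.store(output_block_ptr, mean)


x = torch.randn(8, 1000, device="cuda")
out = torch.empty(8, device="cuda")
row_mean_kernel[(8,)](out, x, 8, 1000, x_block_size=1024)
assert torch.allclose(out, x.mean(1), atol=1e-4)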

✅ Checklist

  • Code follows the project's coding conventions and style.
  • Tests have been added or updated to cover the changes.
  • Documentation has been updated, if necessary.

@daemyung added the enhancement (New feature or request) label Oct 6, 2023
@daemyung self-assigned this Oct 6, 2023
@mejai1206 (Contributor) left a comment
LGTM

        grad_norm = weight * grad_output
    else:
        grad_norm = grad_output

    grad_std = tl.sum(grad_norm * centered_mean, 1)
    grad_var = grad_std * -(0.5 * rstd * rstd * rstd) / x_size
    grad_distance = 2 * centered_mean * grad_var
-   grad_centered_mean = tl.where(condition, grad_norm * rstd + grad_distance, 0)
+   grad_centered_mean = grad_norm * rstd + grad_distance
Contributor
Is the result unaffected by removing the where clause that was here?

Author

λ„€ μœ„μ—μ„œ 데이터λ₯Ό λΆˆλŸ¬μ˜¬λ•Œ padding="zero"둜 ν•΄κ²°ν–ˆμ–΄μš”.

@daemyung merged commit 96b0328 into main on Oct 6, 2023
1 check passed
@daemyung deleted the layer_norm branch on October 6, 2023 at 15:07
Labels
enhancement (New feature or request)
Projects
Status: Done
3 participants