Skip to content

Commit

Permalink
[rocm6.3_internal_testing] Fix SWDEV-459623 - 2 (#1629)
Browse files Browse the repository at this point in the history
Fix faulty conflict merge when cherry-picking
aea0386
to rocm6.3_internal_testing

---------

Co-authored-by: Xinya Zhang <[email protected]>
  • Loading branch information
jithunnair-amd and xinyazhang authored Oct 10, 2024
1 parent 7ac294f commit 4df8a01
Showing 1 changed file with 4 additions and 0 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -96,6 +96,10 @@ mha_fwd(const at::Tensor &q, // batch_size x seqlen_q x num_heads x head
int window_size_right,
const bool return_softmax,
std::optional<at::Generator> gen_) {
// Otherwise the kernel will be launched from cuda:0 device
// Cast to char to avoid compiler warning about narrowing
at::hip::HIPGuardMasqueradingAsCUDA device_guard{(char)q.get_device()};

auto stream = at::hip::getCurrentHIPStreamMasqueradingAsCUDA().stream();
check_gpu_arch(stream);

Expand Down

0 comments on commit 4df8a01

Please sign in to comment.