Bitonic Sort example fails with GPU kernel error. #364

ethanbarry · 2024-05-18T13:21:15Z

Description

When I run the compiled CUDA bitonic sorter example (linked in the README) I get this error:

Failed to launch kernels (error code an illegal memory access was encountered)!

To Reproduce

Steps to reproduce the behavior:

bend gen-cu sorter.bend > sorter.cu
nvcc sorter.cu -o sorter
prime-run ./sorter (Launches it on the GPU for Arch Linux.)
Error recieved.

Expected behavior

The program runs on the GPU.

Desktop (please complete the following information):

OS: Linux (Arch 6.9.1-arch1-1)
CPU: Intel i7-11800H
GPU: RTX 3050 Ti Mobile
GPU Driver: Nvidia open kernel modules v550.78
CUDA release 12.4, V12.4.131

Additional context

The program runs using the C codegen backend, but with the CUDA backend, it seems to fail regardless of what I do. If anyone is curious about the prime-run command, it's really just a script that forces the dGPU to handle a task - nothing fancy.

The text was updated successfully, but these errors were encountered:

developedby · 2024-05-20T13:41:33Z

I'm moving this issue to the HVM repository HigherOrderCO/HVM#314

2lian · 2024-05-22T03:36:00Z

I am re-posting my fix here

The discussion is here on the hvm github. Re-posting because, hvm V2.0.13 is required and this is not the version of the hvm github, so this fix is specific to bend. (github V2.0.14 does not work at all with my bend)

I had the same issue. I cloned, HVM changed LNet seeting according to #283 , but the current repo V2.0.14 does not work with bend, and I do not know where V2.0.13 (for bend) is.

I never used cargo so excuse me if I am doing some black magic here, but this is how I fixed it for bend:

mkdir ~/hvmtmp
cd ~/hvmtmp
cargo init
cargo add hvm@=2.0.13
cargo vendor vendor
cd vendor/hvm

You are now inside the source of hvm V2.0.13.

Open and edit src/hvm.cu. Line 334 reduce L_NODE_LEN and L_VARS_LEN, but do not reduce too much. This value works on my GTX 1080Ti:

// Local Net
const u32 L_NODE_LEN = 0x2000/4;
const u32 L_VARS_LEN = 0x2000/4;
struct LNet {
  Pair node_buf[L_NODE_LEN];
  Port vars_buf[L_VARS_LEN];
};

Now go back to hvm V2.0.13 you downloaded and install it:

cd ~/hvmtmp/vendor/hvm
cargo +nightly install --path .

This should work, you can now delete ~/hvmtmp.

ethanbarry added the bug Something isn't working label May 18, 2024

developedby mentioned this issue May 20, 2024

Bitonic Sort on fails on CUDA with (error code an illegal memory access was encountered) HigherOrderCO/HVM#314

Open

developedby added the HVM About the HVM label May 20, 2024

TimotejFasiang mentioned this issue Jun 8, 2024

Execution is stuck until termination when running on CUDA in WSL2 #538

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bitonic Sort example fails with GPU kernel error. #364

Bitonic Sort example fails with GPU kernel error. #364

ethanbarry commented May 18, 2024

developedby commented May 20, 2024

2lian commented May 22, 2024 •

edited

Loading

Bitonic Sort example fails with GPU kernel error. #364

Bitonic Sort example fails with GPU kernel error. #364

Comments

ethanbarry commented May 18, 2024

Description

To Reproduce

Expected behavior

Desktop (please complete the following information):

Additional context

developedby commented May 20, 2024

2lian commented May 22, 2024 • edited Loading

I am re-posting my fix here

2lian commented May 22, 2024 •

edited

Loading