-
Notifications
You must be signed in to change notification settings - Fork 409
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bitonic Sort on fails on CUDA with (error code an illegal memory access was encountered) #314
Comments
I am here from the main Bend repo with issue #Bitonic Sort example failed with GPU kernel error. I too am having a kernel memory issue: $ ./sorter # The same as prime-run due to environment variables already being set.
| Failed to launch kernels (error code an illegal memory access was encountered)! I am also running a mobile gpu where I am getting this issue. Some GPU properties and info from exec:
Specs:
As well as that, running it through |
I had the same issue. I cloned, HVM changed LNet seeting according to #283 , but the current repo V2.0.14 does not work with bend, and I do not know where V2.0.13 (for bend) is. I never used cargo so excuse me if I am doing some black magic here, but this is how I fixed it for bend: mkdir ~/hvmtmp
cd ~/hvmtmp
cargo init
cargo add hvm@=2.0.13
cargo vendor vendor
cd vendor/hvm You are now inside the source of hvm V2.0.13. Open and edit // Local Net
const u32 L_NODE_LEN = 0x2000/4;
const u32 L_VARS_LEN = 0x2000/4;
struct LNet {
Pair node_buf[L_NODE_LEN];
Port vars_buf[L_VARS_LEN];
}; Now go back to hvm V2.0.13 you downloaded and install it: cd ~/hvmtmp/vendor/hvm
cargo +nightly install --path . This should work, you can now delete |
I wonder why they needed |
To report more about this, on my GTX 1080Ti (using WSL2, cuda toolkit 12.3), I have tried:
Only |
all tries not work for me, on my GTX 1050 Ti.
test example:
|
Did anyone manage to find some |
Originally from HigherOrderCO/Bend#364 by user @ethanbarry
Description
When I run the compiled CUDA bitonic sorter example (linked in the README) I get this error:
Failed to launch kernels (error code an illegal memory access was encountered)!
To Reproduce
Steps to reproduce the behavior:
Expected behavior
The program runs on the GPU.
Desktop (please complete the following information):
Additional context
The program runs using the C codegen backend, but with the CUDA backend, it seems to fail regardless of what I do. If anyone is curious about the prime-run command, it's really just a script that forces the dGPU to handle a task - nothing fancy.
The text was updated successfully, but these errors were encountered: