You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
On branch-2.2, compiling the Cascaded algorithm with the -G flag changes the static shared memory alignment behavior, causing misalignment errors.
This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.
This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.
Describe the bug
On branch-2.2, compiling the Cascaded algorithm with the -G flag changes the static shared memory alignment behavior, causing misalignment errors.
Steps/Code to reproduce bug
To compile successfully, I reduced the default_chunk_size in default_chunk_size from 4096 to 2048.
Expected behavior
The test should pass without errors.
Environment details (please complete the following information):
Environment location:
Ubuntu-22.04
Driver Version: 555.99
CUDA Version: 12.5
NVIDIA GeForce RTX 3080
Method of nvCOMP install: branch-2.2 source code
Additional context
After debugging, I explicitly declared alignment for shared memory allocation in the following files:
CascadedHlifKernels.cu:122
CascadedKernels.cuh:801
After making these changes, all tests in test_cascaded.cpp passed. I believe this dependency on compiler optimization for correctness is a bug.
The text was updated successfully, but these errors were encountered: