Add hipstdpar support to BabelStream #195

gsitaram · 2024-05-02T14:58:53Z

This PR adds support for offloading to AMD GPUs when C++ standard parallel algorithms are invoked with the par_unseq execution policy. To trigger GPU offload of all parallel algorithms, the --hipstdpar flag must be provided at compile time. For GPU targets other than the current default of gfx906, the --offload-arch=<arch_string> option must also be provided at compile time.
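As a rough illustration of the kind of code this affects, here is a minimal sketch of a STREAM-style triad written with a standard algorithm and the par_unseq policy. The function name `triad` and its signature are hypothetical and are not taken from the BabelStream sources; the actual std-data and std-indices kernels may be structured differently.

```cpp
// Minimal sketch (hypothetical helper, not BabelStream's actual kernel):
// a triad a[i] = b[i] + scalar * c[i] using a C++17 parallel algorithm.
#include <algorithm>
#include <cassert>
#include <execution>
#include <vector>

void triad(std::vector<double>& a, const std::vector<double>& b,
           const std::vector<double>& c, double scalar) {
  // Host-side precondition: all three arrays must have the same length.
  assert(a.size() == b.size() && b.size() == c.size());

  // The par_unseq policy is what makes this call a candidate for offload
  // when the translation unit is compiled with --hipstdpar. Without that
  // flag, the same source runs on the host.
  std::transform(std::execution::par_unseq, b.begin(), b.end(), c.begin(),
                 a.begin(),
                 [scalar](double bi, double ci) { return bi + scalar * ci; });
}
```

Calling `triad(a, b, c, 0.5)` on equally sized vectors fills `a` element by element. The source does not change between the host and GPU builds; only the compile flags described above (--hipstdpar, plus --offload-arch for non-default targets) differ.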

With ROCm 6.1.0, the build commands for an AMD Instinct MI200 series GPU may look like the following:

cmake -Bbuild -H. -DMODEL=std-data -DCMAKE_CXX_COMPILER=hipcc -DCLANG_OFFLOAD=gfx90a
cmake --build build

Remember to set the HSA_XNACK environment variable to enable address translation and page migration (where applicable) when running std-data-stream or std-indices-stream:

export HSA_XNACK=1


tomdeakin · 2024-05-13T16:55:29Z

src/std-indices/model.cmake

@@ -50,4 +66,13 @@ macro(setup)
        register_definitions(USE_ONEDPL)
        register_link_library(oneDPL)
    endif ()
+    if (CLANG_OFFLOAD)


I'm not totally sold on the name of this definition: CLANG is not specific to AMD, but the CLANG_FLAGS set below definitely are. Can we please rename this to something more appropriate?

gsitaram · 2024-05-13 (edited)

Would replacing CLANG_OFFLOAD with AMDGPU_TARGET_OFFLOAD and CLANG_FLAGS with AMDGPU_TARGET_OFFLOAD_FLAGS be acceptable?

gsitaram · 2024-05-28

Hi @tomdeakin, please confirm whether the above suggestion would work, or recommend something better. I can make the changes accordingly.

tomdeakin · 2024-05-13T17:06:19Z

It's great to see hipstdpar working, so let's work to get this merged in. Thanks for the contributions.


gsitaram · 2024-06-20T14:49:21Z

Hi @tomdeakin, @afanfa and I have made the changes requested. Please check and approve if everything looks okay.
Thanks!

gonzalobg · 2024-08-14T07:37:26Z

Added this PR with some fixes to #202.

gsitaram and others added 3 commits April 29, 2024 20:51

Add support for hipstdpar

9d4cc72

Remove --hipstdpar-path as it is not needed with ROCm 6.1 onwards

9f67c5f

Merge pull request #1 from gsitaram/hipstdpar

6bd658c

Add support for hipstdpar

tomdeakin requested changes May 13, 2024


tomdeakin changed the base branch from main to develop May 26, 2024 10:37

gsitaram and others added 2 commits June 13, 2024 12:27

Change definition names to be more specific to AMD GPUs

7e5c410

Merge pull request #2 from gsitaram/hipstdpar

cf2e1b3

Change definition names to be more specific to AMD GPUs
