I have a simple example Fortran code (Poisson-Solver) that uses:
OpenMP GPU offloading
OpenACC GPU offloading
as well as their CPU-threaded counterparts
I use CMake as a build system, and it would be great if we could optimize the whole code without needing to write fake data inputs or separate my kernel from the main code base.
It would possibly be good to support make as well.
Poisson-Solver provides a slightly more realistic example workload than vector add, and I mostly use it to test compiler support for OpenMP and OpenACC GPU offload directives.
I am really interested in the trade-off between energy consumption and performance (wall time), so I would be extremely interested to see how much this simple example is affected by the optimizations suggested by kernel_tuner.
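To make the trade-off concrete, here is a minimal sketch of how I would compare tuned configurations once both wall time and energy have been measured. It is plain Python and does not use any kernel_tuner API. The `Measurement` type, the `pareto_front` helper, the configuration labels, and all numbers are hypothetical placeholders, not results from Poisson-Solver.

```python
from typing import List, NamedTuple


class Measurement(NamedTuple):
    """One tuned configuration (hypothetical record layout)."""
    label: str
    time_s: float    # wall time in seconds
    energy_j: float  # energy in joules


def pareto_front(measurements: List[Measurement]) -> List[Measurement]:
    """Return configurations that no other configuration beats on both axes."""
    front = []
    for m in measurements:
        # m is dominated if another point is no worse on both metrics
        # and strictly better on at least one of them.
        dominated = any(
            o.time_s <= m.time_s
            and o.energy_j <= m.energy_j
            and (o.time_s < m.time_s or o.energy_j < m.energy_j)
            for o in measurements
        )
        if not dominated:
            front.append(m)
    return sorted(front, key=lambda m: m.time_s)


# Placeholder numbers for illustration only, not real measurements.
samples = [
    Measurement("config_a", 1.0, 300.0),
    Measurement("config_b", 1.2, 250.0),
    Measurement("config_c", 1.5, 320.0),
    Measurement("config_d", 2.0, 200.0),
]

front_labels = [m.label for m in pareto_front(samples)]
print(front_labels)
```

With these placeholder values, `config_c` is dropped because `config_a` is both faster and uses less energy. The other three remain as candidate trade-offs.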
If you have any questions regarding my code, or if you need help building or compiling it, please let me know. Feel free to submit issues on my repo or reply to this issue.
@isazi we spoke at the RSECon23 workshop.