GPU architecture
CUDA
-
CUDA Sample Code
Topics
- https://developer.nvidia.com/blog/inside-volta/
- https://developer.nvidia.com/blog/cuda-pro-tip-optimized-filtering-warp-aggregated-atomics/
- https://developer.nvidia.com/blog/cooperative-groups/
TensorRT
Triton
TBA
Videos:
-
University of Illinois ECE 408 - Nsight Compute and Nsight Systems:
- Deep Learning for Science and Engineering, George Karniadakis, Professor, Brown University
- DLI
- End-to-End AI for Science