Skip to content
forked from ROCm/Tensile

Stretching GPU performance for GEMMs and tensor contractions.

License

Notifications You must be signed in to change notification settings

ellosel/Tensile

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Tensile is a tool for creating benchmark-driven backend libraries for GEMMs, GEMM-like problems (such as batched GEMM), and general N-dimensional tensor contractions on a GPU. The Tensile library is mainly used as a backend library for rocBLAS. Tensile acts as the performance backbone for a wide variety of 'compute' applications running on AMD GPUs.

Note

The published documentation is available at Tensile in an organized, easy-to-read format, with search and a table of contents. The documentation source files reside in the Tensile/docs/src folder of this repository. As with all ROCm projects, the documentation is open source. For more information on contributing to the documentation, see Contribute to ROCm documentation.

About

Stretching GPU performance for GEMMs and tensor contractions.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 51.5%
  • C++ 29.3%
  • Assembly 15.1%
  • TeX 1.4%
  • Shell 1.2%
  • CMake 0.9%
  • Other 0.6%