    Repositories list

    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      154192 Updated Nov 25, 2024
    • AMD's graph optimization engine.
      C++
      MIT License
      8718734440 Updated Nov 25, 2024
    • jax

      Public
      Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
      Python
      Apache License 2.0
      2.8k1902 Updated Nov 25, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.7k45024 Updated Nov 25, 2024
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.3k141248 Updated Nov 25, 2024
    • HIP Python Low-level Bindings
      Shell
      MIT License
      31711 Updated Nov 25, 2024
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      12k1233014 Updated Nov 25, 2024
    • TensorFlow ROCm port
      C++
      Apache License 2.0
      74k6888860 Updated Nov 25, 2024
    • CMake modules used within the ROCm libraries
      CMake
      MIT License
      4359513 Updated Nov 25, 2024
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.7k961043 Updated Nov 25, 2024
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      47206742 Updated Nov 25, 2024
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2301.1k25056 Updated Nov 25, 2024
    • rocDecode

      Public
      rocDecode is a high performance video decode SDK for AMD hardware
      C++
      Other
      161323 Updated Nov 25, 2024
    • hipTensor

      Public
      AMD’s C++ library for accelerating tensor primitives
      C++
      MIT License
      173507 Updated Nov 25, 2024
    • Advanced Profiling and Analytics for AMD Hardware
      Python
      MIT License
      501385015 Updated Nov 25, 2024
    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      3894.7k11015 Updated Nov 25, 2024
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k21910339 Updated Nov 25, 2024
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      1222731019 Updated Nov 25, 2024
    • hipBLAS

      Public
      ROCm BLAS marshalling library
      C++
      Other
      7812114 Updated Nov 25, 2024
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      8963964 Updated Nov 25, 2024
    • rocWMMA

      Public
      rocWMMA
      C++
      MIT License
      269223 Updated Nov 25, 2024
    • HIPIFY

      Public
      HIPIFY: Convert CUDA to Portable C++ Code
      C++
      MIT License
      75525200 Updated Nov 25, 2024
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1293212846 Updated Nov 25, 2024
    • rocSPARSE

      Public
      Next generation SPARSE implementation for ROCm platform
      C++
      MIT License
      5611720 Updated Nov 25, 2024
    • Tensile

      Public
      Stretching GPU performance for GEMMs and tensor contractions.
      Python
      MIT License
      15122387 Updated Nov 25, 2024
    • rocMLIR

      Public
      MLIR
      40129117 Updated Nov 25, 2024
    • ROCm Documentation Python package for ReadTheDocs build standardization
      CSS
      Other
      161399 Updated Nov 25, 2024
    • HIP

      Public
      HIP: C++ Heterogeneous-Compute Interface for Portability
      C++
      MIT License
      5393.8k3041 Updated Nov 25, 2024
    • rocAL

      Public
      The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a processing graph programmable by the user.
      C++
      MIT License
      141185 Updated Nov 25, 2024
    • 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
      Python
      Apache License 2.0
      27k414 Updated Nov 25, 2024