Skip to content
Change the repository type filter

All

    Repositories list

    • GEAK

      Public
      It is an LLM-based AI agent, which can write correct and efficient gpu kernels automatically.
      Python
      MIT License
      1381819Updated Mar 28, 2026Mar 28, 2026
    • Primus

      Public
      Python
      Other
      2882625Updated Mar 28, 2026Mar 28, 2026
    • DLRM implementation for Primus
      Python
      MIT License
      0000Updated Mar 28, 2026Mar 28, 2026
    • Primus-SaFE(Stability and Fault Endurance)
      Go
      Other
      15409Updated Mar 28, 2026Mar 28, 2026
    • TraceLens

      Public
      Automating analysis from trace files
      Python
      MIT License
      9669413Updated Mar 28, 2026Mar 28, 2026
    • Magpie

      Public
      A lightweight, general-purpose framework for evaluating GPU kernel correctness and performance.
      Python
      MIT License
      54911Updated Mar 27, 2026Mar 27, 2026
    • AgentKernelArena provides an end-to-end siloed-benchmarking environment where different LLM-powered agents—such as Cursor Agent, Claude Code, Codex, SWE-agent, …
      Python
      Apache License 2.0
      313121Updated Mar 27, 2026Mar 27, 2026
    • Python
      Other
      1364105Updated Mar 27, 2026Mar 27, 2026
    • Apex

      Public
      Agents, and RL environment, for optimizing GPU kernels on AMD ROCm using LLM agents. Benchmarks LLM serving workloads end-to-end, profiles bottleneck kernels, o…
      Python
      MIT License
      75001Updated Mar 26, 2026Mar 26, 2026
    • FLy

      Public
      Python
      MIT License
      0010Updated Mar 25, 2026Mar 25, 2026
    • Toolkit for launching and observing MaxText training on Slurm-managed GPU clusters
      Shell
      MIT License
      22701Updated Mar 22, 2026Mar 22, 2026
    • Reference implementations of MLPerf® inference benchmarks
      Python
      Apache License 2.0
      616000Updated Mar 16, 2026Mar 16, 2026
    • PARD

      Public
      PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation (ICLR 26)
      Python
      MIT License
      11910Updated Mar 13, 2026Mar 13, 2026
    • DUET-VLM

      Public
      DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference
      Python
      Apache License 2.0
      12111Updated Mar 5, 2026Mar 5, 2026
    • nixl

      Public
      NVIDIA Inference Xfer Library (NIXL)
      C++
      Other
      278100Updated Feb 24, 2026Feb 24, 2026
    • dynamo

      Public
      A Datacenter Scale Distributed Inference Serving Framework
      Rust
      Other
      963000Updated Feb 24, 2026Feb 24, 2026
    • Nitro-E

      Public
      Python
      MIT License
      1011321Updated Feb 24, 2026Feb 24, 2026
    • Repository for Showcasing DLRM v2 Functionality on AMD
      Python
      MIT License
      0000Updated Feb 16, 2026Feb 16, 2026
    • Python
      Apache License 2.0
      01400Updated Feb 10, 2026Feb 10, 2026
    • For world model code developing and releasing.
      Python
      Other
      44500Updated Feb 6, 2026Feb 6, 2026
    • TraceLens-inference

      Public archive
      Automating analysis from trace files
      Python
      MIT License
      9000Updated Feb 5, 2026Feb 5, 2026
    • axlearn

      Public
      An Extensible Deep Learning Library
      Python
      Apache License 2.0
      402100Updated Jan 29, 2026Jan 29, 2026
    • Repo containing artifacts for Neurips 2025 tutorial- How to Build Agents to Generate Kernels for Faster LLMs (and Other Models!)
      Jupyter Notebook
      MIT License
      21400Updated Jan 23, 2026Jan 23, 2026
    • Python
      Other
      0710Updated Jan 22, 2026Jan 22, 2026
    • AMD 0.9B efficient text to video diffusion model
      Python
      Other
      64411Updated Jan 12, 2026Jan 12, 2026
    • This is a short course covering GPU optimization techniques for LLM inference
      Python
      MIT License
      0000Updated Jan 11, 2026Jan 11, 2026
    • Examples of training autodrive models in ROCm
      Python
      Other
      0300Updated Jan 9, 2026Jan 9, 2026
    • GEAK-eval

      Public
      Python
      61190Updated Dec 24, 2025Dec 24, 2025
    • Synthetic data generation pipeline, finetuning and evaluation scripts.
      Python
      Other
      1110Updated Dec 24, 2025Dec 24, 2025
    • Python
      Apache License 2.0
      1110Updated Dec 16, 2025Dec 16, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.