Repositories list
72 repositories
llmq (Public)
    Quantized LLM training in pure CUDA/C++.
local_platinum_bench (Public)
MoE-Quant (Public)
QuEST (Public)
EvoPress (Public)
Quartet (Public)
FP-Quant (Public)
GridSearcher (Public)
nanochat (Public)
CAGE (Public)
    QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning
torchtitan (Public)
nanochat-qat (Public)
CAGE-ao (Public)
unified-sc-laws (Public)
ISTA-DASLab-Optimizers (Public)
gptq-gguf-toolkit (Public)
influence_distillation (Public)
    Official implementation of Influence Distillation: https://www.arxiv.org/abs/2505.19051
PanzaMail (Public)
HALO-anon (Public)
torch_cgx (Public)
gemm-int8 (Public)
DarwinLM (Public)
ScalableMNN (Public)
SPADE (Public)
HALO (Public)
    HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arxiv.org/abs/2501.02625
gemm-fp8 (Public)
MicroAdam (Public)
llm-foundry (Public)