Pinned Loading
-
NVIDIA/TensorRT-LLM
NVIDIA/TensorRT-LLM PublicTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
-
huggingface/transformers
huggingface/transformers Public🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
-
NVIDIA/TensorRT-Model-Optimizer
NVIDIA/TensorRT-Model-Optimizer PublicA unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…
-
huggingface/diffusers
huggingface/diffusers Public:hugging_face: Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
-
huggingface/accelerate
huggingface/accelerate Public🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
If the problem persists, check the GitHub status page or contact support.