Pinned repositories
- dynamo (forked from ai-dynamo/dynamo, Rust): A Datacenter-Scale Distributed Inference Serving Framework
- InferenceX (forked from SemiAnalysisAI/InferenceX, Python): Open Source Continuous Inference Benchmarking of Qwen3.5, DeepSeek, and GPTOSS on GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100, and soon™ TPUv6e/v7/Trainium2/3
- sglang-amd (forked from sgl-project/sglang, Python): SGLang is a high-performance serving framework for large language models and multimodal models.
- vllm-amd (forked from vllm-project/vllm, Python): A high-throughput and memory-efficient inference and serving engine for LLMs


