Pull requests: ROCm/vllm

Pull requests list

Integrate mxfp4 MoE native kernels
#632 opened Aug 15, 2025 by mawong-amd
[Model] Add GPT-OSS model code and config
#625 opened Aug 7, 2025 by ashishtanwer
add Fused_rms_quant for deepseek_v2 model
#611 opened Jul 29, 2025 by ZJLi2013
[FEAT] [ROCm] Shared Experts Aiter
#605 opened Jul 25, 2025 by tjtanaavllm
add fused fp8 bmm
#604 opened Jul 25, 2025 by k50112113
Update fp8 paged attention
#592 opened Jul 9, 2025 by amd-xiaoyu12 (Draft)
Update test-template.j2
#579 opened Jun 16, 2025 by okakarpa
Disable skynny gemms by default
#568 opened Jun 5, 2025 by k-artem
Patch to run AITER 0507 (label: stale)
#541 opened May 8, 2025 by qli88
Remap fp8 kv-scale names for Deepseek (label: stale)
#535 opened May 1, 2025 by sstamenk
Updated README.md with April 29 results (label: stale)
#526 opened Apr 27, 2025 by Mcirino1
BF16 Skinny Optimization (label: stale)
#520 opened Apr 22, 2025 by amd-hhashemi
Test Queues
#456 opened Feb 28, 2025 by dhonnappa-amd (Draft)
Enable custom paged attention kernel for Navi 3/4
#446 opened Feb 24, 2025 by hyoon1