Skip to content

[RFC]: vLLM-Omni 2026 Q1 Roadmap #677

@hsliuustc0106

Description

@hsliuustc0106

Motivation.

This issue outlines the development roadmap for vllm-omni in Q1 2026. Our primary focus for this quarter includes architectural refactoring to support dynamic execution graphs, expanding support for diffusion and omni models, and introducing disaggregated serving capabilities. Any contribution and discussion is welcome :)

Please attach your design doc using this template in your RFC :)

Proposed Change.

🛠 Core Architecture & Infrastructure

Refactoring the core execution pipeline to support more dynamic and efficient workflows.

P0:

🚀 Disaggregated Serving & Distributed Systems

Enhancing serving capabilities for large-scale and multi-node deployments.

P0:

🎨 Feature: Diffusion Pipeline #814

Major feature additions to improve the flexibility and performance of image/video generation.

⚡️ Feature: Omni(AR+DiT) Pipeline

Enhancing the multimodal interaction capabilities.

P0:

🤖 Reinforcement Learning & Model Support

Expanding the ecosystem of supported models and training feedback loops.

Targeted Model Optimizations

📊 Benchmarks, Metrics & Logging

Improving observability and establishing performance baselines.
P0:

P1:

🧪 CI/CD & Quality Assurance

please check our design doc

Feedback Period.

No response

CC List.

@ywang96 @Gaohan123 @tzhouam @ZJY0516 @DarkLight1337 @Isotr0py @SamitHuang @david6666666

Any Other Things.

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Metadata

Metadata

Labels

documentationImprovements or additions to documentationhelp wantedExtra attention is needed

Type

No fields configured for Task.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions