[AutoDeploy][Feature]: Spec Dec + Guided Decoding Support #10134

@govind-ramnarayan

Description

🚀 The feature, motivation and pitch

Enabling speculative decoding and guided decoding together causes some complications. Roughly speaking, the target model's sampling needs to apply guided decoding, and the draft tokens also need to be guided; otherwise they will never be accepted.

Currently, AutoDeploy just checks that the two features are not enabled at the same time.
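
To make the interaction concrete, here is a minimal toy sketch in plain Python. It is not AutoDeploy or TensorRT-LLM code: the vocabulary, the grammar in `allowed_ids`, the `toy_logits` model, and the helpers `propose` and `count_accepted` are all hypothetical, and greedy verification stands in for whatever acceptance rule the real sampler uses. It only shows that when the target picks tokens under a grammar mask, an unguided draft that prefers a forbidden token gets nothing accepted, while a draft that applies the same mask can be accepted in full.

```python
from typing import List, Sequence, Set

# Toy vocabulary (illustrative only): token id -> text.
VOCAB = ["{", "x", "y", "}"]


def allowed_ids(prefix: Sequence[int]) -> Set[int]:
    """Hypothetical grammar: '{' first, then 'x' tokens, closed by '}'.

    The token 'y' is never legal, and nothing is legal after '}'.
    """
    if not prefix:
        return {0}
    if prefix[-1] == 3:
        return set()
    return {1, 3}


def toy_logits(prefix: Sequence[int]) -> List[float]:
    """Stand-in for both draft and target models (assumed to agree).

    Without a grammar mask, 'y' (id 2) always has the highest score.
    """
    if len(prefix) < 3:
        return [0.1, 1.0, 2.0, 0.5]
    return [0.1, 0.5, 2.0, 1.0]


def propose(prefix: Sequence[int], k: int, guided: bool) -> List[int]:
    """Greedily draft up to k tokens, optionally under the grammar mask."""
    draft: List[int] = []
    for _ in range(k):
        ctx = list(prefix) + draft
        logits = toy_logits(ctx)
        if guided:
            candidates = allowed_ids(ctx)
            if not candidates:  # grammar already complete
                break
        else:
            candidates = set(range(len(logits)))
        draft.append(max(candidates, key=lambda t: logits[t]))
    return draft


def count_accepted(prefix: Sequence[int], draft: Sequence[int]) -> int:
    """Greedy verification: accept draft tokens while they match the
    target's grammar-masked choice, advancing the grammar state."""
    ctx = list(prefix)
    accepted = 0
    for tok in draft:
        candidates = allowed_ids(ctx)
        if not candidates:
            break
        logits = toy_logits(ctx)
        if tok != max(candidates, key=lambda t: logits[t]):
            break
        ctx.append(tok)
        accepted += 1
    return accepted


unguided = propose([], 4, guided=False)
guided = propose([], 4, guided=True)
print("unguided:", [VOCAB[t] for t in unguided], count_accepted([], unguided))
print("guided:  ", [VOCAB[t] for t in guided], count_accepted([], guided))
```

Note that the grammar state has to be advanced along the draft tokens during both drafting and verification, and rolled back to the last accepted token when a draft is rejected. That bookkeeping is a large part of what makes the combination non-trivial in a real implementation.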

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.

Metadata

Labels

  • AutoDeploy: <NV> AutoDeploy Backend

  • Speculative Decoding: <NV> MTP/Eagle/Medusa/Lookahead/Prompt-Lookup-Decoding/Draft-Target-Model/ReDrafter

  • feature request: New feature or request. This includes new model, dtype, functionality support

Type

No type

Projects

Status

Backlog

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests