Skip to content

Commit 6265a5e

Browse files
lfr-0531dominicshanshan
authored andcommitted
[nvbug/5354946][fix] Fix mtp vanilla draft inputs (NVIDIA#5568)
Signed-off-by: Fanrong Li <[email protected]>
1 parent de23051 commit 6265a5e

File tree

1 file changed

+0
-1
lines changed
  • tensorrt_llm/_torch/speculative

1 file changed

+0
-1
lines changed

tensorrt_llm/_torch/speculative/mtp.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -522,7 +522,6 @@ def forward(
522522
"position_ids": draft_inputs["position_ids"],
523523
"hidden_states": draft_hidden_states,
524524
"attn_metadata": draft_inputs["attn_metadata"],
525-
"spec_metadata": draft_inputs["spec_metadata"],
526525
}
527526
next_draft_tokens = torch.stack(next_draft_tokens, dim=1)
528527

0 commit comments

Comments
 (0)