[feat]: support logit_bias #5354

xq25478 · 2025-06-19T05:41:02Z

feat(openai protocol):support logitbias

Summary by CodeRabbit

New Features
- Added support for the logit_bias parameter in both chat and completion APIs, allowing users to influence token generation by biasing specific tokens.
Bug Fixes
- Removed previous restriction that disallowed the use of logit_bias in chat completions.
Tests
- Introduced new asynchronous tests to verify correct handling and error reporting for valid and invalid logit_bias inputs in chat and completion APIs.
- Added a timeout constraint to an end-to-end chat test.

Copilot

Pull Request Overview

A concise description of the purpose of the PR, followed by summarized bullets of changes

Add support for logit_bias by integrating a new logits processor into sampling parameters
Remove the old validator blocking logit_bias
Define LogitBiasLogitsProcessor to apply per-token biases at generation time

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
tensorrt_llm/serve/openai_protocol.py	Integrated `logit_bias` into both completion and chat sampling
tensorrt_llm/sampling_params.py	Added `LogitBiasLogitsProcessor` class and updated imports to include `Dict`

tensorrt_llm/serve/openai_protocol.py

LinPoly

Thanks for contribution! I leave a few comment about LogitBiasLogitsProcessor's implementation, we also need to add tests, vLLM has tests for chat and for completion, these could be your reference. Ping me if you need any help.

tensorrt_llm/sampling_params.py

LinPoly · 2025-07-04T07:04:47Z

@xq25478 Can you please sign off your commits following the steps here. And I think it is acceptable to put the logit processor implementation in sampling_params.py for now, I'll loop in other guys for more comments if necessary.

LinPoly · 2025-07-04T07:43:33Z

Add @netanel-haber as Nave suggested.

xq25478 · 2025-07-07T07:51:25Z

Thanks for contribution! I leave a few comment about LogitBiasLogitsProcessor's implementation, we also need to add tests, vLLM has tests for chat and for completion, these could be your reference. Ping me if you need any help.

test code has been added.

tensorrt_llm/sampling_params.py

tests/unittest/llmapi/apps/_test_openai_completions.py

tests/unittest/llmapi/apps/_test_openai_chat.py

tests/unittest/llmapi/apps/_test_openai_completions.py

tests/unittest/llmapi/apps/_test_openai_chat.py

tensorrt_llm/sampling_params.py

LinPoly · 2025-07-10T12:20:53Z

/bot run

tensorrt-cicd · 2025-07-10T12:26:04Z

PR_Github #11549 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-10T12:46:08Z

PR_Github #11549 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #8552 completed with status: 'FAILURE'

venkywonka · 2025-07-10T16:22:53Z

@xq25478 would you mind rebasing this (looks it can't be auto-rebased with main). Thanks!

venkywonka · 2025-07-10T16:23:03Z

/bot run

tensorrt-cicd · 2025-07-23T14:56:51Z

PR_Github #12710 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #9457 completed with status: 'FAILURE'

venkywonka · 2025-07-24T00:29:52Z

/bot run

tensorrt-cicd · 2025-07-24T00:35:17Z

PR_Github #12761 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-24T02:52:27Z

PR_Github #12761 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #9503 completed with status: 'FAILURE'

venkywonka · 2025-07-24T05:23:12Z

@xq25478 , there was a tot breakage that was fixed recently by #6309 - could rebase again and push? thanks!

LinPoly · 2025-07-24T06:48:00Z

/bot run

tensorrt-cicd · 2025-07-24T06:53:16Z

PR_Github #12817 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-24T08:15:10Z

PR_Github #12817 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #9551 completed with status: 'FAILURE'

LinPoly · 2025-07-24T09:12:46Z

/bot run

tensorrt-cicd · 2025-07-24T09:18:21Z

PR_Github #12834 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-24T13:38:06Z

PR_Github #12834 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #9567 completed with status: 'FAILURE'

venkywonka · 2025-07-24T21:23:19Z

@xq25478
seeing this again, apologies for this repetition, but could you rebase agian? thanks!

venkywonka · 2025-07-24T21:28:17Z

/bot run

tensorrt-cicd · 2025-07-24T21:33:57Z

PR_Github #12898 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-25T02:37:28Z

PR_Github #12898 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #9616 completed with status: 'SUCCESS'

xq25478 · 2025-07-25T04:51:20Z

PR_Github #12898 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #9616 completed with status: 'SUCCESS'

done

LinPoly · 2025-07-25T06:03:22Z

/bot run

tensorrt-cicd · 2025-07-25T06:08:32Z

PR_Github #12961 [ run ] triggered by Bot

tensorrt-cicd · 2025-07-25T06:40:51Z

PR_Github #12961 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #9669 completed with status: 'FAILURE'

LinPoly · 2025-07-25T07:51:29Z

/bot skip --comment "Previous CI passed"

tensorrt-cicd · 2025-07-25T07:56:40Z

PR_Github #12980 [ skip ] triggered by Bot

tensorrt-cicd · 2025-07-25T08:22:56Z

PR_Github #12980 [ skip ] completed with state SUCCESS
Skipping testing for commit 8a70c10

Signed-off-by: xq25478 <[email protected]> Signed-off-by: Venky Ganesh <[email protected]> Signed-off-by: hexiao.xq <[email protected]> Co-authored-by: Venky Ganesh <[email protected]> Co-authored-by: hexiao.xq <[email protected]> Co-authored-by: Pengyun Lin <[email protected]> Signed-off-by: Shreyas Misra <[email protected]>

Signed-off-by: xq25478 <[email protected]> Signed-off-by: Venky Ganesh <[email protected]> Signed-off-by: hexiao.xq <[email protected]> Co-authored-by: Venky Ganesh <[email protected]> Co-authored-by: hexiao.xq <[email protected]> Co-authored-by: Pengyun Lin <[email protected]> Signed-off-by: Ransiki Zhang <[email protected]>

Signed-off-by: xq25478 <[email protected]> Signed-off-by: Venky Ganesh <[email protected]> Signed-off-by: hexiao.xq <[email protected]> Co-authored-by: Venky Ganesh <[email protected]> Co-authored-by: hexiao.xq <[email protected]> Co-authored-by: Pengyun Lin <[email protected]> Signed-off-by: Lanyu Liao <[email protected]>

xq25478 force-pushed the support_logitbias branch from 0268d72 to 147fb84 Compare June 19, 2025 06:42

poweiw added the Community want to contribute PRs initiated from Community label Jun 24, 2025

LinPoly self-requested a review July 1, 2025 05:48

LinPoly changed the title ~~feat(openai protocol):support logitbias~~ [feat]: support logit_bias Jul 3, 2025

LinPoly requested a review from Copilot July 3, 2025 07:22

Copilot AI reviewed Jul 3, 2025

View reviewed changes

tensorrt_llm/serve/openai_protocol.py Outdated Show resolved Hide resolved

tensorrt_llm/serve/openai_protocol.py Outdated Show resolved Hide resolved

LinPoly reviewed Jul 3, 2025

View reviewed changes

tensorrt_llm/sampling_params.py Outdated Show resolved Hide resolved

tensorrt_llm/sampling_params.py Show resolved Hide resolved

tensorrt_llm/sampling_params.py Outdated Show resolved Hide resolved

tensorrt_llm/sampling_params.py Outdated Show resolved Hide resolved

xq25478 force-pushed the support_logitbias branch from 7196cd0 to 5506bea Compare July 3, 2025 09:15

LinPoly requested a review from netanel-haber July 4, 2025 07:40

xq25478 force-pushed the support_logitbias branch 3 times, most recently from 2fd2d0e to f3c7a0b Compare July 7, 2025 07:47

xq25478 closed this Jul 7, 2025

xq25478 reopened this Jul 7, 2025

xq25478 force-pushed the support_logitbias branch from f3c7a0b to 9a4f0b9 Compare July 7, 2025 07:51

netanel-haber reviewed Jul 9, 2025

View reviewed changes

tensorrt_llm/sampling_params.py Outdated Show resolved Hide resolved

netanel-haber reviewed Jul 9, 2025

View reviewed changes

tensorrt_llm/sampling_params.py Outdated Show resolved Hide resolved

netanel-haber reviewed Jul 9, 2025

View reviewed changes

tensorrt_llm/sampling_params.py Outdated Show resolved Hide resolved

xq25478 force-pushed the support_logitbias branch 2 times, most recently from 7e4e5dd to 533f5b6 Compare July 10, 2025 08:06

LinPoly reviewed Jul 10, 2025

View reviewed changes

xq25478 force-pushed the support_logitbias branch from 775735c to b5a07f4 Compare July 10, 2025 09:49

Merge branch 'main' into support_logitbias

ccb9e8e

Merge branch 'NVIDIA:main' into support_logitbias

8a70c10

coderabbitai bot requested a review from venkywonka July 25, 2025 04:51

LinPoly enabled auto-merge (squash) July 25, 2025 06:03

LinPoly merged commit a0aecf0 into NVIDIA:main Jul 25, 2025
2 checks passed

[feat]: support logit_bias #5354

[feat]: support logit_bias #5354

Uh oh!

Conversation

xq25478 commented Jun 19, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

feat(openai protocol):support logitbias

Summary by CodeRabbit

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

LinPoly left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LinPoly commented Jul 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LinPoly commented Jul 4, 2025

Uh oh!

xq25478 commented Jul 7, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LinPoly commented Jul 10, 2025

Uh oh!

tensorrt-cicd commented Jul 10, 2025

Uh oh!

tensorrt-cicd commented Jul 10, 2025

Uh oh!

venkywonka commented Jul 10, 2025

Uh oh!

venkywonka commented Jul 10, 2025

Uh oh!

tensorrt-cicd commented Jul 23, 2025

Uh oh!

venkywonka commented Jul 24, 2025

Uh oh!

tensorrt-cicd commented Jul 24, 2025

Uh oh!

tensorrt-cicd commented Jul 24, 2025

Uh oh!

venkywonka commented Jul 24, 2025

Uh oh!

LinPoly commented Jul 24, 2025

Uh oh!

tensorrt-cicd commented Jul 24, 2025

Uh oh!

tensorrt-cicd commented Jul 24, 2025

Uh oh!

LinPoly commented Jul 24, 2025

Uh oh!

tensorrt-cicd commented Jul 24, 2025

Uh oh!

tensorrt-cicd commented Jul 24, 2025

Uh oh!

venkywonka commented Jul 24, 2025

Uh oh!

venkywonka commented Jul 24, 2025

Uh oh!

tensorrt-cicd commented Jul 24, 2025

Uh oh!

tensorrt-cicd commented Jul 25, 2025

Uh oh!

xq25478 commented Jun 19, 2025 •

edited by coderabbitai bot

Loading

LinPoly commented Jul 4, 2025 •

edited

Loading