[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default #5312

Superjomn · 2025-06-18T05:05:01Z

PR Description

This PR makes the PyT LLM the default with the following API breaking change which is confirmed with @laikhtewari :

To use LLM with PyT backend: from tensorrt_llm import LLM
To use LLM with TRT backend: from tensorrt_llm._tensorrt_engine import LLM

We introduce the need for an explicit code change to existing TRT backend users due to the arglists of both PyT LLM and TRT LLM diverging, and there is no seamless way to switch the backend without modifying the code.

The usage code, including tests and examples, has been minimally updated to keep the PR concise and focused.

There will be dedicated PRs to change the doc, examples accordingly later.

juney-nvidia · 2025-06-18T11:18:12Z

Since we have already branch out 0.21 branch, it is okay to land this PR onto the GH main directly.

Thanks
June

Superjomn · 2025-06-18T11:27:58Z

/bot run

tensorrt-cicd · 2025-06-18T11:33:59Z

PR_Github #9377 [ run ] triggered by Bot

tensorrt_llm/llmapi/__init__.py

tensorrt-cicd · 2025-06-18T13:52:48Z

PR_Github #9377 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6882 completed with status: 'FAILURE'

Superjomn · 2025-06-18T14:42:27Z

/bot run --disable-fail-fast

tensorrt-cicd · 2025-06-18T14:47:46Z

PR_Github #9394 [ run ] triggered by Bot

Superjomn · 2025-06-19T06:39:00Z

/bot run --disable-fail-fast

tensorrt-cicd · 2025-06-19T06:47:17Z

PR_Github #9461 [ run ] triggered by Bot

QiJune

LGTM

tensorrt-cicd · 2025-06-19T10:26:22Z

PR_Github #9461 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6946 completed with status: 'FAILURE'

Superjomn · 2025-06-19T10:27:06Z

/bot run --disable-fail-fast

tensorrt-cicd · 2025-06-19T10:32:30Z

PR_Github #9499 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-19T14:08:18Z

PR_Github #9499 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6969 completed with status: 'FAILURE'

Signed-off-by: Superjomn <[email protected]>

Superjomn · 2025-06-19T14:40:54Z

/bot run --disable-fail-fast

tensorrt-cicd · 2025-06-19T14:46:40Z

PR_Github #9525 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-19T19:00:56Z

PR_Github #9525 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6988 completed with status: 'SUCCESS'
Pipeline passed with automatic retried tests. Check the rerun report for details.

…VIDIA#5312) Signed-off-by: Superjomn <[email protected]>

Superjomn requested review from a team as code owners June 18, 2025 05:05

Superjomn requested review from lucaslie and QiJune June 18, 2025 05:05

Superjomn marked this pull request as draft June 18, 2025 05:05

Superjomn removed request for lucaslie and QiJune June 18, 2025 05:05

Superjomn force-pushed the make-pyt-default branch from cedc6ea to ec3fd21 Compare June 18, 2025 07:50

Superjomn changed the title ~~chore[BREAKING CHANGE]: make pytorch LLM the default~~ chore TRTLLM-5208 [BREAKING CHANGE]: make pytorch LLM the default Jun 18, 2025

Superjomn force-pushed the make-pyt-default branch 3 times, most recently from 5ca2c7b to 42c2cad Compare June 18, 2025 09:42

Superjomn marked this pull request as ready for review June 18, 2025 09:44

Superjomn requested review from QiJune, lucaslie, suyoggupta and nv-guomingz June 18, 2025 09:45

Superjomn changed the title ~~chore TRTLLM-5208 [BREAKING CHANGE]: make pytorch LLM the default~~ [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default Jun 18, 2025

QiJune reviewed Jun 18, 2025

View reviewed changes

tensorrt_llm/llmapi/__init__.py Show resolved Hide resolved

Superjomn force-pushed the make-pyt-default branch from c4ea41a to 5ef186c Compare June 19, 2025 06:26

QiJune approved these changes Jun 19, 2025

View reviewed changes

Superjomn force-pushed the make-pyt-default branch from 5ef186c to 9595180 Compare June 19, 2025 10:26

Superjomn requested a review from a team as a code owner June 19, 2025 10:26

Superjomn requested a review from syuoni June 19, 2025 10:27

Superjomn added 2 commits June 19, 2025 14:39

make PyT default

952e012

Signed-off-by: Superjomn <[email protected]>

fix

302d4b1

Signed-off-by: Superjomn <[email protected]>

Superjomn force-pushed the make-pyt-default branch from 9595180 to 302d4b1 Compare June 19, 2025 14:40

Superjomn enabled auto-merge (squash) June 19, 2025 14:41

Superjomn merged commit 9bd42ec into NVIDIA:main Jun 19, 2025
3 checks passed

Superjomn deleted the make-pyt-default branch June 19, 2025 21:32

This was referenced Jun 25, 2025

feat/add latency support for trtllm bench #3730

Merged

Update trtllm-bench to support new Pytorch default. #5491

Merged

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 9, 2025

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (N…

3b2a310

…VIDIA#5312) Signed-off-by: Superjomn <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (N…

2930a6b

…VIDIA#5312) Signed-off-by: Superjomn <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (N…

7e1b545

…VIDIA#5312) Signed-off-by: Superjomn <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (N…

e7a6675

…VIDIA#5312) Signed-off-by: Superjomn <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (N…

c52a1e2

…VIDIA#5312) Signed-off-by: Superjomn <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (N…

4ce63d9

…VIDIA#5312) Signed-off-by: Superjomn <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (N…

3106ee3

…VIDIA#5312) Signed-off-by: Superjomn <[email protected]>

dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (N…

c9f40b8

…VIDIA#5312) Signed-off-by: Superjomn <[email protected]>

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default #5312

[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default #5312

Uh oh!

Conversation

Superjomn commented Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Description

Uh oh!

juney-nvidia commented Jun 18, 2025

Uh oh!

Superjomn commented Jun 18, 2025

Uh oh!

tensorrt-cicd commented Jun 18, 2025

Uh oh!

Uh oh!

tensorrt-cicd commented Jun 18, 2025

Uh oh!

Superjomn commented Jun 18, 2025

Uh oh!

tensorrt-cicd commented Jun 18, 2025

Uh oh!

Superjomn commented Jun 19, 2025

Uh oh!

tensorrt-cicd commented Jun 19, 2025

Uh oh!

QiJune left a comment

Choose a reason for hiding this comment

Uh oh!

tensorrt-cicd commented Jun 19, 2025

Uh oh!

Superjomn commented Jun 19, 2025

Uh oh!

tensorrt-cicd commented Jun 19, 2025

Uh oh!

tensorrt-cicd commented Jun 19, 2025

Uh oh!

Superjomn commented Jun 19, 2025

Uh oh!

tensorrt-cicd commented Jun 19, 2025

Uh oh!

tensorrt-cicd commented Jun 19, 2025

Uh oh!

Uh oh!

Uh oh!

Superjomn commented Jun 18, 2025 •

edited

Loading