Skip to content

Conversation

Superjomn
Copy link
Collaborator

@Superjomn Superjomn commented Jun 18, 2025

PR Description

This PR makes the PyT LLM the default with the following API breaking change which is confirmed with @laikhtewari :

  • To use LLM with PyT backend: from tensorrt_llm import LLM
  • To use LLM with TRT backend: from tensorrt_llm._tensorrt_engine import LLM

We introduce the need for an explicit code change to existing TRT backend users due to the arglists of both PyT LLM and TRT LLM diverging, and there is no seamless way to switch the backend without modifying the code.

The usage code, including tests and examples, has been minimally updated to keep the PR concise and focused.

There will be dedicated PRs to change the doc, examples accordingly later.

@Superjomn Superjomn requested review from a team as code owners June 18, 2025 05:05
@Superjomn Superjomn requested review from lucaslie and QiJune June 18, 2025 05:05
@Superjomn Superjomn marked this pull request as draft June 18, 2025 05:05
@Superjomn Superjomn removed request for lucaslie and QiJune June 18, 2025 05:05
@Superjomn Superjomn changed the title chore[BREAKING CHANGE]: make pytorch LLM the default chore TRTLLM-5208 [BREAKING CHANGE]: make pytorch LLM the default Jun 18, 2025
@Superjomn Superjomn force-pushed the make-pyt-default branch 3 times, most recently from 5ca2c7b to 42c2cad Compare June 18, 2025 09:42
@Superjomn Superjomn marked this pull request as ready for review June 18, 2025 09:44
@juney-nvidia
Copy link
Collaborator

Since we have already branch out 0.21 branch, it is okay to land this PR onto the GH main directly.

Thanks
June

@Superjomn
Copy link
Collaborator Author

/bot run

@Superjomn Superjomn changed the title chore TRTLLM-5208 [BREAKING CHANGE]: make pytorch LLM the default [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default Jun 18, 2025
@tensorrt-cicd
Copy link
Collaborator

PR_Github #9377 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #9377 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6882 completed with status: 'FAILURE'

@Superjomn
Copy link
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Collaborator

PR_Github #9394 [ run ] triggered by Bot

@Superjomn
Copy link
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Collaborator

PR_Github #9461 [ run ] triggered by Bot

Copy link
Collaborator

@QiJune QiJune left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@tensorrt-cicd
Copy link
Collaborator

PR_Github #9461 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6946 completed with status: 'FAILURE'

@Superjomn Superjomn requested a review from a team as a code owner June 19, 2025 10:26
@Superjomn
Copy link
Collaborator Author

/bot run --disable-fail-fast

@Superjomn Superjomn requested a review from syuoni June 19, 2025 10:27
@tensorrt-cicd
Copy link
Collaborator

PR_Github #9499 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #9499 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6969 completed with status: 'FAILURE'

Signed-off-by: Superjomn <[email protected]>
Signed-off-by: Superjomn <[email protected]>
@Superjomn
Copy link
Collaborator Author

/bot run --disable-fail-fast

@Superjomn Superjomn enabled auto-merge (squash) June 19, 2025 14:41
@tensorrt-cicd
Copy link
Collaborator

PR_Github #9525 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #9525 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6988 completed with status: 'SUCCESS'
Pipeline passed with automatic retried tests. Check the rerun report for details.

@Superjomn Superjomn merged commit 9bd42ec into NVIDIA:main Jun 19, 2025
3 checks passed
@Superjomn Superjomn deleted the make-pyt-default branch June 19, 2025 21:32
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 9, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants