chore: partition LLM class into TorchLLM and TrtLLM #4900

Superjomn · 2025-06-04T06:55:04Z

This PR does

Create TorchLLM and TrtLLM with most of the code shared, making it ready to switch TorchLLM as the default LLM for the upcoming version
Keep alias of tensorrt_llm._torch.llm.LLM for TorchLLM and tensorrt_llm.LLM for TrtLLM
Fix the api reference doc accordingly, especially keep the existing LLM unbroken
- Add separate API references for both TrtLLM and TorchLLM, add support for TrtLlmArgs and TorchLlmArgs.

Document:

LLM's API reference works as before:

tensorrt_llm/llmapi/llm.py

Superjomn · 2025-06-09T05:18:13Z

/bot run --add-multi-gpu-test --disable-fail-fast

tensorrt-cicd · 2025-06-09T05:23:54Z

PR_Github #8090 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-09T10:26:35Z

PR_Github #8090 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #5867 completed with status: 'FAILURE'

Superjomn · 2025-06-09T12:38:48Z

/bot run --add-multi-gpu-test --disable-fail-fast

tensorrt-cicd · 2025-06-09T12:44:24Z

PR_Github #8130 [ run ] triggered by Bot

tensorrt_llm/llmapi/llm.py

tensorrt-cicd · 2025-06-10T02:18:09Z

PR_Github #8130 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #5894 completed with status: 'FAILURE'

Superjomn · 2025-06-10T10:30:16Z

/bot run --add-multi-gpu-test --disable-fail-fast

tensorrt-cicd · 2025-06-10T10:31:31Z

PR_Github #8277 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-10T10:31:53Z

PR_Github #8278 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-10T10:31:55Z

PR_Github #8277 [ run ] completed with state ABORTED

tensorrt-cicd · 2025-06-10T10:35:53Z

PR_Github #8279 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-10T10:35:55Z

PR_Github #8278 [ run ] completed with state ABORTED

Superjomn · 2025-06-16T03:25:01Z

/bot run

tensorrt-cicd · 2025-06-16T03:30:36Z

PR_Github #8965 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-16T06:12:33Z

PR_Github #8965 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6543 completed with status: 'FAILURE'

Superjomn · 2025-06-16T07:27:19Z

/bot run

tensorrt-cicd · 2025-06-16T07:32:54Z

PR_Github #8992 [ run ] triggered by Bot

shaharmor98

LGTM . Left two nits, feel free to resolve if you don't agree.

tensorrt_llm/llmapi/llm.py

tensorrt-cicd · 2025-06-16T13:57:06Z

PR_Github #8992 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6565 completed with status: 'FAILURE'

Superjomn · 2025-06-16T14:27:01Z

/bot run

tensorrt-cicd · 2025-06-16T14:32:57Z

PR_Github #9041 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-16T18:23:45Z

PR_Github #9041 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6608 completed with status: 'FAILURE'

Superjomn · 2025-06-17T09:15:21Z

/bot run

tensorrt-cicd · 2025-06-17T09:20:55Z

PR_Github #9179 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-17T11:38:25Z

PR_Github #9179 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6725 completed with status: 'FAILURE'

Superjomn · 2025-06-17T12:46:52Z

/bot run

tensorrt-cicd · 2025-06-17T12:52:37Z

PR_Github #9207 [ run ] triggered by Bot

Signed-off-by: Superjomn <[email protected]>

Superjomn · 2025-06-17T23:58:45Z

/bot run

tensorrt-cicd · 2025-06-18T00:04:48Z

PR_Github #9253 [ run ] triggered by Bot

tensorrt-cicd · 2025-06-18T06:01:14Z

PR_Github #9253 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6789 completed with status: 'SUCCESS'
Pipeline passed with automatic retried tests. Check the rerun report for details.

Superjomn requested a review from a team as a code owner June 4, 2025 06:55

Superjomn requested review from hlu1 and shaharmor98 June 4, 2025 06:55

Superjomn force-pushed the partition-llm branch 2 times, most recently from 1ea3bf9 to 98964ed Compare June 4, 2025 07:02

Superjomn marked this pull request as draft June 4, 2025 07:40

Superjomn requested review from lucaslie and suyoggupta June 5, 2025 00:55

lucaslie reviewed Jun 5, 2025

View reviewed changes

tensorrt_llm/llmapi/llm.py Outdated Show resolved Hide resolved

lucaslie mentioned this pull request Jun 5, 2025

[AutoDeploy] _AutoDeployLlmArgs as primary config object #4891

Merged

lucaslie reviewed Jun 5, 2025

View reviewed changes

tensorrt_llm/llmapi/llm.py Show resolved Hide resolved

lucaslie reviewed Jun 5, 2025

View reviewed changes

tensorrt_llm/llmapi/llm.py Outdated Show resolved Hide resolved

Superjomn force-pushed the partition-llm branch 2 times, most recently from d3c572d to 4333698 Compare June 9, 2025 05:17

Superjomn marked this pull request as ready for review June 9, 2025 05:18

Superjomn force-pushed the partition-llm branch from 4333698 to 5cca98a Compare June 9, 2025 12:38

lucaslie reviewed Jun 9, 2025

View reviewed changes

tensorrt_llm/llmapi/llm.py Outdated Show resolved Hide resolved

Superjomn force-pushed the partition-llm branch from 81d2956 to c508ab8 Compare June 10, 2025 10:29

Superjomn force-pushed the partition-llm branch from 20a34a0 to 8ed32f1 Compare June 16, 2025 03:24

Superjomn force-pushed the partition-llm branch from 8ed32f1 to e44927a Compare June 16, 2025 07:27

shaharmor98 approved these changes Jun 16, 2025

View reviewed changes

tensorrt_llm/llmapi/llm.py Show resolved Hide resolved

tensorrt_llm/llmapi/llm.py Show resolved Hide resolved

Superjomn force-pushed the partition-llm branch from e44927a to a35df01 Compare June 16, 2025 14:26

Superjomn force-pushed the partition-llm branch from ee2506c to 3da5829 Compare June 17, 2025 12:46

Superjomn added 3 commits June 18, 2025 07:58

create TorchLLM and TrtLLM

75e501c

Signed-off-by: Superjomn <[email protected]>

fix doc

ee83e59

Signed-off-by: Superjomn <[email protected]>

fix comment

6abfc9b

Signed-off-by: Superjomn <[email protected]>

Superjomn force-pushed the partition-llm branch from 3da5829 to 6abfc9b Compare June 17, 2025 23:58

Superjomn requested review from QiJune and removed request for hlu1 June 17, 2025 23:59

Superjomn enabled auto-merge (squash) June 18, 2025 05:01

Superjomn merged commit 724e495 into NVIDIA:main Jun 18, 2025
3 checks passed

chore: partition LLM class into TorchLLM and TrtLLM #4900

chore: partition LLM class into TorchLLM and TrtLLM #4900

Uh oh!

Conversation

Superjomn commented Jun 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Superjomn commented Jun 9, 2025

Uh oh!

tensorrt-cicd commented Jun 9, 2025

Uh oh!

tensorrt-cicd commented Jun 9, 2025

Uh oh!

Superjomn commented Jun 9, 2025

Uh oh!

tensorrt-cicd commented Jun 9, 2025

Uh oh!

Uh oh!

tensorrt-cicd commented Jun 10, 2025

Uh oh!

Superjomn commented Jun 10, 2025

Uh oh!

tensorrt-cicd commented Jun 10, 2025

Uh oh!

tensorrt-cicd commented Jun 10, 2025

Uh oh!

tensorrt-cicd commented Jun 10, 2025

Uh oh!

tensorrt-cicd commented Jun 10, 2025

Uh oh!

tensorrt-cicd commented Jun 10, 2025

Uh oh!

Superjomn commented Jun 16, 2025

Uh oh!

tensorrt-cicd commented Jun 16, 2025

Uh oh!

tensorrt-cicd commented Jun 16, 2025

Uh oh!

Superjomn commented Jun 16, 2025

Uh oh!

tensorrt-cicd commented Jun 16, 2025

Uh oh!

shaharmor98 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

tensorrt-cicd commented Jun 16, 2025

Uh oh!

Superjomn commented Jun 16, 2025

Uh oh!

tensorrt-cicd commented Jun 16, 2025

Uh oh!

tensorrt-cicd commented Jun 16, 2025

Uh oh!

Superjomn commented Jun 17, 2025

Uh oh!

tensorrt-cicd commented Jun 17, 2025

Uh oh!

tensorrt-cicd commented Jun 17, 2025

Uh oh!

Superjomn commented Jun 17, 2025

Uh oh!

tensorrt-cicd commented Jun 17, 2025

Uh oh!

Superjomn commented Jun 17, 2025

Uh oh!

tensorrt-cicd commented Jun 18, 2025

Uh oh!

tensorrt-cicd commented Jun 18, 2025

Uh oh!

Uh oh!

Uh oh!

Superjomn commented Jun 4, 2025 •

edited

Loading