Skip to content

Conversation

PerkzZheng
Copy link
Collaborator

The embedding TP should be disabled when the attention DP is used.

@PerkzZheng PerkzZheng requested a review from a team as a code owner July 1, 2025 10:03
@PerkzZheng PerkzZheng force-pushed the user/perkzz/fix-attention-dp branch from 626a13e to 680565a Compare July 1, 2025 10:05
@PerkzZheng
Copy link
Collaborator Author

/bot run

@tensorrt-cicd
Copy link
Collaborator

PR_Github #10486 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #10486 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #7763 completed with status: 'SUCCESS'
Pipeline passed with automatic retried tests. Check the rerun report for details.

@byshiue byshiue merged commit ba2ab50 into NVIDIA:main Jul 2, 2025
3 checks passed
Shunkangz pushed a commit to Shunkangz/TensorRT-LLM that referenced this pull request Jul 2, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 9, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants