Skip to content

Conversation

PerkzZheng
Copy link
Collaborator

Fallback to cubins for now, and revisit it later.

@PerkzZheng PerkzZheng requested a review from a team as a code owner July 7, 2025 02:59
@PerkzZheng
Copy link
Collaborator Author

/bot run

@PerkzZheng PerkzZheng requested review from QiJune and qsang-nv July 7, 2025 02:59
@tensorrt-cicd
Copy link
Collaborator

PR_Github #11092 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #11092 [ run ] completed with state SUCCESS
/LLM/release-0.21/L0_MergeRequest_PR pipeline #176 completed with status: 'SUCCESS'
Pipeline passed with automatic retried tests. Check the rerun report for details.

@crazydemo
Copy link
Collaborator

verified the fp8 cases, all pass.

@crazydemo crazydemo self-requested a review July 8, 2025 02:19
@QiJune QiJune requested a review from litaotju July 8, 2025 02:26
@litaotju litaotju merged commit 5a50e2b into NVIDIA:release/0.21 Jul 8, 2025
4 checks passed
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 10, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <[email protected]>
Signed-off-by: Perkz Zheng <[email protected]>
Co-authored-by: qsang-nv <[email protected]>
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 10, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <[email protected]>
Signed-off-by: Perkz Zheng <[email protected]>
Co-authored-by: qsang-nv <[email protected]>
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 10, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <[email protected]>
Signed-off-by: Perkz Zheng <[email protected]>
Co-authored-by: qsang-nv <[email protected]>
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 11, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <[email protected]>
Signed-off-by: Perkz Zheng <[email protected]>
Co-authored-by: qsang-nv <[email protected]>
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 14, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <[email protected]>
Signed-off-by: Perkz Zheng <[email protected]>
Co-authored-by: qsang-nv <[email protected]>
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 14, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <[email protected]>
Signed-off-by: Perkz Zheng <[email protected]>
Co-authored-by: qsang-nv <[email protected]>
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 14, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <[email protected]>
Signed-off-by: Perkz Zheng <[email protected]>
Co-authored-by: qsang-nv <[email protected]>
dc3671 pushed a commit that referenced this pull request Jul 14, 2025
… fmha kernels on Ada. (#5779)

Signed-off-by: Qidi Sang <[email protected]>
Signed-off-by: Perkz Zheng <[email protected]>
Co-authored-by: qsang-nv <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants