Skip to content

Conversation

@vishalpandya1990
Copy link
Contributor

@vishalpandya1990 vishalpandya1990 commented Dec 19, 2025

Description

  • Follow-up to PR-26555 - add Blackwell check in TRTRTX EP unit tests for FP4/FP8 Custom ops since Blackwell

Motivation and Context

  • NVFP4 recipe (combination of FP4 and FP8) is primarily intended for Blackwell+ GPUs as they have Tensor Cores for FP4 data type.

@vishalpandya1990
Copy link
Contributor Author

CC @yuslepukhin

}

// Log GPU compute capability
std::cout << "GPU Compute Capability: SM "
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will print for each test case that using this function. It can be moved to GetCudaArchitecture to print only once.

@tianleiwu
Copy link
Contributor

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline

@azure-pipelines
Copy link

Azure Pipelines successfully started running 4 pipeline(s).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants