Skip to content

Conversation

@jushg
Copy link
Contributor

@jushg jushg commented Apr 2, 2025

Summary:
Add a param in the base parser to set torch_nccl_enable_timing variable to False by default, and only set it to true if user needed.

This value is used only on flight-recorder (for debugging purpose), and significantly affect performance of benchmarking in blocking mode (~30% for small-mid message sizes) if enable

Reviewed By: kingchc

Differential Revision: D72240605

Summary:
Add a param in the base parser to set `torch_nccl_enable_timing` variable to `False` by default, and only set it to true if user needed.

This value is used only on flight-recorder (for debugging purpose), and significantly affect performance of benchmarking in blocking mode (~30% for small-mid message sizes) if enable

Reviewed By: kingchc

Differential Revision: D72240605
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 2, 2025
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D72240605

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in b721662.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported Merged

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants