Skip to content

[CB] [Minor] Add parameter to tune default compile level#46533

Merged
remi-or merged 5 commits into
mainfrom
cb-shorter-compile
Jun 12, 2026
Merged

[CB] [Minor] Add parameter to tune default compile level#46533
remi-or merged 5 commits into
mainfrom
cb-shorter-compile

Conversation

@remi-or

@remi-or remi-or commented Jun 10, 2026

Copy link
Copy Markdown
Collaborator

Summary

Currently, continuous batching exposes the parameter use_default_compile_configs to the users so that they may enable torch.compile easily. But that parameter enables compile with max-autotune-no-cudagraphs and dynamic=False for best performances. This makes for a long warmup, which is not great for tests or to iterate over benchmarks.

This PR cleanly deprecates this parameter and instead exposes default_compile_level which is an integer parameter that controls default compile behavior. Level 0 means no compile, and level 1-3 activate compile, with increasing performance and warmup duration.
If a user passes the old parameter, a warning is raised, and the level is set to 3, so that the behaviour is the same as with the old parameter.

Performance

No behavioral change, so same perf.

Tests

TBD

Review

Reviewed by claude + coumpound engineering.

@remi-or remi-or requested a review from ArthurZucker June 10, 2026 07:38
@remi-or remi-or self-assigned this Jun 10, 2026
@remi-or remi-or moved this from Backlog to In review in Continuous batching Jun 10, 2026
@remi-or remi-or changed the title [CB] (Minor) Add parameter to tune default compile level [CB] [Minor] Add parameter to tune default compile level Jun 10, 2026
@HuggingFaceDocBuilderDev

Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@github-actions

Copy link
Copy Markdown
Contributor

CI Dashboard: View test results in Grafana

@remi-or remi-or enabled auto-merge June 12, 2026 06:58
@remi-or remi-or added this pull request to the merge queue Jun 12, 2026
@github-merge-queue github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Jun 12, 2026
@remi-or remi-or added this pull request to the merge queue Jun 12, 2026
Merged via the queue into main with commit 26b7389 Jun 12, 2026
121 checks passed
@remi-or remi-or deleted the cb-shorter-compile branch June 12, 2026 08:21
@github-project-automation github-project-automation Bot moved this from In review to Done in Continuous batching Jun 12, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

3 participants