Add serialization function for StaticCache #38879

xadupre · 2025-06-18T06:21:45Z

What does this PR do?

Implements serialization functions for StaticCache similar to the one implemented for DynamicCache. Fixes pytorch/pytorch#155862.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

Rocketknight1 · 2025-06-18T12:52:41Z

cc @gante!

justinchuby · 2025-06-19T17:05:50Z

I think #38917 would be possible with this

gante

In general LGTM, as it enables a relevant feature.

However, I have two minor grudges :D It would be great if we could solve them, for long-term scalability of the library:

This is torch.export-related code that runs on basic imports, regardless of torch.export being used or not. For instance, from transformers import AutoModelForCausalLM will run these lines, which seems wasteful. Is there some way to enable lazy execution? Ideally, these would only be run at export time.
This torch.export-related code lives alongside the cache definition, which hurts readability for all other users. Ideally, this code would live in transformers/integrations/export.py (or torch_export.py), but we would have to careful with circular imports.

gante · 2025-06-24T11:52:22Z

src/transformers/cache_utils.py

@@ -9,7 +9,7 @@
 import torch
 from packaging import version

-from transformers.pytorch_utils import is_torch_greater_or_equal_than_2_6
+from transformers.pytorch_utils import is_torch_greater_or_equal_than_2_6, is_torch_greater_or_equal_than_2_7


We're moving all these flags to is_torch_greater_or_equal, which is already imported here
e.g. is_torch_greater_or_equal("2.7.0"), or is_torch_greater_or_equal("2.7.0", accept_dev=True) if you also want to accept dev versions of 2.7

I saw this snippet of code. Do you want me to remove the flag I inserted and use is_torch_greater_or_equal("2.7", accept_dev=True) directly in the code?

is_torch_greater_or_equal_than_2_7 = is_torch_greater_or_equal("2.7", accept_dev=True) # the line I added is_torch_greater_or_equal_than_2_6 = is_torch_greater_or_equal("2.6", accept_dev=True)

…to static

justinchuby · 2025-07-30T16:58:25Z

@tugsbayasgalan

xadupre · 2025-08-05T18:10:52Z

Replaced by #39931.

Add serialization function for StaticCache

a81079c

xadupre mentioned this pull request Jun 18, 2025

Export Huggingface models with StaticCache pytorch/pytorch#155862

Open

xadupre added 3 commits June 18, 2025 08:45

move code

c5f7c8e

fix test

11d2d67

ruff

2091dad

justinchuby mentioned this pull request Jun 19, 2025

Add past_key_values as inputs to TorchExportableModuleWithStaticCache forward #38917

Open

gante reviewed Jun 24, 2025

View reviewed changes

xadupre and others added 2 commits July 2, 2025 10:57

Merge branch 'main' of https://github.com/huggingface/transformers in…

a66ef71

…to static

Merge branch 'main' into static

09ef0d1

xadupre closed this Aug 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add serialization function for StaticCache #38879

Add serialization function for StaticCache #38879

Uh oh!

xadupre commented Jun 18, 2025

Uh oh!

Rocketknight1 commented Jun 18, 2025

Uh oh!

justinchuby commented Jun 19, 2025

Uh oh!

gante left a comment

Uh oh!

gante Jun 24, 2025

Uh oh!

xadupre Jul 2, 2025

Uh oh!

justinchuby commented Jul 30, 2025

Uh oh!

xadupre commented Aug 5, 2025

Uh oh!

Uh oh!

Add serialization function for StaticCache #38879

Add serialization function for StaticCache #38879

Uh oh!

Conversation

xadupre commented Jun 18, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

Rocketknight1 commented Jun 18, 2025

Uh oh!

justinchuby commented Jun 19, 2025

Uh oh!

gante left a comment

Choose a reason for hiding this comment

Uh oh!

gante Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

xadupre Jul 2, 2025

Choose a reason for hiding this comment

Uh oh!

justinchuby commented Jul 30, 2025

Uh oh!

xadupre commented Aug 5, 2025

Uh oh!

Uh oh!