Fix(informer): Correct tensor shape for input_size=1 #38856

Flink-ddd · 2025-06-17T06:03:45Z

Hi @Rocketknight1, thanks for the great guidance on the previous PR!

This new pull request follows your suggestion to fix the bug at its source. It resolves a RuntimeError that occurs in time series models inheriting from TimeSeriesTransformerModel (such as InformerModel) when config.input_size is set to 1.

The root cause was that when input_size=1, the loc and scale tensors calculated by the scaler retained an extra dimension (e.g., shape [B, 1, 1] instead of [B, 1]). This incorrect shape caused a dimension mismatch error during a later expand() operation.

Instead of overriding the method in the child class, this PR applies a minimal and robust fix directly to the create_network_inputs method in the parent TimeSeriesTransformerModel. It refactors the logic to unconditionally apply .squeeze(1) to both the loc and scale tensors. This approach handles all input_size cases correctly and avoids code duplication.
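To make the shape problem concrete, here is a minimal sketch in plain PyTorch. It is not the actual `create_network_inputs` code: the helpers `build_static_feat` and `expand_over_time` are hypothetical names, and the sketch assumes the scaler returns `loc` and `scale` with a kept time dimension of size 1, as described above.

```python
import torch


def build_static_feat(loc, scale, squeeze):
    """Hypothetical helper: concatenate log-location and log-scale features."""
    if squeeze:
        # Drop the kept time dimension: [B, 1, C] -> [B, C]
        loc, scale = loc.squeeze(1), scale.squeeze(1)
    return torch.cat((loc.abs().log1p(), scale.log()), dim=1)


def expand_over_time(static_feat, seq_len):
    """Hypothetical helper: repeat the static features along a new time axis."""
    return static_feat.unsqueeze(1).expand(-1, seq_len, -1)


batch, seq_len = 4, 7

# input_size=1: the scaler output keeps an extra dimension, shape [B, 1, 1].
loc = torch.zeros(batch, 1, 1)
scale = torch.ones(batch, 1, 1)

# Without the squeeze, the features stay 3-D, so the later 3-argument
# expand() is applied to a 4-D tensor and raises a RuntimeError.
try:
    expand_over_time(build_static_feat(loc, scale, squeeze=False), seq_len)
    failed_without_squeeze = False
except RuntimeError:
    failed_without_squeeze = True

# With an unconditional squeeze(1), the same call works for input_size=1 ...
expanded_uni = expand_over_time(build_static_feat(loc, scale, squeeze=True), seq_len)

# ... and for a multivariate case (assumed here: input_size=3, shape [B, 1, 3]).
loc_multi = torch.zeros(batch, 1, 3)
scale_multi = torch.ones(batch, 1, 3)
expanded_multi = expand_over_time(
    build_static_feat(loc_multi, scale_multi, squeeze=True), seq_len
)

print(failed_without_squeeze)        # True
print(tuple(expanded_uni.shape))     # (4, 7, 2)
print(tuple(expanded_multi.shape))   # (4, 7, 6)
```

The point of the sketch is only that `squeeze(1)` removes the size-1 time dimension regardless of how many channels follow it, so the same code path serves both cases.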

Fixes #38745

The create_network_inputs function in TimeSeriesTransformerModel handled the scaler's loc and scale tensors inconsistently. When input_size=1, the tensors were not squeezed, leading to downstream dimension errors for models like Informer. This commit refactors the logic to unconditionally apply .squeeze(1), which correctly handles all input_size cases and fixes the bug at its source. Fixes huggingface#38745

Rocketknight1 · 2025-06-17T13:08:49Z

cc @kashif since you worked on the original Informer PR, can you take a look? I suggested moving this change into the original function for TimeSeriesTransformer, but I'm not certain about that. I don't know the models well enough to know what the expected behaviour should be for config.input_size == 1.

kashif · 2025-06-18T06:20:09Z

Thanks @Flink-ddd, can you also kindly confirm that the Autoformer model works?

Flink-ddd · 2025-06-18T08:29:31Z

Hi @kashif and @Rocketknight1,

Thanks for the great suggestion to check Autoformer!

You were right to be cautious. I can confirm that AutoformerModel suffers from exactly the same bug. I verified this locally by running a test against the unfixed code, which failed with the same RuntimeError as Informer.

This PR fixes the bug in the shared parent class, TimeSeriesTransformerModel, so it should resolve the issue for Informer, Autoformer, and any other inheriting models.

All relevant CI checks are now passing. Thanks for helping me get to a much cleaner and more robust solution!

Flink-ddd · 2025-06-23T00:09:49Z

Hi @kashif, thanks again for the approval! Just wanted to gently check in and see if there is anything else needed from my side to get this merged. Thanks!

HuggingFaceDocBuilderDev · 2025-06-23T09:36:52Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Flink-ddd added 2 commits June 17, 2025 13:59

kashif approved these changes Jun 18, 2025

Merge branch 'main' into fix/timeseries-scaler-squeeze

fec9b7f

kashif approved these changes Jun 23, 2025

kashif merged commit 334bf91 into huggingface:main Jun 23, 2025
14 checks passed
