Exploring use of kwargs for timm model and transforms creation #35819

rwightman · 2025-01-21T16:54:08Z

What does this PR do?

Allow kwargs to be passed through to timm create_model and create_transforms fns via the TimmWrapperModel and TimmWrapperImageProcessor classes respectively.

Current exploring use and not ready to finalize. Would like to allow flexibility to configure model and transforms at runtime for added flexibility.

Examples:

Allow dynamic image size for vits
model = AutoModelForImageClassification.from_pretrained("timm/vit_base_patch16_224.augreg2_in21k_ft_in1k", dynamic_img_size=True)

Change image size of pre-processor

model = AutoModel.from_pretrained("timm/vit_large_patch14_dinov2.lvd142m", dynamic_img_size=True)
proc = transformers.AutoImageProcessor.from_pretrained("timm/vit_large_patch14_dinov2.lvd142m", img_size=448)

Customize image transforms, this will enable auto_augment and rand-erasing in train transforms and use the train resolution instead of test target size for eval transform

proc = transformers.AutoImageProcessor.from_pretrained("timm/resnet50.a1_in1k", use_train_size=True, auto_augment='rand-m9-inc1-mstd101', re_prob=0.3)

rwightman · 2025-01-21T16:56:42Z

I though this could also be neat to wire through in the image classification example script, though not quite sure how to add the args to HfArgumentParser, would be least obtrusive if a --model-kwargs and --preprocessor-kwargs could be added that'd pass through lists of key-value pairs as kwargs to the AutoModel and AutoImageProcessor...

rwightman · 2025-01-21T17:01:17Z

Not sure who else would have opinions re the example scripts, this could allow all the advanced timm augmentations, and also enabling of some model regularization like stochastic depth (e.g. drop_path=0.2) for training through transformers. @merveenoyan @NielsRogge

qubvel

Thanks for updating! Can you please add test cases?

qubvel · 2025-01-21T17:15:51Z

src/transformers/models/timm_wrapper/image_processing_timm_wrapper.py

+        }
+
+        # Merge kwargs that should be passed through to timm transform factory into image_processor_dict
+        for k in _DATA_ARG_KEYS + tuple(inspect.signature(timm.data.create_transform).parameters.keys()):


Would be great to have a variable for tuple(inspect.signature(timm.data.create_transform).parameters.keys()), easier to read and debug

@qubvel changed that code a little in last commit

* default to train input size (less surprising) * add properties to mimic .size .crop_size .image_mean .image_std attributes in many Transformers image preproc (works with autotrain now) * try to make key check / inspect code more clear

rwightman · 2025-01-22T00:13:35Z

I added properties to the image processor so use cases where people were read image_processor.size, image_processor.image_mean, etc will work. autotrain relied on this and now it works with timm models.

qubvel · 2025-01-23T14:31:10Z

Great! Can you please add some test cases to tests/models/timm_wrapper/? And would be great to add some code snippets to the docs timm_wrapper.md to make sure it's clear how to modify image processor/model parameters.

rwightman · 2025-01-23T17:58:50Z

Great! Can you please add some test cases to tests/models/timm_wrapper/? And would be great to add some code snippets to the docs timm_wrapper.md to make sure it's clear how to modify image processor/model parameters.

I'll get to this eventually, I wanted to explore the use cases a bit more with some notebooks, train runs and see if any other changes are worthwhile.

Exploring use of kwargs for timm model and transforms creation

da30662

rwightman marked this pull request as draft January 21, 2025 16:54

rwightman requested a review from qubvel January 21, 2025 16:58

qubvel reviewed Jan 21, 2025

View reviewed changes

shuminghu mentioned this pull request May 23, 2025

PerceptionLM #37878

Merged

qubvel mentioned this pull request Jun 17, 2025

Add kwargs for timm.create_model in TimmWrapper #38860

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Exploring use of kwargs for timm model and transforms creation #35819

Exploring use of kwargs for timm model and transforms creation #35819

Uh oh!

rwightman commented Jan 21, 2025

Uh oh!

rwightman commented Jan 21, 2025

Uh oh!

rwightman commented Jan 21, 2025

Uh oh!

qubvel left a comment

Uh oh!

qubvel Jan 21, 2025 •

edited

Loading

Uh oh!

rwightman Jan 22, 2025

Uh oh!

rwightman commented Jan 22, 2025

Uh oh!

qubvel commented Jan 23, 2025 •

edited

Loading

Uh oh!

rwightman commented Jan 23, 2025

Uh oh!

Uh oh!

Exploring use of kwargs for timm model and transforms creation #35819

Are you sure you want to change the base?

Exploring use of kwargs for timm model and transforms creation #35819

Uh oh!

Conversation

rwightman commented Jan 21, 2025

What does this PR do?

Uh oh!

rwightman commented Jan 21, 2025

Uh oh!

rwightman commented Jan 21, 2025

Uh oh!

qubvel left a comment

Choose a reason for hiding this comment

Uh oh!

qubvel Jan 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rwightman Jan 22, 2025

Choose a reason for hiding this comment

Uh oh!

rwightman commented Jan 22, 2025

Uh oh!

qubvel commented Jan 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rwightman commented Jan 23, 2025

Uh oh!

Uh oh!

qubvel Jan 21, 2025 •

edited

Loading

qubvel commented Jan 23, 2025 •

edited

Loading