API for getting intermediate image and text features, forward_intermediates() by rwightman · Pull Request #1035 · mlfoundations/open_clip

rwightman · 2025-02-22T23:05:23Z

Compatible with timm approach (passes through to timm image encoders) and adding to builtin image and text...

This is a different approach than #731, but as with timm, I found the idea of integrating the output_hidden_states approach to be like HF Transformers to be a bit of a headache and risk too many regressions. This approach duplicates some code, but it makes typing less problematic, and keeps the main forward functions lean for training, normal use.

…cases, refining.

…ore cases.

…dd partial impl to CoCa model.

…r CoCa

rwightman added 2 commits February 22, 2025 15:04

Intermediate features, WIP

d7c157e

More work on intermediates. Functionality working, testing different …

9641148

…cases, refining.

rwightman marked this pull request as draft February 22, 2025 23:06

Add forward_intermediates to ResNet tower, fix several issues, test m…

f13735a

…ore cases.

rwightman marked this pull request as ready for review February 24, 2025 23:10

rwightman added 5 commits February 24, 2025 15:59

Fix corner case with intermediates_only and normalization active

10abe2b

Reorganize, rename some output args. Split image/text extra tokens. A…

c59d0a0

…dd partial impl to CoCa model.

Fix getting intermediates prefix tokens from timm image towers

0c6b247

Check for features on normalize

43d3850

key check on normalize for CoCa, use_reentrant=False checkpointing fo…

efb843c

…r CoCa

rwightman merged commit c4fba83 into main Mar 1, 2025
4 checks passed

rwightman deleted the intermediates branch March 1, 2025 00:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API for getting intermediate image and text features, forward_intermediates()#1035

API for getting intermediate image and text features, forward_intermediates()#1035
rwightman merged 8 commits intomainfrom
intermediates

rwightman commented Feb 22, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rwightman commented Feb 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

rwightman commented Feb 22, 2025 •

edited

Loading