Update VITDet to conform to KerasCV scaling standards #2086
Conversation
/gcbrun
/gcbrun
/gcbrun
@@ -123,6 +124,11 @@ def __init__(
# Use common rescaling strategy across keras_cv
x = keras.layers.Rescaling(1.0 / 255.0)(x)

# VITDet scales inputs based on the standard ImageNet mean/stddev.
So we want to apply both types of rescaling? I'm a bit confused.
The `include_rescaling` check and associated rescaling layer make sure that the inputs are scaled from 0-1. The subsequent bit rescales that using the mean and stddev of ImageNet, with the prior assumption that the inputs are scaled 0-1.
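For concreteness, here is a minimal sketch of the two-step preprocessing described above (illustrative only; it assumes Keras 3's `keras.ops`, and the standalone `preprocess` helper is not the actual KerasCV code path):

```python
import keras
import numpy as np
from keras import ops


def preprocess(x):
    # Step 1: scale raw pixel values from [0, 255] down to [0, 1].
    x = keras.layers.Rescaling(1.0 / 255.0)(x)
    # Step 2: normalize with ImageNet statistics (which assume a 0-1 range).
    mean = ops.array([0.485, 0.456, 0.406], dtype="float32")
    std = ops.array([0.229, 0.224, 0.225], dtype="float32")
    return (x - mean) / std


images = np.random.randint(0, 256, size=(1, 64, 64, 3)).astype("float32")
out = preprocess(images)  # ImageNet-normalized tensor, same shape as `images`
```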
Sounds like you already verified, but wouldn't rescaling by 255 affect the ImageNet rescaling?
Correct -- see comment inline.
# VITDet scales inputs based on the standard ImageNet mean/stddev.
x = (x - ops.array([0.229, 0.224, 0.225], dtype=x.dtype)) / (
    ops.array([0.485, 0.456, 0.406], dtype=x.dtype)
)
Two things:

- I think you got the mean and std mixed up :) From the SAM repo, mean is `[123.675, 116.28, 103.53]` (`[0.485, 0.456, 0.406]` when normalized) and std is `[58.395, 57.12, 57.375]` (`[0.229, 0.224, 0.225]` when normalized).
- We should not do this because the SAM model first normalizes, then pads. If padded inputs are passed, the preprocessing operation would not remain the same (illustrated below). It's giving me suboptimal outputs when I run the demos.
@ianstenbit Let's revert this and document the preprocess step, what do you think?
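To make the second point concrete, here is a toy numpy illustration (not SAM or KerasCV code) of why the order of zero-padding and normalization matters: padding before normalization turns the pad region into `-mean/std` instead of leaving it at zero.

```python
import numpy as np

mean = np.array([0.485, 0.456, 0.406], dtype="float32")
std = np.array([0.229, 0.224, 0.225], dtype="float32")
image = np.random.rand(8, 8, 3).astype("float32")  # already scaled to 0-1

# SAM's order: normalize the real pixels first, then zero-pad.
norm_then_pad = np.pad((image - mean) / std, ((0, 4), (0, 4), (0, 0)))

# Normalization inside the backbone sees the already-padded tensor,
# so the padded region becomes (0 - mean) / std rather than staying 0.
pad_then_norm = (np.pad(image, ((0, 4), (0, 4), (0, 0))) - mean) / std

print(pad_then_norm[-1, -1])  # roughly [-2.12, -2.04, -1.80], not zeros
print(norm_then_pad[-1, -1])  # exactly [0., 0., 0.]
```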
Yes this does look like the mean/std are backwards -- I'll fix that. Can you send your demos so that I can run them?
My test demo looked fine with this change, which makes me think it's not sensitive enough.
> Yes this does look like the mean/std are backwards -- I'll fix that. Can you send your demos so that I can run them?
Here: https://colab.research.google.com/drive/1wHTsYfmmZVuC71I4St1NshaAOQ6nUPFg?usp=sharing
I ran them with the patch in #2087 and the outputs are looking better. Output masks are now much closer to the demo on the original repo, with slight noise here and there (I think this is because the padded outlines have a non-zero value after the normalization step). Nothing big though.
> My test demo looked fine with this change, which makes me think it's not sensitive enough.
I agree, it's not super-sensitive, which is good! I just thought there might be cases where this could lead to a huge difference. What do you think about making the normalization step opt-in rather than always having it on?
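A sketch of what that opt-in could look like (the `include_imagenet_normalization` flag and `build_preprocessing` helper are hypothetical, not existing KerasCV API; assumes Keras 3):

```python
import keras
from keras import ops


def build_preprocessing(x, include_rescaling=True,
                        include_imagenet_normalization=False):
    if include_rescaling:
        # Map raw 0-255 pixel values into the 0-1 range.
        x = keras.layers.Rescaling(1.0 / 255.0)(x)
    if include_imagenet_normalization:
        # Only applied when the caller opts in, so pipelines that pad
        # before calling the backbone can skip it and normalize themselves.
        mean = ops.array([0.485, 0.456, 0.406], dtype="float32")
        std = ops.array([0.229, 0.224, 0.225], dtype="float32")
        x = (x - mean) / std
    return x
```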
* Update VITDet to conform to KerasCV scaling standards
* dtype fix
I've verified that the numerics for SAM match correctly after this change.
This removes the requirement that users do the awkward ImageNet mean/stddev scaling themselves -- now the model rescales inputs correctly, as it should (this is how we did the scaling for ViT as well).
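From a user's perspective, the change looks roughly like this (a sketch only; `backbone` stands in for a constructed ViTDet backbone with rescaling enabled):

```python
import numpy as np

images = np.random.randint(0, 256, size=(1, 1024, 1024, 3)).astype("float32")

# Before: callers had to replicate the ImageNet scaling by hand.
mean = np.array([0.485, 0.456, 0.406], dtype="float32")
std = np.array([0.229, 0.224, 0.225], dtype="float32")
manually_scaled = (images / 255.0 - mean) / std
# features = backbone(manually_scaled)

# After: the backbone rescales and normalizes internally (matching how
# ViT already handles scaling), so raw 0-255 images can be passed directly.
# features = backbone(images)
```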