Manifold Loss and P2SGrad Loss #635

Merged
merged 18 commits into KevinMusgrave:dev on Jun 18, 2023

Conversation

domenicoMuscill0
Contributor

I have implemented a first version of the manifold loss introduced in the paper https://openaccess.thecvf.com/content_CVPR_2019/papers/Aziere_Ensemble_Deep_Manifold_Similarity_Learning_Using_Hard_Proxies_CVPR_2019_paper.pdf
I have also changed the to_device function in the common_functions file to accept a list or tuple of tensors.
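Roughly, the change behaves like this sketch (function name taken from this discussion; the actual signature in common_functions may differ):

```python
import torch

# Sketch of the described to_device extension, not the library's exact code:
# recurse into lists/tuples, otherwise move the single tensor.
def to_device(x, device):
    if isinstance(x, (list, tuple)):
        return type(x)(to_device(t, device) for t in x)
    return x.to(device)

# e.g. to_device([torch.zeros(2), torch.ones(3)], torch.device("cpu"))
```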
I have found one issue that the paper does not explain: the similarity matrix S of the graph is defined using scalar products, but if the sum of a row is negative, raising the degree matrix D to the power -1/2 gives torch.nan.
To solve this I used abs(D) instead of D.
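Here is a minimal sketch of the problem and the workaround (illustrative only, not the exact code in manifold_loss.py):

```python
import torch

emb = torch.randn(5, 8)
S = emb @ emb.T               # similarity via scalar products; entries can be negative
d = S.sum(dim=1)              # row sums of S; may be negative
# d.pow(-0.5) yields NaN for negative entries (negative base, fractional
# exponent), so take the absolute value first, as described above:
D_inv_sqrt = torch.diag(d.abs().pow(-0.5))
S_norm = D_inv_sqrt @ S @ D_inv_sqrt   # symmetrically normalized similarity
```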

@KevinMusgrave
Owner

Thanks @domenicoMuscill0!

Would it be possible to write a test for this loss function? Usually for tests I compare the implementation with a more manual version of the loss function, like using for-loops instead of vectorized operations. Or I compare the implementation with the official implementation, if it exists.
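For example, the pattern looks roughly like this (a toy check on a pairwise-distance function, not an actual test from this PR):

```python
import torch

# Compare a vectorized computation against a plain for-loop reference.
def sq_dists_vectorized(x):
    diff = x.unsqueeze(1) - x.unsqueeze(0)
    return diff.pow(2).sum(dim=-1)

def sq_dists_loops(x):
    n = x.shape[0]
    out = torch.zeros(n, n)
    for i in range(n):
        for j in range(n):
            out[i, j] = (x[i] - x[j]).pow(2).sum()
    return out

x = torch.randn(6, 4)
assert torch.allclose(sq_dists_vectorized(x), sq_dists_loops(x), atol=1e-6)
```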

@domenicoMuscill0
Contributor Author

I have managed to get the original version of the code for the Manifold loss, and I am writing the tests. I have also implemented the P2SGrad loss and written and passed some basic tests. When I finish (hopefully tomorrow), will it be enough to just commit to my branch dev?

@KevinMusgrave
Owner

Yes

@KevinMusgrave
Owner

KevinMusgrave commented Jun 7, 2023

Looks like the tests are passing now. Thanks @domenicoMuscill0!

Can you remove the commented-out code and also run ./format_code.sh?

You'll need to install black, isort, and nbqa.

@KevinMusgrave
Owner

Also run ./run_linter.sh and see if there are any linter warnings. (This has more linter rules than the github workflow action.)

You'll need to install flake8.

@domenicoMuscill0
Contributor Author

I have removed the commented-out code lines; I had left them there because they were in the original file the author (@azieren) sent me. I have kept comments only where I explain the changes I made with respect to his original version.
I ran ./run_linter.sh and got 0 warnings. I must also highlight that the relative error bounds I selected are quite large, due to some noticeable numerical-stability issues I encountered in the original implementation. I have already contacted @azieren privately to discuss this. I have run many experiments and concluded that this is the only reason the results differ so much. However, I am not an expert, and I am available to make the necessary changes if needed. Thank you very much @KevinMusgrave for creating this library!
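For example (hypothetical numbers, just to illustrate the effect of the bound):

```python
import torch

# A small numerical drift fails a strict relative tolerance but passes a loose one.
a = torch.tensor([1.000, 2.000])   # reference values
b = torch.tensor([1.002, 1.996])   # values with slight numerical drift
print(torch.allclose(a, b, rtol=1e-5))  # False: tight bound rejects the drift
print(torch.allclose(a, b, rtol=1e-2))  # True: a looser bound tolerates it
```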

@domenicoMuscill0
Contributor Author

However, the numerical instability concerns only the Manifold loss. The P2SGrad loss matches both the existing implementation and the theoretical gradients described in the paper.
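For reference, a hedged sketch of the idea I tested against (illustrative function, not the library's API): the paper's prescribed gradients have the form of an MSE between the cosine similarities and the one-hot labels.

```python
import torch
import torch.nn.functional as F

# Sketch of the P2SGrad idea: its gradients match those of an MSE between
# the cosine similarities cos(theta) and the one-hot labels.
def p2sgrad_like_loss(embeddings, class_weights, labels):
    cos_theta = F.normalize(embeddings, dim=1) @ F.normalize(class_weights, dim=1).T
    one_hot = F.one_hot(labels, num_classes=class_weights.shape[0]).float()
    return (cos_theta - one_hot).pow(2).sum(dim=1).mean()

emb = torch.randn(8, 16, requires_grad=True)
W = torch.randn(4, 16)
y = torch.randint(0, 4, (8,))
p2sgrad_like_loss(emb, W, y).backward()  # populates emb.grad
```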

@KevinMusgrave
Owner

Thanks @domenicoMuscill0!

KevinMusgrave changed the title from "Manifold Loss" to "Manifold Loss and P2SGrad Loss" on Jun 12, 2023
@KevinMusgrave
Owner

@domenicoMuscill0 For ManifoldLoss, is there a reason for passing in labels as indices_tuple=labels instead of labels=labels?

@domenicoMuscill0
Contributor Author

Those labels are just the indices of the metaclasses/clusters to which the embeddings belong. I preferred to interpret them as indices tuples because this way it will be possible to add the Manifold loss to the supported self-supervised losses in the future. These indices tuples are not strictly required: in lines 61-66 of manifold_loss.py we initialize the metaclasses with random embeddings (as reported in the paper) if no cluster indices are passed to the function as the indices_tuple argument.
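For example, usage would look roughly like this (a sketch based on this discussion; treat the constructor argument as an assumption, since the merged API may differ):

```python
import torch
from pytorch_metric_learning import losses

embeddings = torch.randn(32, 128)
cluster_labels = torch.randint(0, 4, (32,))  # indices of metaclasses/clusters

loss_fn = losses.ManifoldLoss(l=128)  # l = embedding size (assumed parameter name)
loss = loss_fn(embeddings, indices_tuple=cluster_labels)  # with cluster indices
loss_random = loss_fn(embeddings)  # metaclasses initialized with random embeddings
```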

@KevinMusgrave
Owner

@domenicoMuscill0 Ok, makes sense. Thanks for implementing these difficult loss functions!

KevinMusgrave merged commit 5e1bbec into KevinMusgrave:dev on Jun 18, 2023
Successfully merging this pull request may close these issues:

Manifold similarity loss
P2SGrad: Refined Gradients for Optimizing Deep Face Models