🐛 Bug description
Because of the division here, and in similar functions, the loss is returned at the wrong scale when using gradient_accumulation_steps: the logged training loss ends up roughly gradient_accumulation_steps times smaller than the true per-batch loss, which makes it confusingly low in comparison to the valid loss.
One option is to apply the division only in the backward call, i.e.:
scaler.scale(loss / gradient_accumulation_steps).backward()
or one could multiply the loss by gradient_accumulation_steps again before returning it, so the reported value keeps its original scale.
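
For concreteness, a minimal self-contained sketch of the first option, assuming a standard torch.cuda.amp.GradScaler loop; the dummy model, data, and hyperparameters below are placeholders for illustration, not code from this repository:

```python
import torch
from torch import nn

# Placeholder model/optimizer/data, only for illustration.
model = nn.Linear(10, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.MSELoss()
scaler = torch.cuda.amp.GradScaler(enabled=torch.cuda.is_available())
gradient_accumulation_steps = 4

data = [(torch.randn(8, 10), torch.randn(8, 1)) for _ in range(8)]

optimizer.zero_grad()
for step, (inputs, targets) in enumerate(data):
    with torch.cuda.amp.autocast(enabled=torch.cuda.is_available()):
        outputs = model(inputs)
        # Unscaled loss: this is the value to log, directly comparable to the valid loss.
        loss = criterion(outputs, targets)

    # Divide only for the backward pass, so the accumulated gradients are averaged
    # while the logged loss keeps its original scale.
    scaler.scale(loss / gradient_accumulation_steps).backward()

    if (step + 1) % gradient_accumulation_steps == 0:
        scaler.step(optimizer)
        scaler.update()
        optimizer.zero_grad()

    # No longer ~1/gradient_accumulation_steps of the true per-batch loss.
    print(f"step {step}: train loss {loss.item():.4f}")
```

The second option is equivalent for logging purposes: keep dividing where the loss is computed, but multiply the returned value by gradient_accumulation_steps before it is recorded, so the gradients are unchanged and only the reported number is rescaled.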