Make transforms work on video tensor inputs or batch of images

## 🚀 Feature

Following https://github.com/pytorch/vision/issues/2292#issuecomment-671325017 and discussion with @fmassa and @bjuncek , this feature request is to improve the transforms module to support video inputs as `torch.Tensor` of shape `(C, T, H, W)`, where C - number of image channels (e.g. 3 for RGB), T - number of frames, H, W - image dimensions. 

Points to discuss:
- Convention for geometric transforms: 2 last dimensions are H, W ?
- Convention for color transforms: 1 dimension is color, like above `(C, T, H, W)` ?
  - should we also support `(T, C, H, W)` ?



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make transforms work on video tensor inputs or batch of images #2583

🚀 Feature

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Make transforms work on video tensor inputs or batch of images #2583

Description

🚀 Feature

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions