Make transforms work on video tensor inputs or batch of images #2583

Closed
@vfdev-5

Description

🚀 Feature

Following #2292 (comment) and a discussion with @fmassa and @bjuncek, this feature request proposes improving the transforms module to support video inputs given as a torch.Tensor of shape (C, T, H, W), where C is the number of image channels (e.g. 3 for RGB), T is the number of frames, and H, W are the image height and width.

Points to discuss:

  • Convention for geometric transforms: are the last 2 dimensions always H, W?
  • Convention for color transforms: is the first dimension the color channel, as in (C, T, H, W) above?
    • Should we also support (T, C, H, W)?
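
To make the two conventions concrete, here is a minimal sketch in plain PyTorch. It is not the torchvision implementation: `hflip` and `to_grayscale` are hypothetical helpers written for illustration, and the grayscale weights are the common ITU-R 601 luma coefficients, chosen here as an assumption. It shows that a geometric op relying only on "the last two dimensions are H, W" works for either layout, while a color op has to be told which dimension holds the channels.

```python
import torch


def hflip(x: torch.Tensor) -> torch.Tensor:
    # Hypothetical helper: geometric op that assumes only that the last
    # dimension is W, so it works for (C, H, W), (C, T, H, W), (T, C, H, W), ...
    return x.flip(-1)


def to_grayscale(x: torch.Tensor, channel_dim: int) -> torch.Tensor:
    # Hypothetical helper: color op that must know where the channels live.
    if x.shape[channel_dim] != 3:
        raise ValueError("expected 3 channels along channel_dim")
    # Assumed luma weights (ITU-R 601); shaped to broadcast along channel_dim.
    weights = torch.tensor([0.299, 0.587, 0.114], dtype=x.dtype)
    shape = [1] * x.dim()
    shape[channel_dim] = 3
    return (x * weights.reshape(shape)).sum(dim=channel_dim, keepdim=True)


# A random "video": 3 channels, 8 frames, 32x48 pixels, in (C, T, H, W) layout.
video_cthw = torch.rand(3, 8, 32, 48)
# The same data viewed in (T, C, H, W) layout.
video_tchw = video_cthw.permute(1, 0, 2, 3)

flipped = hflip(video_cthw)                            # layout-agnostic
gray_cthw = to_grayscale(video_cthw, channel_dim=0)    # channels first
gray_tchw = to_grayscale(video_tchw, channel_dim=1)    # channels second
```

In this sketch the geometric helper needs no layout argument, whereas supporting both (C, T, H, W) and (T, C, H, W) in color transforms would require either an explicit channel-dimension parameter or a fixed convention.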
