Closed
Description
🚀 Feature
Following #2292 (comment) and discussion with @fmassa and @bjuncek , this feature request is to improve the transforms module to support video inputs as torch.Tensor
of shape (C, T, H, W)
, where C - number of image channels (e.g. 3 for RGB), T - number of frames, H, W - image dimensions.
Points to discuss:
- Convention for geometric transforms: 2 last dimensions are H, W ?
- Convention for color transforms: 1 dimension is color, like above
(C, T, H, W)
?- should we also support
(T, C, H, W)
?
- should we also support