Closed
Description
🚀 Feature
Implement the Stochastic Depth as described in Deep Networks with Stochastic Depth to randomly drop residual branches in residual architectures.
The operator should support 2 modes, dropping the entire batch as described on the original paper and dropping individual rows from the batch as utilized on the EfficientNet paper. We should provide both a functional and a Layer form.