If the random regions were sampled from the full dataset instead, could that improve performance further? Have you experimented with this? (Currently, random regions are sampled from the mini-batch.)