**Describe the bug** At dataset creation, the dataset generated will always get the cached version despite change in file. **To Reproduce** 1. Run `toolkit.py` 2. Ctrl-C 3. Add a line in the dataset 4. `toolkit.py` will not create a new dataset with desired changes **Expected behavior** 1. Dataset to be generated with new data **Environment:** - OS: `Ubuntu`