```[tasklist] ### Tasks - [ ] Add barebone MoE without expert parallelism - [ ] Prototype expert parallel with MoE - [ ] e2e integration and perf validation with torchtrain ```