😯
SIU
Reinforcement Learning PhD | Intern @facebookresearch | Building autonomous decision-making systems | Prev @microsoft @deepseek-ai
- National University of Singapore
- Singapore (UTC +08:00)
- benjamin-eecs.github.io
- @Benjamin_eecs
- in/bo-liu-eecs
- https://huggingface.co/Benjamin-eecs
- https://benjamin-eecs.medium.com/
Pinned
- spiral-rl/spiral (Public)
  SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
- deepseek-ai/DeepSeek-V2 (Public)
  DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
- deepseek-ai/DeepSeek-VL (Public)
  DeepSeek-VL: Towards Real-World Vision-Language Understanding
- metaopt/torchopt (Public)
  TorchOpt is an efficient library for differentiable optimization built upon PyTorch.
- sail-sg/envpool (Public)
  C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.