-
https://www.linkedin.com/in/drabovich/
- Palo Alto, CA
-
23:25
(UTC -08:00)
Pinned Loading
-
-
quickreduce
quickreduce PublicForked from mk1-project/quickreduce
QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.
C++
-
TensorScope
TensorScope PublicEasily benchmark training of any model by (op type+parameters) to spot real bottlenecks. Catches more details then native tfprof, especially for RNN/LSTM models
HTML 1
-
-
DeepSpeed
DeepSpeed PublicForked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python 1
-
Kimi-Linear
Kimi-Linear PublicForked from MoonshotAI/Kimi-Linear
Up to 6x decoding throughput improvement for context as long as 1M tokens, reduction of KV cache by up to 75%
If the problem persists, check the GitHub status page or contact support.

