Popular repositories Loading
-
-
-
JMTRandom
JMTRandom PublicForked from okamumu/JMTRandom
A Java library for Mersenne twister and SMFT
Java
-
metaseq
metaseq PublicForked from facebookresearch/metaseq
Repo for external large-scale work
Python
-
Repositories
- OpenRLHF Public Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
linkessence/OpenRLHF’s past year of commit activity - DeepEP Public Forked from deepseek-ai/DeepEP
DeepEP: an efficient expert-parallel communication library
linkessence/DeepEP’s past year of commit activity - simpleRL-reason Public Forked from hkust-nlp/simpleRL-reason
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
linkessence/simpleRL-reason’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…