Repositories list
98 repositories
oat (Public): 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
jrystal (Public): A JAX-based Differentiable Density Functional Theory Framework for Materials
d4ft (Public)
Precision-RL (Public)
Precision-RL-verl (Public)
NDA (Public)
SkyLadder (Public)
tty-use (Public)
imperceptible-jailbreaks (Public)
variational-reasoning (Public)
LifelongSafetyAlignment (Public)
autofd (Public): Automatic Functional Differentiation in JAX
BanditSpec (Public)
understand-r1-zero (Public)
AnytimeReasoner (Public)
LongSpec (Public)
Attention-Sink (Public): [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
VeriFree (Public)
Adan (Public)
TreeMeshGPT (Public)
ContinualBench (Public)
FlowReasoner (Public)
Meta-Unlearning (Public)
LightTrans (Public)
ActivePRM (Public)
dice (Public): Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
oat-zero (Public)