For deep RL and the future of AI.
PyTorch code for Neural Networks and Deep Learning written by Michael Nielsen
Made for a reading group at the Center for Safe AGI.
PyTorch implementation of "From Sparse to Soft Mixtures of Experts"
800,000 step-level correctness labels on LLM solutions to MATH problems
Formalizing stochastic doubly-efficient debate