  • Using Transformers to teach Transformers how to train Transformers

  • Forecasting Deep Learning Dynamics for Hyperparameter Tuning

  • Discrete Autoencoders

  • Exploration by Random Network Distillation

  • PureScript & Screeps