Author of the publication

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

, , , , and . (2022)cite arxiv:2201.02177Comment: Correspondence to alethea@openai.com. Code available at: https://github.com/openai/grok.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Scaling Language Models: Methods, Analysis & Insights from Training Gopher., , , , , , , , , and 70 other author(s). CoRR, (2021)AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning., , , , , , , , , and 14 other author(s). CoRR, (2023)Evaluating Large Language Models Trained on Code., , , , , , , , , and 48 other author(s). CoRR, (2021)Relational Deep Reinforcement Learning., , , , , , , , , and 6 other author(s). CoRR, (2018)Formal Mathematics Statement Curriculum Learning., , , , , and . ICLR, OpenReview.net, (2023)Everware toolkit. Supporting reproducible science and challenge-driven education., , , and . CoRR, (2017)SunPy: A Python package for Solar Physics., , , , , , , , , and 114 other author(s). J. Open Source Softw., 5 (46): 1832 (2020)Unsupervised Neural Machine Translation with Generative Language Models Only., , , , , , , , , and 1 other author(s). CoRR, (2021)Deep reinforcement learning with relational inductive biases., , , , , , , , , and 6 other author(s). ICLR (Poster), OpenReview.net, (2019)Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer., , , , , , , , , and . CoRR, (2022)