Author of the publication

Randomized Positional Encodings Boost Length Generalization of Transformers.

, , , , , , , and . ACL (2), page 1889-1903. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

The Arcade Learning Environment: An Evaluation Platform for General Agents (Extended Abstract)., , , and . IJCAI, page 4148-4152. AAAI Press, (2015)Reinforcement Learning with Information-Theoretic Actuation., , and . AGI, volume 13539 of Lecture Notes in Computer Science, page 188-198. Springer, (2022)Gated Linear Networks., , , , , , , , and . CoRR, (2019)Language Modeling Is Compression., , , , , , , , , and 2 other author(s). CoRR, (2023)Shaking the foundations: delusions in sequence models for interaction and control., , , , , , , , , and 9 other author(s). CoRR, (2021)Reinforcement Learning with Information-Theoretic Actuation., , and . CoRR, (2021)Neural Networks and the Chomsky Hierarchy, , , , , , , , , and 1 other author(s). The Eleventh International Conference on Learning Representations, (2023)Learning Universal Predictors., , , , , , , , , and 1 other author(s). CoRR, (2024)A Combinatorial Perspective on Transfer Learning., , , , and . NeurIPS, (2020)Online Learning in Contextual Bandits using Gated Linear Networks., , , , and . NeurIPS, (2020)