Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Scaling Transformer to 1M tokens and beyond with RMT., , and . CoRR, (2023)In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss., , , , , and . CoRR, (2024)Knowledge Distillation of Russian Language Models with Reduction of Vocabulary., , , and . CoRR, (2022)Recurrent Memory Transformer., , and . NeurIPS, (2022)DeepPavlov: Open-Source Library for Dialogue Systems., , , , , , , , , and 10 other author(s). ACL (4), page 122-127. Association for Computational Linguistics, (2018)Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker., , , , , and . CoRR, (2020)Tuning Multilingual Transformers for Language-Specific Named Entity Recognition., , , and . BSNLP@ACL, page 89-93. Association for Computational Linguistics, (2019)Beyond Attention: Breaking the Limits of Transformer Context Length with Recurrent Memory., , , and . AAAI, page 17700-17708. AAAI Press, (2024)Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language., and . CoRR, (2019)Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information., , , and . EMNLP (Findings), page 5306-5316. Association for Computational Linguistics, (2023)