S. Arora, A. May, J. Zhang, and C. Ré. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, page 2650--2663. Online, Association for Computational Linguistics, (July 2020)
S. Merity. (2019)cite arxiv:1911.11423Comment: Addition of citations and contextual results (no attention head, single attention head, attention per layer), removal of wordpiece WikiText-103 numbers due to normalization issues, fix of SHA attention figure Q arrow, other minor fixes.
H. Spieker, A. Gotlieb, D. Marijan, and M. Mossige. (2018)cite arxiv:1811.04122Comment: Spieker, H., Gotlieb, A., Marijan, D., & Mossige, M. (2017). Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration. In Proceedings of 26th International Symposium on Software Testing and Analysis (ISSTA'17) (pp. 12--22). ACM.
H. Law, and J. Deng. (2018)cite arxiv:1808.01244Comment: Extended version with additional results. Test AP on MS COOO improved from 42.1% to 42.2% after a bug fix.