копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

German's Next Language Model

B. Chan, S. Schweter, и T. Möller. (2020)cite arxiv:2010.10906Comment: Accepted by COLING2020.

Аннотация

In this work we present the experiments which lead to the creation of our BERT and ELECTRA based German language models, GBERT and GELECTRA. By varying the input training data, model size, and the presence of Whole Word Masking (WWM) we were able to attain SoTA performance across a set of document classification and named entity recognition (NER) tasks for both models of base and large size. We adopt an evaluation driven approach in training these models and our results indicate that both adding more data and utilizing WWM improve model performance. By benchmarking against existing German models, we show that these models are the best German models to date. Our trained models will be made publicly available to the research community.

Описание

German's Next Language Model

Линки и ресурсы

ключ BibTeX: chan2020germans
тип записи: misc
год: 2020
url: http://arxiv.org/abs/2010.10906
Примечание: cite arxiv:2010.10906Comment: Accepted by COLING2020

тэги

@albinzehe- тэги данного пользователя выделены

Цитировать эту публикацию

искать в

Метаданные

Последнее изменение год назад
Создан год назад

Комментарии и рецензии
(0)

Комментарии, или рецензии отсутствуют. Вы можете их написать!