Abstract
Deep active learning (DAL) seeks to reduce annotation costs by enabling the model to actively query annotations for the instances from which it expects to learn the most. Despite extensive research, there is currently no standardized evaluation protocol for transformer-based language models in the field of DAL. Diverse experimental settings lead to difficulties in comparing research and deriving recommendations for practitioners. To tackle this challenge, we propose the ActiveGLAE benchmark, a comprehensive collection of data sets and evaluation guidelines for assessing DAL. Our benchmark aims to facilitate and streamline the evaluation process of novel DAL strategies. Additionally, we provide an extensive overview of current practice in DAL with transformer-based language models. We identify three key challenges (data set selection, model training, and DAL settings) that pose difficulties in comparing query strategies. We establish baseline results through an extensive set of experiments as a reference point for evaluating future work. Based on our findings, we provide guidelines for researchers and practitioners.