Active Learning for Efficient Audio Annotation and Classification with a Large Amount of Unlabeled Data

Y. Wang, A. Mendez Mendez, M. Cartwright, и J. Bello.
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), стр. 880-884. (мая 2019)
DOI: 10.1109/ICASSP.2019.8683063

Аннотация

There are many sound classification problems that have target classes which are rare or unique to the context of the problem. For these problems, existing data sets are not sufficient and we must create new problem-specific datasets to train classification models. However, annotating a new dataset for every new problem is costly. Active learning could potentially reduce this annotation cost, but it has been understudied in the context of audio annotation. In this work, we investigate active learning to reduce the annotation cost of a sound classification dataset unique to a particular problem. We evaluate three certainty-based active learning query strategies and propose a new strategy: alternating confidence sampling. Using this strategy, we demonstrate reduced annotation costs when actively training models with both experts and non-experts, and we perform a qualitative analysis on 20k unlabeled recordings to show our approach results in a model that generalizes well to unseen data.

ключ BibTeX: 8683063
тип записи: inproceedings
название книги: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
год: 2019
месяц: may
страницы: 880-884
issn: 2379-190X
DOI: 10.1109/ICASSP.2019.8683063
url: https://ieeexplore.ieee.org/abstract/document/8683063

тэги

Пользователи данного ресурса

Комментарии и рецензиипоказать / перейти в невидимый режим

Пожалуйста, войдите в систему, чтобы принять участие в дискуссии (добавить собственные рецензию, или комментарий)

Цитировать эту публикацию

@inproceedings{8683063, abstract = {There are many sound classification problems that have target classes which are rare or unique to the context of the problem. For these problems, existing data sets are not sufficient and we must create new problem-specific datasets to train classification models. However, annotating a new dataset for every new problem is costly. Active learning could potentially reduce this annotation cost, but it has been understudied in the context of audio annotation. In this work, we investigate active learning to reduce the annotation cost of a sound classification dataset unique to a particular problem. We evaluate three certainty-based active learning query strategies and propose a new strategy: alternating confidence sampling. Using this strategy, we demonstrate reduced annotation costs when actively training models with both experts and non-experts, and we perform a qualitative analysis on 20k unlabeled recordings to show our approach results in a model that generalizes well to unseen data.}, added-at = {2022-09-08T17:29:49.000+0200}, author = {Wang, Yu and Mendez Mendez, Ana Elisa and Cartwright, Mark and Bello, Juan Pablo}, biburl = {https://www.bibsonomy.org/bibtex/2cee2499a7232ba11d85bb281c1bad06c/simonha94}, booktitle = {ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, description = {Active Learning for Efficient Audio Annotation and Classification with a Large Amount of Unlabeled Data | IEEE Conference Publication | IEEE Xplore}, doi = {10.1109/ICASSP.2019.8683063}, interhash = {fc31eeaf8eaef76ff70a66a724089a4b}, intrahash = {cee2499a7232ba11d85bb281c1bad06c}, issn = {2379-190X}, keywords = {acoustics youtube}, month = may, pages = {880-884}, timestamp = {2022-09-08T17:29:49.000+0200}, title = {Active Learning for Efficient Audio Annotation and Classification with a Large Amount of Unlabeled Data}, url = {https://ieeexplore.ieee.org/abstract/document/8683063}, year = 2019 }

BibSonomy