Stream-Based Active Learning in Changing Environments under Verification Latency
T. Pham. Organic Computing -- Doctoral Dissertation Colloquium 2021, kassel university press, (2022)
Abstract
Labeling data often requires human interaction, which is slow and costly depending on the required expertise.
Stream-based active learning reduces the cost of labeling for data stream mining by only querying the labels for instances that improve the classifier the most. However, state-of-the-art stream-based active learning strategies assume that verification latency, i.e., the time between a query and the availability of the queried label, is negligible. In reality, this is usually not the case as the experts who provide such labels are not available all the time, and the labeling process itself requires time.
This article investigates the impact of verification latency within a stream-based active learning scenario for changing data streams. We focus on two different types of changes that occur over time, i.e., changes in the number of features and changes in the distribution of instances and/or labels. In particular, we identify challenges that occur due to verification latency and propose ideas to overcome these problems within a stream-based active learning strategy.
%0 Book Section
%1 pham2022stream-based
%A Pham, Tuan
%B Organic Computing -- Doctoral Dissertation Colloquium 2021
%D 2022
%E Tomforde, Sven
%E Krupitzer, Christian
%I kassel university press
%K imported itegpub isac-www
%P 152--164
%T Stream-Based Active Learning in Changing Environments under Verification Latency
%X Labeling data often requires human interaction, which is slow and costly depending on the required expertise.
Stream-based active learning reduces the cost of labeling for data stream mining by only querying the labels for instances that improve the classifier the most. However, state-of-the-art stream-based active learning strategies assume that verification latency, i.e., the time between a query and the availability of the queried label, is negligible. In reality, this is usually not the case as the experts who provide such labels are not available all the time, and the labeling process itself requires time.
This article investigates the impact of verification latency within a stream-based active learning scenario for changing data streams. We focus on two different types of changes that occur over time, i.e., changes in the number of features and changes in the distribution of instances and/or labels. In particular, we identify challenges that occur due to verification latency and propose ideas to overcome these problems within a stream-based active learning strategy.
@incollection{pham2022stream-based,
abstract = {Labeling data often requires human interaction, which is slow and costly depending on the required expertise.
Stream-based active learning reduces the cost of labeling for data stream mining by only querying the labels for instances that improve the classifier the most. However, state-of-the-art stream-based active learning strategies assume that verification latency, i.e., the time between a query and the availability of the queried label, is negligible. In reality, this is usually not the case as the experts who provide such labels are not available all the time, and the labeling process itself requires time.
This article investigates the impact of verification latency within a stream-based active learning scenario for changing data streams. We focus on two different types of changes that occur over time, i.e., changes in the number of features and changes in the distribution of instances and/or labels. In particular, we identify challenges that occur due to verification latency and propose ideas to overcome these problems within a stream-based active learning strategy.},
added-at = {2022-07-20T14:57:02.000+0200},
author = {Pham, Tuan},
biburl = {https://www.bibsonomy.org/bibtex/2c7c756b490bebadf955cac7d4b1c6d59/ies},
booktitle = {Organic Computing -- Doctoral Dissertation Colloquium 2021},
editor = {Tomforde, Sven and Krupitzer, Christian},
interhash = {545f4ad8cd40501325c60a49b49fd945},
intrahash = {c7c756b490bebadf955cac7d4b1c6d59},
keywords = {imported itegpub isac-www},
pages = {152--164},
publisher = {kassel university press},
timestamp = {2022-07-20T14:57:02.000+0200},
title = {Stream-Based Active Learning in Changing Environments under Verification Latency},
year = 2022
}