@kmd-ovgu

Joining Imputation and Active Feature Acquisition for Cost Saving on Data Streams with Missing Features

, , and . Discovery Science, page 308-322. Springer Nature Switzerland, (2023)
DOI: 10.1007/978-3-031-45275-8_21

Abstract

Replacing missing features in data streams is an important task in order to enable many machine learning algorithms that require feature-complete instances for training and prediction. Two popular methods for dealing with missing features are imputation and active feature acquisition, where in the former missing values are approximated, whereas in the latter, missing features are provided by an expert for a cost and within a limited budget. In this work, we present a hybridized approach, where we employ an active feature acquisition method in the first stage to pick candidate features on which we would require a costly expert and then check in a second stage how well we could impute these candidate features. If the imputation is expected to be of a certain quality, we skip the purchase and impute instead. We provide a framework for such a scenario and used it to run extensive experiments. Our results on 6 data sets show that our proposed method can achieve a similar classification performance while spending 1\% to 27\% less budget.

Links and resources

Tags

community

  • @kmd-ovgu
  • @dblp
@kmd-ovgu's tags highlighted