Abstract
Replacing missing features in data streams is an important task in order to enable many machine learning algorithms that require feature-complete instances for training and prediction. Two popular methods for dealing with missing features are imputation and active feature acquisition, where in the former missing values are approximated, whereas in the latter, missing features are provided by an expert for a cost and within a limited budget. In this work, we present a hybridized approach, where we employ an active feature acquisition method in the first stage to pick candidate features on which we would require a costly expert and then check in a second stage how well we could impute these candidate features. If the imputation is expected to be of a certain quality, we skip the purchase and impute instead. We provide a framework for such a scenario and used it to run extensive experiments. Our results on 6 data sets show that our proposed method can achieve a similar classification performance while spending 1\% to 27\% less budget.
Users
Please
log in to take part in the discussion (add own reviews or comments).