
Addressing the Curse of Imbalanced Training Sets: One-Sided Selection

M. Kubat and S. Matwin.
In Proceedings of the Fourteenth International Conference on Machine Learning, pages 179--186. Morgan Kaufmann (1997)

Abstract

Adding examples of the majority class to the training set can have a detrimental effect on the learner's behavior: noisy or otherwise unreliable examples from the majority class can overwhelm the minority class. The paper discusses criteria to evaluate the utility of classifiers induced from such imbalanced training sets, gives an explanation of the poor behavior of some learners under these circumstances, and suggests as a solution a simple technique called one-sided selection of examples.

1 Introduction

The general topic of this paper is learning from examples described by pairs (x, c(x)), where x is a vector of attribute values and c(x) is the corresponding concept label. For simplicity, we consider only problems where c(x) is either positive or negative, and all attributes are continuous. Since Fisher (1936), this task has received plenty of attention from statisticians as well as from researchers in artificial neural networks, AI, and ML. A typical scenario assumes the e...

BibTeX key: Kubat97addressingthe
entry type: inproceedings
booktitle: Proceedings of the Fourteenth International Conference on Machine Learning
year: 1997
pages: 179--186
publisher: Morgan Kaufmann
url: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.43.4487
