@utahell

Efficient incremental constrained clustering

, , and . KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, page 240--249. New York, NY, USA, ACM, (2007)
DOI: 10.1145/1281192.1281221

Abstract

Clustering with constraints is an emerging area of data mining research. However, most work assumes that the constraints are given as one large batch. In this paper we explore the situation where the constraints are incrementally given. In this way the user after seeing a clustering can provide positive and negative feedback via constraints to critique a clustering solution. We consider the problem of efficiently updating a clustering to satisfy the new and old constraints rather than reclustering the entire data set. We show that the problem of incremental clustering under constraints is NP-hard in general, but identify several sufficient conditions which lead to efficiently solvable versions. These translate into a set of rules on the types of constraints thatcan be added and constraint set properties that must be maintained. We demonstrate that this approach is more efficient than re-clustering the entire data set and has several other advantages.

Description

Efficient incremental constrained clustering

Links and resources

Tags

community

  • @dblp
  • @utahell
@utahell's tags highlighted