Article,

BIRCH: A New Data Clustering Algorithm and Its Applications

, , and .
Data Mining and Knowledge Discovery, 1 (2): 141--182 (Jun 1, 1997)
DOI: 10.1023/a:1009783824328

Abstract

Data clustering is an important technique for exploratory data analysis, and has been studied for several years. It has been shown to be useful in many practical domains such as data classification and image processing. Recently, there has been a growing emphasis on exploratory analysis of very large datasets to discover useful patterns and/or correlations among attributes. This is called data mining, and data clustering is regarded as a particular branch. However existing data clustering methods do not adequately address the problem of processing large datasets with a limited amount of resources (e.g., memory and cpu cycles). So as the dataset size increases, they do not scale up well in terms of memory requirement, running time, and result quality.

Tags

Users

  • @cdevries
  • @karthikraman
  • @pierpaolo.pk81
  • @jbeneke
  • @dblp

Comments and Reviews