Generalization in Clustering with Unobserved Features

Abstract

We argue that when objects are characterized by many attributes, clus- tering them on the basis of a relatively small random subset of these attributes can capture information on the unobserved attributes as well. Moreover, we show that under mild technical conditions, clustering the objects on the basis of such a random subset performs almost as well as clustering with the full attribute set. We prove a finite sample general- ization theorems for this novel learning scheme that extends analogous results from the supervised learning setting. The scheme is demonstrated for collaborative filtering of users with movies rating as attributes.

BibTeX key: krupka-generalization-clustering-unobserved-2005
entry type: incollection
address: Cambridge, MA
booktitle: Advances in Neural Information Processing Systems 18
year: 2006
pages: 683--690
publisher: MIT Press

BibSonomy

Generalization in Clustering with Unobserved Features

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on