Discovering Structure in High-Dimensional Data Through Correlation Explanation.

Abstract

We introduce a method to learn a hierarchy of successively more abstract representations of complex data based on optimizing an information-theoretic objective. Intuitively, the optimization searches for a set of latent factors that best explain the correlations in the data as measured by multivariate mutual information. The method is unsupervised, requires no model assumptions, and scales linearly with the number of variables which makes it an attractive approach for very high dimensional systems. We demonstrate that Correlation Explanation (CorEx) automatically discovers meaningful structure for data from diverse sources including personality tests, DNA, and human language.

BibTeX key: steeg2014discovering
entry type: preprint
year: 2014
journal: CoRR
volume: abs/1406.1222
type: Publication
url: http://arxiv.org/abs/1406.1222
note: cite arxiv:1406.1222Comment: 15 pages, 6 figures. Includes supplementary material and link to code. Published in the proceedings of the 28th Annual Conference on Neural Information Processing Systems, NIPS 2014

Users

Comments and Reviewsshow / hide

This publication ist of type "preprint". To see comments and reviews from other users, you have to create your own comment or review for this post first.

Please log in to take part in the discussion (add own reviews or comments).

BibSonomy

Discovering Structure in High-Dimensional Data Through Correlation Explanation.

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on