Abstract

Data represented geometrically in high-dimensional vector spaces can be found in many applications. Images and videos are often represented by assigning a dimension to every pixel (and time step). Text documents may be represented in a vector space where each word in the dictionary incurs a dimension. The need to manipulate such data in huge corpora such as the web, and to support various query types, gives rise to the question of how to represent the data in a lower-dimensional space to allow more space- and time-efficient computation. Linear mappings are an attractive approach to this problem because the mapped input can be readily fed into popular algorithms that operate on linear spaces (such as principal-component analysis, PCA) while avoiding the curse of dimensionality. The fact that such mappings even exist became known in computer science following seminal work by Johnson and Lindenstrauss in the early 1980s. The underlying technique is often called "random projection." The complexity of the mapping itself, essentially the product of a vector with a dense matrix, did not attract much attention until recently. In 2006, we discovered a way to "sparsify" the matrix via a computational version of Heisenberg's Uncertainty Principle. This led to a significant speedup, which also retained the practical simplicity of the standard Johnson-Lindenstrauss projection. We describe the improvement in this article, together with some of its applications.
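To make the contrast concrete, below is a minimal sketch (not the article's reference implementation) of the two ideas named in the abstract: a classic dense Johnson-Lindenstrauss projection, and a fast variant that first "spreads out" the vector with a random sign flip and a Walsh-Hadamard transform so that a sparse projection matrix suffices. The function names (`dense_jl`, `fwht`, `fast_jl`) and the sparsity parameter `q` are illustrative choices, and the scaling constants are simplified relative to the published analysis.

```python
import numpy as np

def dense_jl(x, k, rng):
    """Classic JL projection: multiply x by a dense k x d Gaussian matrix.
    Costs O(k*d) time per vector."""
    d = x.shape[0]
    A = rng.normal(scale=1.0 / np.sqrt(k), size=(k, d))
    return A @ x

def fwht(x):
    """Orthonormal Walsh-Hadamard transform in O(d log d); d must be a power of 2."""
    y = x.copy()
    d = y.shape[0]
    h = 1
    while h < d:
        for i in range(0, d, 2 * h):
            a = y[i:i + h].copy()
            b = y[i + h:i + 2 * h].copy()
            y[i:i + h] = a + b
            y[i + h:i + 2 * h] = a - b
        h *= 2
    return y / np.sqrt(d)

def fast_jl(x, k, rng, q=0.1):
    """Sketch of the fast variant: random signs D, then Hadamard H to spread
    the vector's mass (the "uncertainty principle" step), then a sparse
    random projection P with nonzero probability q per entry."""
    d = x.shape[0]
    signs = rng.choice([-1.0, 1.0], size=d)          # D: random diagonal of +/-1
    z = fwht(signs * x)                              # H D x, computed in O(d log d)
    mask = rng.random((k, d)) < q                    # keep only a q-fraction of entries
    P = np.where(mask,
                 rng.normal(scale=1.0 / np.sqrt(q * k), size=(k, d)),
                 0.0)
    return P @ z

# Tiny usage check: both maps roughly preserve the norm of a random vector.
rng = np.random.default_rng(0)
x = rng.normal(size=1024)
print(np.linalg.norm(x),
      np.linalg.norm(dense_jl(x, 64, rng)),
      np.linalg.norm(fast_jl(x, 64, rng)))
```

The scaling of the sparse matrix is chosen so that the squared norm is preserved in expectation; the preconditioning step is what makes a sparse (and hence cheap to apply) projection safe for every input vector, not just "spread-out" ones.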

Description

Faster dimension reduction
