Towards explicit semantic features using independent component analysis

Abstract

Latent semantic analysis (LSA) can be used to create an implicit semantic vectorial representation for words. Independent component analysis (ICA) can be derived as an extension to LSA that rotates the latent semantic space so that it becomes explicit, that is, the features correspond more with those resulting from human cognitive activity. This enables nonlinear filtering of the features, such as thresholding that forces sparse ICA components for words. We will demonstrate this with multiple choice semantic vocabulary tests generated from a multilingual thesaurus. The experiments are conducted in English, Finnish and Swedish.

BibTeX key: Vayrynen-Honkela-Lindqvist:2007:SCAR
entry type: inproceedings
address: Stockholm, Sweden
booktitle: Proceedings of the Workshop Semantic Content Acquisition and Representation (SCAR)
year: 2007
publisher: Swedish Institute of Computer Science
note: SICS Technical Report T2007-06

BibSonomy

Towards explicit semantic features using independent component analysis

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on