Towards explicit semantic features using independent component analysis
J. Väyrynen, T. Honkela, and L. Lindqvist. Proceedings of the Workshop Semantic Content Acquisition and Representation (SCAR), Stockholm, Sweden, Swedish Institute of Computer Science, (2007)SICS Technical Report T2007-06.
Abstract
Latent semantic analysis (LSA) can be used to create an implicit semantic vectorial representation for words. Independent component analysis (ICA) can be derived as an extension to LSA that rotates the latent semantic space so that it becomes explicit, that is, the features correspond more with those resulting from human cognitive activity. This enables nonlinear filtering of the features, such as thresholding that forces sparse ICA components for words. We will demonstrate this with multiple choice semantic vocabulary tests generated from a multilingual thesaurus. The experiments are conducted in English, Finnish and Swedish.
%0 Conference Paper
%1 Vayrynen-Honkela-Lindqvist:2007:SCAR
%A Väyrynen, Jaakko J.
%A Honkela, Timo
%A Lindqvist, Lasse
%B Proceedings of the Workshop Semantic Content Acquisition and Representation (SCAR)
%C Stockholm, Sweden
%D 2007
%I Swedish Institute of Computer Science
%K context feature ica semantic synonym thresholding word
%T Towards explicit semantic features using independent component analysis
%X Latent semantic analysis (LSA) can be used to create an implicit semantic vectorial representation for words. Independent component analysis (ICA) can be derived as an extension to LSA that rotates the latent semantic space so that it becomes explicit, that is, the features correspond more with those resulting from human cognitive activity. This enables nonlinear filtering of the features, such as thresholding that forces sparse ICA components for words. We will demonstrate this with multiple choice semantic vocabulary tests generated from a multilingual thesaurus. The experiments are conducted in English, Finnish and Swedish.
@inproceedings{Vayrynen-Honkela-Lindqvist:2007:SCAR,
abstract = {Latent semantic analysis (LSA) can be used to create an implicit semantic vectorial representation for words. Independent component analysis (ICA) can be derived as an extension to LSA that rotates the latent semantic space so that it becomes explicit, that is, the features correspond more with those resulting from human cognitive activity. This enables nonlinear filtering of the features, such as thresholding that forces sparse ICA components for words. We will demonstrate this with multiple choice semantic vocabulary tests generated from a multilingual thesaurus. The experiments are conducted in English, Finnish and Swedish.
},
added-at = {2008-11-26T15:16:54.000+0100},
address = {Stockholm, Sweden},
author = {V{\"{a}}yrynen, Jaakko J. and Honkela, Timo and Lindqvist, Lasse},
biburl = {https://www.bibsonomy.org/bibtex/22076f357079829c526581171c3a49bee/jjv},
booktitle = {Proceedings of the Workshop Semantic Content Acquisition and Representation (SCAR)},
interhash = {f71b245117864a05f77d8f7f98a52e13},
intrahash = {2076f357079829c526581171c3a49bee},
keywords = {context feature ica semantic synonym thresholding word},
note = {SICS Technical Report T2007-06},
publisher = {Swedish Institute of Computer Science},
timestamp = {2008-11-26T15:16:54.000+0100},
title = {Towards explicit semantic features using independent component analysis},
year = 2007
}