The use of machine translation tools for cross-lingual text mining

Abstract

Eigen-analysis such as LSI or KCCA was already successfully applied to cross-lingual information retrieval. This approach has a weakness in that it needs an aligned training set of documents. In this paper we address this weakness and show that it can be successfully avoided through the use of machine translation. We show that the performance is similar on the domains where human generated training seta are available. However for other domains artificial training sets can be generated that significantly outperform human generated ones obtained from a different domain.

BibTeX key: fortuna2005machine
entry type: misc
year: 2005
type: Conference or Workshop Item, PeerReviewed
id: http://eprints.pascal-network.org/archive/00001208/
url: http://eprints.pascal-network.org/archive/00001208/

BibSonomy

The use of machine translation tools for cross-lingual text mining

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on