Abstract
We present a translation-free technique for multilingual information
retrieval. This technique is based on an ontological representation of
documents and queries. For each language, we use
a dictionary (set of lexical reference for concepts) to
map a term to its corresponding concept. The same mapping is applied
to each document and each query. Then, we use a classic
vector space model based on concept for indexing and querying the
document corpus. The main advantages of our approach are:
no merging phase is required; no dependency on automatic translators
between all pairs of languages; and adding a new language
only requires a new mapping dictionary to be added into the multilingual
ontology. Experimental results on the CLEF 2005 multi8
collection show that this approach is efficient, even with relatively
small and low fidelity dictionaries and without word
sense disambiguation.
Users
Please
log in to take part in the discussion (add own reviews or comments).