@seandalai

Automatic term recognition based on statistics of compound nouns and their components

, and . Terminology, 9 (2): 201-219 (2003)

Abstract

In this paper, we propose a new approach to enhance automatic recognition systems for domain-specific terms. The approach is based on the statistics about the relation between a compound noun and its constituents that are simple nouns. More precisely, we focus on how many nouns adjoin the noun in question to form compound nouns. We propose several scoring methods based on this approach and experimentally evaluate them on the NTCIR1 TMREC test collection. The results are very promising, especially in low and high recall.

Links and resources

Tags