@goalscoringsuperstarhero

You Can't Beat Frequency (Unless You Use Linguistic Knowledge) -- A Qualitative Evaluation of Association Measures for Collocation and Term Extraction

, and . 44th Annual Meeting of the Association for Computational Linguistics, page 785--792. Sydney, Australia, Association for Computational Linguistics, (July 2006)

Abstract

In the past years, a number of lexical association measures have been studied to help extract new scientific terminology or general-language collocations. The implicit assumption of this research was that newly designed term measures involving more sophisticated statistical criteria would outperform simple counts of cooccurrence frequencies. We here explicitly test this assumption. By way of four qualitative criteria, we show that purely statistics-based measures reveal virtually no difference compared with frequency of occurrence counts, while linguistically more informed metrics do reveal such a marked difference.

Links and resources

Tags

community