Abstract
This paper presents a supervised machine learning
approach that uses a decision tree learning algorithm for
recognition of Bengali noun-noun compounds as multiword
expression (M WE) from Bengali corpus. Our proposed
approach to MWE recognition has two steps: (1) extraction of
candidate multi-word expressions using chunk information
and various heuristic rules and (2) training the machine
learning algorithm to recognize a candidate multi-word
expression as Multi-word expression or not. A variety of
association measures have been used as features for
identifying MWEs. The proposed system is tested on a Bengali
corpus for identifying noun-noun compound MWEs from the
corpus.
Users
Please
log in to take part in the discussion (add own reviews or comments).