BibSonomy :: bibtex  ::

tag user group author concept BibTeX key search:all search:seandalai
A blue social bookmark and publication sharing system.
tags · relations · groups · popular
help · blog · about
login · register
seandalai's BibTeX entry:  

Detecting Novel Compounds: The Role of Distributional Evidence

Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, : 235--242, 2003.
Authors: Mirella Lapata and Alex Lascarides
URL: http://acl.ldc.upenn.edu/E/E03/E03-1073.pdf
Tags: 2003 compounds eacl nlp
Abstract: Research on the discovery of terms from corpora has focused on word sequences whose recurrent occurrence in a corpus is indicative of their terminological status, and has not addressed the issue of discovering terms when data is sparse. This becomes apparent in the case of noun compounding, which is extremely productive: more than half of the candidate compounds extracted from a corpus are attested only once. We show how evidence about established (i.e.,frequent) compounds can be used to estimate features that can discriminate rare valid compounds from rare nonce terms in addition to a variety of linguistic features than can be easily gleaned from corpora without relying on parsed text.
| URL | BibTeX  
@inproceedings{Lapata:Lascarides:03,
title = {Detecting Novel Compounds: The Role of Distributional Evidence},
author = {Mirella Lapata and Alex Lascarides},
booktitle = {Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics},
pages = {235--242},
url = {http://acl.ldc.upenn.edu/E/E03/E03-1073.pdf},
year = {2003},
abstract = {Research on the discovery of terms from corpora has focused on word sequences whose recurrent occurrence in a corpus is indicative of their terminological status, and has not addressed the issue of discovering terms when data is sparse. This becomes apparent in the case of noun compounding, which is extremely productive: more than half of the candidate compounds extracted from a corpus are attested only once. We show how evidence about established (i.e.,frequent) compounds can be used to estimate features that can discriminate rare valid compounds from rare nonce terms in addition to a variety of linguistic features than can be easily gleaned from corpora without relying on parsed text.},
keywords = {2003 compounds eacl nlp }
}