copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Domain-specific sense distributions and predominant sense acquisition

R. Koeling, D. McCarthy, and J. Carroll. Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, page 419--426. Stroudsburg, PA, USA, Association for Computational Linguistics, (2005)
DOI: http://dx.doi.org/10.3115/1220575.1220628

Abstract

Distributions of the senses of words are often highly skewed. This fact is exploited by word sense disambiguation (WSD) systems which back off to the predominant sense of a word when contextual clues are not strong enough. The domain of a document has a strong influence on the sense distribution of words, but it is not feasible to produce large manually annotated corpora for every domain of interest. In this paper we describe the construction of three sense annotated corpora in different domains for a sample of English words. We apply an existing method for acquiring predominant sense information automatically from raw text, and for our sample demonstrate that (1) acquiring such information automatically from a mixed-domain corpus is more accurate than deriving it from SemCor, and (2) acquiring it automatically from text in the same domain as the target domain performs best by a large margin. We also show that for an all words WSD task this automatic method is best focussed on words that are salient to the domain, and on words with a different acquired predominant sense in that domain compared to that acquired from a balanced corpus.

Links and resources

BibTeX key: koeling2005
entry type: inproceedings
address: Stroudsburg, PA, USA
booktitle: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
year: 2005
pages: 419--426
publisher: Association for Computational Linguistics
series: HLT '05
location: Vancouver, British Columbia, Canada
acmid: 1220628
numpages: 8
DOI: http://dx.doi.org/10.3115/1220575.1220628
url: http://dx.doi.org/10.3115/1220575.1220628

@jil's tags highlighted

Cite this publication

@inproceedings{koeling2005, abstract = {Distributions of the senses of words are often highly skewed. This fact is exploited by word sense disambiguation (WSD) systems which back off to the predominant sense of a word when contextual clues are not strong enough. The domain of a document has a strong influence on the sense distribution of words, but it is not feasible to produce large manually annotated corpora for every domain of interest. In this paper we describe the construction of three sense annotated corpora in different domains for a sample of English words. We apply an existing method for acquiring predominant sense information automatically from raw text, and for our sample demonstrate that (1) acquiring such information automatically from a mixed-domain corpus is more accurate than deriving it from SemCor, and (2) acquiring it automatically from text in the same domain as the target domain performs best by a large margin. We also show that for an all words WSD task this automatic method is best focussed on words that are salient to the domain, and on words with a different acquired predominant sense in that domain compared to that acquired from a balanced corpus.}, acmid = {1220628}, added-at = {2011-09-16T18:49:45.000+0200}, address = {Stroudsburg, PA, USA}, author = {Koeling, Rob and McCarthy, Diana and Carroll, John}, biburl = {https://www.bibsonomy.org/bibtex/279949d571bdf478a7a7c6a2e73693aa2/jil}, booktitle = {Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing}, doi = {http://dx.doi.org/10.3115/1220575.1220628}, interhash = {d9b13822d00e3ae3a9116c8d42156518}, intrahash = {79949d571bdf478a7a7c6a2e73693aa2}, keywords = {change distribution domain sense specific specificity word wsd}, location = {Vancouver, British Columbia, Canada}, numpages = {8}, pages = {419--426}, publisher = {Association for Computational Linguistics}, series = {HLT '05}, timestamp = {2013-11-23T20:11:51.000+0100}, title = {Domain-specific sense distributions and predominant sense acquisition}, url = {http://dx.doi.org/10.3115/1220575.1220628}, year = 2005 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Domain-specific sense distributions and predominant sense acquisition

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Domain-specific sense distributions and predominant sense acquisition

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Domain-specific sense distributions and predominant sense acquisition

Comments and Reviews
(0)