copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Disambiguating toponyms in news

E. Garbin, and I. Mani. Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, page 363--370. Stroudsburg, PA, USA, Association for Computational Linguistics, (2005)
DOI: 10.3115/1220575.1220621

Abstract

This research is aimed at the problem of disambiguating toponyms (place names) in terms of a classification derived by merging information from two publicly available gazetteers. To establish the difficulty of the problem, we measured the degree of ambiguity, with respect to a gazetteer, for toponyms in news. We found that 67.82% of the toponyms found in a corpus that were ambiguous in a gazetteer lacked a local discriminator in the text. Given the scarcity of human-annotated data, our method used unsupervised machine learning to develop disambiguation rules. Toponyms were automatically tagged with information about them found in a gazetteer. A toponym that was ambiguous in the gazetteer was automatically disambiguated based on preference heuristics. This automatically tagged data was used to train a machine learner, which disambiguated toponyms in a human-annotated news corpus at 78.5% accuracy.

Links and resources

BibTeX key: garbin2005disambiguating
entry type: inproceedings
address: Stroudsburg, PA, USA
booktitle: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
year: 2005
pages: 363--370
publisher: Association for Computational Linguistics
location: Vancouver, British Columbia, Canada
acmid: 1220621
numpages: 8
DOI: 10.3115/1220575.1220621
url: http://dx.doi.org/10.3115/1220575.1220621

@jaeschke's tags highlighted

Cite this publication

search on

Meta data

Last update 10 years ago
Created 12 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Disambiguating toponyms in news

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Disambiguating toponyms in news

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Disambiguating toponyms in news

Comments and Reviews
(0)