copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A maximum entropy approach to identifying sentence boundaries

J. Reynar, and A. Ratnaparkhi. Proceedings of the fifth conference on Applied natural language processing, page 16--19. Stroudsburg, PA, USA, Association for Computational Linguistics, (1997)
DOI: 10.3115/974557.974561

Abstract

We present a trainable model for identifying sentence boundaries in raw text. Given a corpus annotated with sentence boundaries, our model learns to classify each occurrence of., ?, and ! as either a valid or invalid sentence boundary. The training procedure requires no hand-crafted rules, lexica, part-of-speech tags, or domain-specific information. The model can therefore be trained easily on any genre of English, and should be trainable on any other Romanalphabet language. Performance is comparable to or better than the performance of similar systems, but we emphasize the simplicity of retraining for new domains.

Description

A maximum entropy approach to identifying sentence boundaries

Links and resources

BibTeX key: reynar1997
entry type: inproceedings
address: Stroudsburg, PA, USA
booktitle: Proceedings of the fifth conference on Applied natural language processing
year: 1997
pages: 16--19
publisher: Association for Computational Linguistics
series: ANLC '97
location: Washington, DC
acmid: 974561
numpages: 4
DOI: 10.3115/974557.974561
url: http://dx.doi.org/10.3115/974557.974561

@jil's tags highlighted

Cite this publication

search on

Meta data

Last update 11 years ago
Created 12 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A maximum entropy approach to identifying sentence boundaries

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML A maximum entropy approach to identifying sentence boundaries

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A maximum entropy approach to identifying sentence boundaries

Comments and Reviews
(0)