Class-based n-gram models of natural language

Abstract

We address the problem of predicting a word from previous words in a sample of text. In particular, we discuss n-gram models based on classes of words. We also discuss several statistical algorithms for assigning words to classes based on the frequency of their co-occurrence with other words. We find that we are able to extract classes that have the flavor of either syntactically based groupings or semantically based groupings, depending on the nature of the underlying statistics.
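
As a rough illustration of the class-based idea in the abstract, the sketch below estimates a class-based bigram model, in which the probability of a word given its predecessor factors as P(w_i | c_i) * P(c_i | c_{i-1}), with c_i the class of w_i. The function name `train_class_bigram`, the toy corpus, and the hand-written word-to-class mapping are assumptions made for this example only; the paper derives its classes automatically from co-occurrence statistics, which this sketch does not attempt.

```python
from collections import Counter


def train_class_bigram(tokens, word_to_class):
    """Return a function estimating P(word | prev_word) by maximum likelihood
    under a class-based bigram model (hypothetical helper, not from the paper)."""
    classes = [word_to_class[w] for w in tokens]
    word_counts = Counter(tokens)
    class_counts = Counter(classes)
    # Counts of adjacent class pairs (c_{i-1}, c_i).
    class_bigrams = Counter(zip(classes, classes[1:]))
    # How often each class appears as the first element of a pair.
    class_as_prev = Counter(classes[:-1])

    def prob(word, prev_word):
        c = word_to_class[word]
        c_prev = word_to_class[prev_word]
        if class_as_prev[c_prev] == 0:
            return 0.0  # no evidence for this history; unsmoothed estimate
        p_word_given_class = word_counts[word] / class_counts[c]
        p_class_given_prev = class_bigrams[(c_prev, c)] / class_as_prev[c_prev]
        return p_word_given_class * p_class_given_prev

    return prob


# Toy data: the class labels are assigned by hand for illustration.
tokens = "the cat sat on a mat".split()
word_to_class = {
    "the": "DET", "a": "DET",
    "cat": "NOUN", "mat": "NOUN",
    "sat": "VERB", "on": "PREP",
}
model = train_class_bigram(tokens, word_to_class)
# "a cat" never occurs in the corpus, yet it receives nonzero probability
# because DET -> NOUN transitions were observed.
p_a_cat = model("cat", "a")
```

Sharing statistics across all words in a class is what lets the model generalize to unseen word pairs; a practical model would also need smoothing, which is omitted here.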

BibTeX key: brown_class-based_1992
entry type: article
year: 1992
journal: Comput. Linguist.
number: 4
pages: 467--479
volume: 18
url: http://dl.acm.org/citation.cfm?id=176313.176316
