Abstract
The chapter discusses the various types of corpora, and provides a sense of how words behave inside them. Quantitative exploration of individual words in corpus is shown using frequency and information content measures. Quantitative exploration of co-occurrences of words, called collocations, is shown using the point-wise mutual information and other measures. Concordancers, a tool for viewing words in their immediate contextual environment within a corpus, are introduced for qualitative exploration of corpora. Experiment: Comparing word frequencies between domain-specific corpora.
Users
Please
log in to take part in the discussion (add own reviews or comments).