Book

Applications of automatic analysis of content based on the mathematical theory of information (English)

(2002)

Abstract

Analyzes the most important proposals that followed Shannon and Weaver's mathematical theory of communication and influenced automatic content analysis. Explains the methodological applications of this theory to the field of documentation, especially with respect to information retrieval. Then describes the mathematical models applied to automatic content analysis: the laws of Zipf and Goffman, anti-dictionaries to permuted indexes, statistical indexing of terms by frequency, n-grams, and stemming algorithms. Studies methods of relation and classification, such as clustering by discrimination value and by term relevance, e.g., relation methods based on graph theory, mass core, K-means or incremental K-means, and the ISODATA algorithm. Concludes by explaining scientometric indicators such as Chen's co-wording and methods that use learning systems.
