Context recognition for hierarchical text classification
R. Liu. Journal of the American Society for Information Science and Technology, 9999 (9999):
NA+(2009)
DOI: 10.1002/asi.21022
Abstract
Information is often organized as a text hierarchy. A hierarchical text-classification system is thus essential for the management, sharing, and dissemination of information. It aims to automatically classify each incoming document into zero, one, or several categories in the text hierarchy. In this paper, we present a technique called CRHTC (context recognition for hierarchical text classification) that performs hierarchical text classification by recognizing the context of discussion (COD) of each category. A category's COD is governed by its ancestor categories, whose contents indicate contextual backgrounds of the category. A document may be classified into a category only if its content matches the category's COD. CRHTC does not require any trials to manually set parameters, and hence is more portable and easier to implement than other methods. It is empirically evaluated under various conditions. The results show that CRHTC achieves both better and more stable performance than several hierarchical and nonhierarchical text-classification methodologies.
%0 Journal Article
%1 citeulike:3979048
%A Liu, Rey-Long
%C Department of Medical Informatics, Tzu Chi University, Hualien, Taiwan, ROC
%D 2009
%J Journal of the American Society for Information Science and Technology
%K categories, classification, context, context\_dependent, hierarchy, ontology
%N 9999
%P NA+
%R 10.1002/asi.21022
%T Context recognition for hierarchical text classification
%U http://dx.doi.org/10.1002/asi.21022
%V 9999
%X Information is often organized as a text hierarchy. A hierarchical text-classification system is thus essential for the management, sharing, and dissemination of information. It aims to automatically classify each incoming document into zero, one, or several categories in the text hierarchy. In this paper, we present a technique called CRHTC (context recognition for hierarchical text classification) that performs hierarchical text classification by recognizing the context of discussion (COD) of each category. A category's COD is governed by its ancestor categories, whose contents indicate contextual backgrounds of the category. A document may be classified into a category only if its content matches the category's COD. CRHTC does not require any trials to manually set parameters, and hence is more portable and easier to implement than other methods. It is empirically evaluated under various conditions. The results show that CRHTC achieves both better and more stable performance than several hierarchical and nonhierarchical text-classification methodologies.
@article{citeulike:3979048,
abstract = {Information is often organized as a text hierarchy. A hierarchical text-classification system is thus essential for the management, sharing, and dissemination of information. It aims to automatically classify each incoming document into zero, one, or several categories in the text hierarchy. In this paper, we present a technique called CRHTC (context recognition for hierarchical text classification) that performs hierarchical text classification by recognizing the context of discussion (COD) of each category. A category's COD is governed by its ancestor categories, whose contents indicate contextual backgrounds of the category. A document may be classified into a category only if its content matches the category's COD. CRHTC does not require any trials to manually set parameters, and hence is more portable and easier to implement than other methods. It is empirically evaluated under various conditions. The results show that CRHTC achieves both better and more stable performance than several hierarchical and nonhierarchical text-classification methodologies.},
added-at = {2009-12-11T23:34:46.000+0100},
address = {Department of Medical Informatics, Tzu Chi University, Hualien, Taiwan, ROC},
author = {Liu, Rey-Long},
biburl = {https://www.bibsonomy.org/bibtex/2737f01f0dd2c6ffc7f32854af05e8da1/djsaab},
citeulike-article-id = {3979048},
citeulike-linkout-0 = {http://dx.doi.org/10.1002/asi.21022},
citeulike-linkout-1 = {http://www3.interscience.wiley.com/cgi-bin/abstract/121664871/ABSTRACT},
description = {djsaab's CiteULike library 20091211},
doi = {10.1002/asi.21022},
interhash = {139a17aa45009d393aa0b5632d1789a2},
intrahash = {737f01f0dd2c6ffc7f32854af05e8da1},
journal = {Journal of the American Society for Information Science and Technology},
keywords = {categories, classification, context, context\_dependent, hierarchy, ontology},
number = 9999,
pages = {NA+},
posted-at = {2009-01-29 14:09:29},
priority = {2},
timestamp = {2009-12-11T23:34:55.000+0100},
title = {Context recognition for hierarchical text classification},
url = {http://dx.doi.org/10.1002/asi.21022},
volume = 9999,
year = 2009
}