@telekoma

Exploiting Structural Information for Text Classification on the WWW

. Advances in Intelligent Data Analysis, volume 1642 of Lecture Notes in Computer Science, Springer, Berlin / Heidelberg, (1999)
DOI: 10.1007/3-540-48412-4_41

Abstract

In this paper, we report on a set of experiments that explore the utility of making use of the structural information of WWW documents. Our working hypothesis is that it is often easier to classify a hypertext page using information provided on pages that point to it instead of using information that is provided on the page itself. We present experimental evidence that confirms this hypothesis on a set of Web-pages that relate to Computer Science Departments.

Description

SpringerLink - Abstract

Links and resources

Tags

community

  • @telekoma
  • @dblp
@telekoma's tags highlighted