@lee_peck

KeyGraph: Automatic Indexing by Co-occurrence Graph based on Building Construction Metaphor

, , and . ADL '98: Proceedings of the Advances in Digital Libraries Conference, page 12. Washington, DC, USA, IEEE Computer Society, (1998)

Abstract

In this paper, we present an algorithm for extracting keywords representing the asserted main point in a document, without relying on external devices such as natural language processing tools or a document corpus. Our algorithm KeyGraph is based on the segmentation of a graph, representing the co-occurrence between terms in a document, into clusters. Each cluster corresponds to a concept on which author's idea is based, and top ranked terms by a statistic based on each term's relationship to these clusters are selected as keywords. This strategy comes from considering that a document is constructed like a building for expressing new ideas based on traditional concepts.The experimental results show that thus extracted terms match author's point quite accurately, even though KeyGraph does not use each term's average frequency in a corpus, i.e., KeyGraph is a content-sensitive, domain independent device of indexing.

Description

KeyGraph

Links and resources

Tags

community

  • @dblp
  • @lee_peck
@lee_peck's tags highlighted