Article,

Scientific Reference Mining using Semantic Information through Topic Modeling

, , and .
Research Journal of Engineering & Technology, 28 (2): 253--262 (2009)

Abstract

This paper addresses one time-consuming task in preparing scientific paper of finding appropriate citations (references). In the past this task is usually performed manually or by using links information of papers. Link information only used title of paper and ignored the semantic-based text information present in the paper contents. Due to overlaps between different fields e.g. in computer science only title of a paper cannot be real representative of different other hidden topics of that paper. We think it is necessary to model the semantic information present in papers to provide more appropriate citations by capturing hidden topics. In this paper, we address this issue by modeling citations on the basis of latent topics present in the papers. Latent topics can provide us semantic correlations present between the papers. We propose a topic modeling approach in which each citation of a paper is represented as a probability distribution over latent topics, and each latent topic is represented as a probability distribution over words of paper for that topic, which can provide us more appropriate citations for a given paper. Experimental results on citeceer corpus shows the effectiveness of proposed approach and detailed interpretation of results reveals interesting information about scientific recommendation.

Tags

Users

  • @jaeschke

Comments and Reviews