copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Querying and clustering Web pages about persons and organizations

S. Ye, T. Chua, and J. Kei. (2003)

Abstract

One of the most frequent Web surfing tasks is to search for names of persons and organizations. Such names are often not distinctive, commonly occurring, and nonunique. Thus, a single name may be mapped to several entities. We describe a methodology to cluster the Web pages returned by the search engine so that pages belonging to different entities are clustered into different groups. The algorithm uses a combination of named entities, link-based and structure-based information as features to partition the document set into direct and indirect pages using a decision model. It then uses the distinct direct pages as seeds to cluster the document set into different clusters. The algorithm has been found to be effective for Web-based applications.

Links and resources

BibTeX key: citeulike:447616
entry type: proceedings
year: 2003
journal: Web Intelligence, 2003. WI 2003. Proceedings. IEEE/WIC International Conference on
pages: 344--350
citeulike-article-id: 447616
citeulike-linkout-0: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1241214
priority: 2
posted-at: 2005-12-23 10:03:49
url: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1241214

@fernand0's tags highlighted

Cite this publication

search on

Meta data

Last update 7 years ago
Created 7 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Querying and clustering Web pages about persons and organizations

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Querying and clustering Web pages about persons and organizations

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Querying and clustering Web pages about persons and organizations

Comments and Reviews
(0)