copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Incremental Diversification for Very Large Sets: a Streaming-based Approach

E. Minack, W. Siberski, and W. Nejdl. Proc. of 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2011, (2011)

Abstract

Result diversification is an effective method to reduce the risk that none of the returned results satisfies a user's query intention. It has been shown to decrease query abandonment substantially. On the other hand, computing an optimally diverse set is NP-hard for the usual objectives. Even the greedy diversification algorithms usually exhibit quadratic complexity and require random access to the input set, rendering them impractical in the context of large result sets or continuous data. To solve this issue, we present a novel diversification approach which treats the input as a stream and processes each element in an incremental fashion, maintaining a near-optimal diverse set at any point in the stream. Our approach exhibits a linear computation and constant memory complexity with respect to input size, without significant loss of diversification quality. In an extensive evaluation on several real-world data sets we show the applicability and efficiency of our algorithm for large result sets as well as for continuous query scenarios such as news stream subscriptions.

Links and resources

BibTeX key: L3S_eccf70441b8a3fa9a76c655d9f626f2f99f22749
entry type: inproceedings
booktitle: Proc. of 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2011
year: 2011

@l3s's tags highlighted

Cite this publication

search on

Meta data

Last update 12 years ago
Created 12 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Incremental Diversification for Very Large Sets: a Streaming-based Approach

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Incremental Diversification for Very Large Sets: a Streaming-based Approach

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Incremental Diversification for Very Large Sets: a Streaming-based Approach

Comments and Reviews
(0)