Inproceedings,

Beyond independent relevance: methods and evaluation metrics for subtopic retrieval

C. Zhai, W. Cohen, and J. Lafferty.
SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, page 10--17. New York, NY, USA, ACM, (2003)
DOI: 10.1145/860435.860440

Abstract

We present a non-traditional retrieval problem we call subtopic retrieval . The subtopic retrieval problem is concerned with finding documents that cover many different subtopics of a query topic. In such a problem, the utility of a document in a ranking is dependent on other documents in the ranking, violating the assumption of independent relevance which is assumed in most traditional retrieval methods. Subtopic retrieval poses challenges for evaluating performance, as well as for developing effective algorithms. We propose a framework for evaluating subtopic retrieval which generalizes the traditional precision and recall metrics by accounting for intrinsic topic difficulty as well as redundancy in documents. We propose and systematically evaluate several methods for performing subtopic retrieval using statistical language models and a maximal marginal relevance (MMR) ranking strategy. A mixture model combined with query likelihood relevance ranking is shown to modestly outperform a baseline relevance ranking on a data set used in the TREC interactive track.

BibTeX key: zhai_2003_subtopic
entry type: inproceedings
address: New York, NY, USA
booktitle: SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
year: 2003
pages: 10--17
publisher: ACM
posted-at: 2009-01-21 15:39:46
location: Toronto, Canada
citeulike-article-id: 3795990
citeulike-linkout-1: http://dx.doi.org/10.1145/860435.860440
priority: 1
isbn: 1-58113-646-3
citeulike-linkout-0: http://portal.acm.org/citation.cfm?id=860435.860440
DOI: 10.1145/860435.860440
url: http://dx.doi.org/10.1145/860435.860440

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@inproceedings{zhai_2003_subtopic, abstract = {We present a non-traditional retrieval problem we call subtopic retrieval . The subtopic retrieval problem is concerned with finding documents that cover many different subtopics of a query topic. In such a problem, the utility of a document in a ranking is dependent on other documents in the ranking, violating the assumption of independent relevance which is assumed in most traditional retrieval methods. Subtopic retrieval poses challenges for evaluating performance, as well as for developing effective algorithms. We propose a framework for evaluating subtopic retrieval which generalizes the traditional precision and recall metrics by accounting for intrinsic topic difficulty as well as redundancy in documents. We propose and systematically evaluate several methods for performing subtopic retrieval using statistical language models and a maximal marginal relevance (MMR) ranking strategy. A mixture model combined with query likelihood relevance ranking is shown to modestly outperform a baseline relevance ranking on a data set used in the TREC interactive track.}, added-at = {2009-08-06T15:16:38.000+0200}, address = {New York, NY, USA}, author = {Zhai, Cheng X. and Cohen, William W. and Lafferty, John}, biburl = {https://www.bibsonomy.org/bibtex/21512a8c7da38bb1a18c4223353a8daeb/chato}, booktitle = {SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval}, citeulike-article-id = {3795990}, citeulike-linkout-0 = {http://portal.acm.org/citation.cfm?id=860435.860440}, citeulike-linkout-1 = {http://dx.doi.org/10.1145/860435.860440}, doi = {10.1145/860435.860440}, interhash = {7ee606936d29b789a5fc06c93bca6127}, intrahash = {1512a8c7da38bb1a18c4223353a8daeb}, isbn = {1-58113-646-3}, keywords = {classification, ranking, similarity, text}, location = {Toronto, Canada}, pages = {10--17}, posted-at = {2009-01-21 15:39:46}, priority = {1}, publisher = {ACM}, timestamp = {2009-08-06T15:16:43.000+0200}, title = {Beyond independent relevance: methods and evaluation metrics for subtopic retrieval}, url = {http://dx.doi.org/10.1145/860435.860440}, year = 2003 }

BibSonomy

Beyond independent relevance: methods and evaluation metrics for subtopic retrieval

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on