Article,

Some Common Mistakes In IR Evaluation, And How They Can Be Avoided

.
SIGIR Forum, 51 (3): 32--41 (February 2018)
DOI: 10.1145/3190580.3190586

Abstract

This paper points out some mistakes that can be frequently found in IR publications: MRR and ERR violate basic requirements for a metric, MAP is based on unrealistic assumptions, the numbers shown overstate the precision of the result, relative improvements of arithmetic means are inappropriate, the simple holdout method yields unreliable results, hypotheses are often formulated after the experiment, significance tests frequently ignore the multiple comparisons problem, effect sizes are ignored, reproducibility of the experiments might be nearly impossible, and sometimes authors claim proof by experimentation.

Tags

Users

  • @jaeschke

Comments and Reviews