@jamesh

Beyond Accuracy, F-Score and ROC: A Family of Discriminant Measures for Performance Evaluation

, , and . AI 2006: Advances in Artificial Intelligence, (2006)

Abstract

Different evaluation measures assess different characteristics of machine learning algorithms. The empirical evaluation of algorithms and classifiers is a matter of on-going debate among researchers. Most measures in use today focus on a classifier’sability to identify classes correctly. We note other useful properties, such as failure avoidance or class discrimination,and we suggest measures to evaluate such properties. These measures – Youden’s index, likelihood, Discriminant power – areused in medical diagnosis. We show that they are interrelated, and we apply them to a case study from the field of electronicnegotiations. We also list other learning problems which may benefit from the application of these measures.

Links and resources

Tags

community

  • @dblp
  • @jamesh
@jamesh's tags highlighted