Relative cost curves: An alternative to AUC and an extension to 3-class problems

Olga Montvida; Frank Klawonn

Kybernetika (2014)

  • Volume: 50, Issue: 5, page 647-660
  • ISSN: 0023-5954

Abstract

top
Performance evaluation of classifiers is a crucial step for selecting the best classifier or the best set of parameters for a classifier. Receiver Operating Characteristic (ROC) curves and Area Under the ROC Curve (AUC) are widely used to analyse performance of a classifier. However, the approach does not take into account that misclassification for different classes might have more or less serious consequences. On the other hand, it is often difficult to specify exactly the consequences or costs of misclassifications. This paper is devoted to Relative Cost Curves (RCC) - a graphical technique for visualising the performance of binary classifiers over the full range of possible relative misclassification costs. This curve provides helpful information to choose the best set of classifiers or to estimate misclassification costs if those are not known precisely. In this paper, the concept of Area Above the RCC (AAC) is introduced, a scalar measure of classifier performance under unequal misclassification costs problem. We also extend RCC to multicategory problems when misclassification costs depend only on the true class.

How to cite

top

Montvida, Olga, and Klawonn, Frank. "Relative cost curves: An alternative to AUC and an extension to 3-class problems." Kybernetika 50.5 (2014): 647-660. <http://eudml.org/doc/262159>.

@article{Montvida2014,
abstract = {Performance evaluation of classifiers is a crucial step for selecting the best classifier or the best set of parameters for a classifier. Receiver Operating Characteristic (ROC) curves and Area Under the ROC Curve (AUC) are widely used to analyse performance of a classifier. However, the approach does not take into account that misclassification for different classes might have more or less serious consequences. On the other hand, it is often difficult to specify exactly the consequences or costs of misclassifications. This paper is devoted to Relative Cost Curves (RCC) - a graphical technique for visualising the performance of binary classifiers over the full range of possible relative misclassification costs. This curve provides helpful information to choose the best set of classifiers or to estimate misclassification costs if those are not known precisely. In this paper, the concept of Area Above the RCC (AAC) is introduced, a scalar measure of classifier performance under unequal misclassification costs problem. We also extend RCC to multicategory problems when misclassification costs depend only on the true class.},
author = {Montvida, Olga, Klawonn, Frank},
journal = {Kybernetika},
keywords = {classifier; performance evaluation; misclassification costs; cost curves; ROC curves; AUC; classifier; performance evaluation; misclassification costs; cost curves; receiver operating characteristic (ROC) curves; area under the ROC curve (AUC)},
language = {eng},
number = {5},
pages = {647-660},
publisher = {Institute of Information Theory and Automation AS CR},
title = {Relative cost curves: An alternative to AUC and an extension to 3-class problems},
url = {http://eudml.org/doc/262159},
volume = {50},
year = {2014},
}

TY - JOUR
AU - Montvida, Olga
AU - Klawonn, Frank
TI - Relative cost curves: An alternative to AUC and an extension to 3-class problems
JO - Kybernetika
PY - 2014
PB - Institute of Information Theory and Automation AS CR
VL - 50
IS - 5
SP - 647
EP - 660
AB - Performance evaluation of classifiers is a crucial step for selecting the best classifier or the best set of parameters for a classifier. Receiver Operating Characteristic (ROC) curves and Area Under the ROC Curve (AUC) are widely used to analyse performance of a classifier. However, the approach does not take into account that misclassification for different classes might have more or less serious consequences. On the other hand, it is often difficult to specify exactly the consequences or costs of misclassifications. This paper is devoted to Relative Cost Curves (RCC) - a graphical technique for visualising the performance of binary classifiers over the full range of possible relative misclassification costs. This curve provides helpful information to choose the best set of classifiers or to estimate misclassification costs if those are not known precisely. In this paper, the concept of Area Above the RCC (AAC) is introduced, a scalar measure of classifier performance under unequal misclassification costs problem. We also extend RCC to multicategory problems when misclassification costs depend only on the true class.
LA - eng
KW - classifier; performance evaluation; misclassification costs; cost curves; ROC curves; AUC; classifier; performance evaluation; misclassification costs; cost curves; receiver operating characteristic (ROC) curves; area under the ROC curve (AUC)
UR - http://eudml.org/doc/262159
ER -

References

top
  1. Bradley, A. P., 10.1016/S0031-3203(96)00142-2, Pattern Recognition 30 (1997), 1145-1159. DOI10.1016/S0031-3203(96)00142-2
  2. Drummond, C., Holte, R. C., 10.1007/s10994-006-8199-5, Machine Learning 65 (2006) 95-130. DOI10.1007/s10994-006-8199-5
  3. Fawcett, T., 10.1016/j.patrec.2005.10.010, Pattern Recognition Lett. 27 (2006), 861-874. DOI10.1016/j.patrec.2005.10.010
  4. Hand, D. J., 10.1007/s10994-009-5119-5, Machine Learning 77 (2009), 103-123. DOI10.1007/s10994-009-5119-5
  5. Hand, D. J., Till, R. J., 10.1023/A:1010920819831, Machine Learning 45 (2001), 171-186. Zbl1007.68180DOI10.1023/A:1010920819831
  6. Hanley, J. A., Receiver operating characteristic (ROC) methodology: the state of the art., Critical Reviews in Diagnostic Imaging 29 (1989), 307-335. 
  7. Hernández-Orallo, J., Flach, P., Ferri, C., Brier curves: a new cost-based visualisation of classifier performance., In: Proc. 28th International Conference on Machine Learning (ICML-11) (L. Getoor and T. Scheffer, eds.), ACM, New York 2011, pp. 585-592. 
  8. Klawonn, F., Höppner, F., May, S., An alternative to ROC and AUC analysis of classifiers., In: Advances in Intelligent Data Analysis X, (J. Gama, E. Bradley, and J. Hollmén, eds.), Springer, Berlin 2011, p. 210-221. 
  9. Krzanowski, W. J., Hand, D. J., ROC Curves for Continuous data., Chapman and Hall, London 2009. Zbl1288.62005MR2522628
  10. Li, J., Fine, J. P., 10.1093/biostatistics/kxm050, Biostatistics 9 (2008), 566-576. Zbl1143.62083DOI10.1093/biostatistics/kxm050
  11. Murphy, P. M., Aha, D. W., Uci repository of machine learning databases., 1992. Avaible: http://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic). 
  12. Mossman, D., 10.1177/0272989X9901900110, Medical Decision Making 19 (1999), 78-89. DOI10.1177/0272989X9901900110

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

    
                

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.