
The ROC Diagonal is Not Layperson’s Chance: A New Baseline Shows the Useful Area

  • Conference paper
Machine Learning and Knowledge Extraction (CD-MAKE 2022)

Abstract

In many areas of our daily lives (e.g., healthcare), the performance of a binary diagnostic test or classification model is often represented by a curve in a Receiver Operating Characteristic (ROC) plot and by a quantity known as the area under the ROC curve (AUC or AUROC). In ROC plots, the main diagonal is often referred to as “chance” or the “random line”. In general, however, this does not correspond to the layperson’s concept of chance or randomness for binary outcomes. Rather, it represents a special case of layperson’s chance: the ROC curve of a classifier whose score distribution is the same for the positive class and the negative class. Where the ROC curve of a model deviates from the main diagonal, there is information. However, not all of that information is “useful information” compared to chance; this includes some areas and points above the diagonal. We define the binary chance baseline to identify the areas and points in a ROC plot that are more useful than chance. In this paper, we explain this novel contribution relative to the state of the art and provide examples that classify benchmark data.
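The abstract’s observation that the main diagonal is the ROC curve of a classifier whose score distribution is identical for the positive and negative class can be checked numerically. The following is a minimal sketch added for illustration (not code from the paper), assuming NumPy and scikit-learn are available: scores are drawn independently of the labels, so both classes share one score distribution, and the resulting ROC curve hugs the diagonal with an AUC close to 0.5.

    # Illustrative sketch (not from the paper): a classifier whose scores are
    # drawn independently of the labels has the same score distribution for
    # both classes, so its ROC curve lies on the main diagonal and AUC ~ 0.5.
    import numpy as np
    from sklearn.metrics import roc_curve, roc_auc_score

    rng = np.random.default_rng(0)
    n = 10_000
    y = rng.integers(0, 2, size=n)       # true binary labels
    scores = rng.normal(size=n)          # scores independent of y: one shared score PDF

    fpr, tpr, _ = roc_curve(y, scores)
    print(f"AUC = {roc_auc_score(y, scores):.3f}")                # close to 0.5
    print(f"max |TPR - FPR| = {np.max(np.abs(tpr - fpr)):.3f}")   # curve hugs the diagonal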


Notes

  1. Two probability density functions (PDFs) are the same “almost everywhere” if they disagree on, at most, a set of isolated points (more formally, on a set of measure zero). This qualification is admittedly somewhat pedantic, but it is necessary because any two such PDFs are effectively the same (and share the same cumulative distribution function). Changing a PDF at only isolated points has no actual effect on the random variable it describes.
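As a concrete illustration of this point (an added example, not part of the original note), take the uniform density on [0, 1] and a second density that differs from it only at the single point x = 1/2. The set {1/2} has measure zero, so the two densities are equal almost everywhere and induce the same cumulative distribution function:

    % f is the uniform density on [0,1]; g differs from f only at x = 1/2,
    % a set of measure zero, yet both integrate to the same CDF for every t.
    \[
      f(x) = \mathbf{1}_{[0,1]}(x), \qquad
      g(x) = \begin{cases} 7, & x = \tfrac{1}{2},\\ f(x), & \text{otherwise,} \end{cases}
      \qquad
      \int_{-\infty}^{t} f(x)\,dx \;=\; \int_{-\infty}^{t} g(x)\,dx \quad \text{for all } t.
    \]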


Acknowledgements

Parts of this work have received funding from the Austrian Science Fund (FWF), Project P-32554, “A reference model for explainable Artificial Intelligence in the medical domain”.

Author information


Corresponding author

Correspondence to André M. Carrington.


Ethics declarations

Contributions

All authors contributed to writing this article. AC conceived the main ideas initially. In consultation with PF, JP, FM, and NJ, various ideas were further developed and refined, with AH and RA providing guidance. Experiments were conducted and coded by AC and FM. All authors reviewed and provided edits to the article.


Copyright information

© 2022 IFIP International Federation for Information Processing

About this paper


Cite this paper

Carrington, A.M. et al. (2022). The ROC Diagonal is Not Layperson’s Chance: A New Baseline Shows the Useful Area. In: Holzinger, A., Kieseberg, P., Tjoa, A.M., Weippl, E. (eds) Machine Learning and Knowledge Extraction. CD-MAKE 2022. Lecture Notes in Computer Science, vol 13480. Springer, Cham. https://doi.org/10.1007/978-3-031-14463-9_7


  • DOI: https://doi.org/10.1007/978-3-031-14463-9_7


  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-14462-2

  • Online ISBN: 978-3-031-14463-9

  • eBook Packages: Computer Science, Computer Science (R0)
