Tutorial

How to Calibrate your Neural Network Classifier: Getting True Probabilities from a Classification Model

Published: 20 August 2020 (DOI: 10.1145/3394486.3406700)

Abstract

Research in machine learning (ML) for classification tasks has been guided primarily by metrics derived from the confusion matrix (e.g. accuracy, precision, and recall). Several works have highlighted that this has led to training practices that produce over-confident models and void the assumption that the model learns a probability distribution over the classification targets; this is referred to as miscalibration. Consequently, modern ML architectures struggle in applications that need a probabilistic forecaster. Research on calibration techniques has explored how to recover probability distributions from traditional architectures. This tutorial covers the key concepts needed to understand the motivation for calibration, and aims to give participants the tools they need to assess the calibration of ML models and to calibrate them when required.
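To make the two tasks concrete, the sketch below (a minimal illustration, not code from the tutorial) assesses calibration with the Expected Calibration Error (ECE) of Naeini et al. (2015) and then re-calibrates with temperature scaling as popularised by Guo et al. (2017). It assumes NumPy arrays of held-out logits and integer labels; the grid search is a simplification of the NLL optimisation used in practice, and all names here are illustrative.

# Minimal sketch: assess calibration with ECE, then re-calibrate with
# temperature scaling. Assumes (N, K) logits and (N,) integer labels.
import numpy as np

def softmax(logits, temperature=1.0):
    """Softmax with a temperature that flattens (T > 1) or sharpens (T < 1)."""
    z = logits / temperature
    z = z - z.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def expected_calibration_error(confidences, is_correct, n_bins=15):
    """Weighted average, over equal-width confidence bins, of the gap
    between mean confidence and accuracy inside each bin."""
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        in_bin = (confidences > lo) & (confidences <= hi)
        if in_bin.any():
            gap = abs(confidences[in_bin].mean() - is_correct[in_bin].mean())
            ece += in_bin.mean() * gap  # weight by the bin's share of samples
    return ece

def fit_temperature(val_logits, val_labels, grid=np.linspace(0.5, 5.0, 91)):
    """Grid-search the temperature minimising validation NLL.
    (In practice this single parameter is usually fitted with L-BFGS.)"""
    def nll(t):
        p = softmax(val_logits, t)[np.arange(len(val_labels)), val_labels]
        return -np.log(p + 1e-12).mean()
    return min(grid, key=nll)

# Usage on synthetic, deliberately over-confident logits:
rng = np.random.default_rng(0)
labels = rng.integers(0, 10, size=5000)
logits = rng.normal(size=(5000, 10))
logits[np.arange(5000), labels] += rng.normal(1.0, 2.0, size=5000)
logits *= 3.0  # exaggerate confidence to mimic miscalibration

probs = softmax(logits)
conf, pred = probs.max(axis=1), probs.argmax(axis=1)
print("ECE before:", expected_calibration_error(conf, pred == labels))

t = fit_temperature(logits, labels)  # fit on a held-out split in practice
probs = softmax(logits, t)
conf, pred = probs.max(axis=1), probs.argmax(axis=1)
print(f"ECE after temperature scaling (T={t:.2f}):",
      expected_calibration_error(conf, pred == labels))

Because temperature scaling divides all logits by a single scalar before the softmax, the arg-max prediction (and hence accuracy) is unchanged; only the confidences are softened, which is why it is a popular post-hoc fix for over-confidence.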


Cited By

  • (2023) Data-Driven Schedule Risk Forecasting for Construction Mega-Projects. SSRN Electronic Journal. DOI: 10.2139/ssrn.4496119. Online publication date: 2023.
  • (2022) On Forecasting Project Activity Durations with Neural Networks. Engineering Applications of Neural Networks, 103-114. DOI: 10.1007/978-3-031-08223-8_9. Online publication date: 10 June 2022.


Information

Published In

KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
August 2020, 3664 pages
ISBN: 9781450379984
DOI: 10.1145/3394486

Publisher

Association for Computing Machinery, New York, NY, United States

      Publication History

      Published: 20 August 2020


      Author Tags

      1. calibration
      2. machine learning
      3. model confidence
      4. model uncertainty

      Qualifiers

      • Tutorial

      Conference

      KDD '20

      Acceptance Rates

      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%




