1 Department of Electrical Engineering and Computer Science, University of California, Berkeley
2 Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology
3 Department of Statistics, Stanford University
4 Stanford Graduate School of Business
Abstract
We introduce a framework for calibrating machine learning models to satisfy finite-sample statistical guarantees. Our calibration algorithms work with any model and (unknown) data-generating distribution and do not require retraining. The algorithms address, among other examples, false discovery rate control in multilabel classification, intersection-over-union control in instance segmentation, and simultaneous control of the type-1 error of outlier detection and confidence set coverage in classification or regression. Our main insight is to reframe risk control as a multiple hypothesis testing problem, enabling mathematical arguments different from those used in prior approaches. We demonstrate our algorithms with detailed worked examples in computer vision and tabular medical data. The computer vision experiments show the utility of our approach in calibrating state-of-the-art, widely deployed predictive architectures, such as object detection systems.
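As a concrete illustration of the calibration recipe described above, here is a minimal sketch, assuming losses bounded in [0, 1], a Hoeffding-style p-value, and a simple Bonferroni correction over the grid of candidate parameters. The function names and the toy usage are hypothetical, not taken from the paper, which develops tighter p-values and more powerful multiple testing procedures.

```python
# Minimal Learn-then-Test-style calibration sketch (illustrative, not the
# paper's exact procedure). Assumes i.i.d. calibration losses in [0, 1].
import numpy as np


def hoeffding_pvalue(losses, alpha):
    """P-value for H0: E[loss] > alpha, via Hoeffding's inequality."""
    n = len(losses)
    r_hat = losses.mean()
    return np.exp(-2.0 * n * max(0.0, alpha - r_hat) ** 2)


def learn_then_test(loss_fn, cal_data, lambdas, alpha, delta):
    """Return the candidate parameters certified to keep the risk below
    alpha, with probability at least 1 - delta (Bonferroni over the grid)."""
    pvals = []
    for lam in lambdas:
        losses = np.array([loss_fn(x, y, lam) for x, y in cal_data])
        pvals.append(hoeffding_pvalue(losses, alpha))
    threshold = delta / len(lambdas)  # Bonferroni correction
    return [lam for lam, p in zip(lambdas, pvals) if p <= threshold]


# Toy usage (hypothetical): calibrate a score threshold lam so that the
# prediction set {k : score_k >= lam} misses the true label at most
# alpha = 10% of the time.
rng = np.random.default_rng(0)
cal_data = [(rng.uniform(size=10), rng.integers(10)) for _ in range(2000)]
miss_loss = lambda x, y, lam: float(x[y] < lam)  # 1 if true label excluded
valid = learn_then_test(miss_loss, cal_data, np.linspace(0, 1, 50),
                        alpha=0.1, delta=0.1)
```

Any parameter in the returned set carries the stated finite-sample guarantee; more powerful procedures, such as fixed sequence testing when the risk is roughly monotone in the parameter, can certify larger sets.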
Funding Statement
This work was supported in part by the Mathematical Data Science program of the Office of Naval Research under grant number N00014-21-1-2840.
Acknowledgments
The authors would like to thank the anonymous referees, an Associate Editor, and the Editor for their constructive comments that improved the quality of this paper. Lihua Lei is grateful for the support of National Science Foundation Grant DMS-2338464.
Citation
Anastasios N. Angelopoulos, Stephen Bates, Emmanuel J. Candès, Michael I. Jordan, Lihua Lei. "Learn then test: Calibrating predictive algorithms to achieve risk control." Ann. Appl. Stat. 19(2), 1641-1662, June 2025. https://doi.org/10.1214/24-AOAS1998