Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

Anastasios N. Angelopoulos; Stephen Bates; Emmanuel J. Candès; Michael I. Jordan; Lihua Lei

学習してからテストする：リスク管理を実現するための予測アルゴリズムの調整

機械学習モデルをキャリブレーションするためのフレームワークであるLearnthen Testを紹介します。これにより、基礎となるモデルや（未知の）データ生成分布に関係なく、予測が明示的な有限サンプルの統計的保証を満たします。このフレームワークは、他の例の中でも、マルチラベル分類での偽発見率制御、インスタンスセグメンテーションでの交差オーバーユニオン制御、および分類または回帰での外れ値検出と信頼セットカバレッジのタイプ1エラーの同時制御に対処します。これを達成するために、私たちは重要な技術的課題を解決します。それは、必ずしも単調ではない任意のリスクの管理です。私たちの主な洞察は、リスク制御問題を多重仮説検定として再構成し、以前の文献とは異なる手法と数学的議論を可能にすることです。フレームワークを使用して、コンピュータービジョンの詳細な実例を使用して、いくつかのコア機械学習タスクに新しいキャリブレーション方法を提供します。

We introduce Learn then Test, a framework for calibrating machine learning models so that their predictions satisfy explicit, finite-sample statistical guarantees regardless of the underlying model and (unknown) data-generating distribution. The framework addresses, among other examples, false discovery rate control in multi-label classification, intersection-over-union control in instance segmentation, and the simultaneous control of the type-1 error of outlier detection and confidence set coverage in classification or regression. To accomplish this, we solve a key technical challenge: the control of arbitrary risks that are not necessarily monotonic. Our main insight is to reframe the risk-control problem as multiple hypothesis testing, enabling techniques and mathematical arguments different from those in the previous literature. We use our framework to provide new calibration methods for several core machine learning tasks with detailed worked examples in computer vision.

updated: Thu Oct 14 2021 17:04:46 GMT+0000 (UTC)

published: Sun Oct 03 2021 17:42:03 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト