Deep Probability Estimation

Sheng Liu; Aakash Kaku; Weicheng Zhu; Matan Leibovich; Sreyas Mohan; Boyang Yu; Haoxiang Huang; Laure Zanna; Narges Razavian; Jonathan Niles-Weed; Carlos Fernandez-Granda

深い確率推定

信頼性の高い確率推定は、固有の (偶然の) 不確実性がある多くの現実世界のアプリケーションで非常に重要です。確率推定モデルは、観察された結果 (例: 雨が降ったかどうか、または患者が死亡したかどうか) に基づいてトレーニングされます。これは、関心のあるイベントのグラウンドトゥルース確率が通常未知であるためです。したがって、この問題はバイナリ分類に似ていますが、目的が特定の結果を予測することではなく、確率を推定することであるという違いがあります。この研究では、ディープニューラルネットワークを使用した高次元データからの確率推定について調査しています。これらのモデルによって生成される確率を改善する方法はいくつかありますが、それらは主にモデル (認識論的) の不確実性に焦点を当てています。固有の不確実性を伴う問題の場合、グラウンドトゥルース確率にアクセスせずにパフォーマンスを評価することは困難です。これに対処するために、合成データセットを構築して、さまざまな計算可能な指標を調査および比較します。合成データと 3 つの現実世界の確率推定タスクに関する既存の方法を評価します。これらのタスクにはすべて固有の不確実性が伴います。レーダー画像からの降水予測、組織病理画像からのがん患者の生存率の予測、ダッシュカムビデオからの自動車事故の予測です。また、実験で明らかになったいくつかの現象を再現する高次元確率推定モデルの理論的解析も行います。最後に、ニューラルネットワークを使用した確率推定の新しい方法を提案します。これは、トレーニングプロセスを変更して、データから計算された経験的確率と一致する出力確率を促進します。この方法は、シミュレートされたデータと実際のデータのほとんどのメトリックで既存のアプローチよりも優れています。

Reliable probability estimation is of crucial importance in many real-world applications where there is inherent (aleatoric) uncertainty. Probability-estimation models are trained on observed outcomes (e.g. whether it has rained or not, or whether a patient has died or not), because the ground-truth probabilities of the events of interest are typically unknown. The problem is therefore analogous to binary classification, with the difference that the objective is to estimate probabilities rather than predicting the specific outcome. This work investigates probability estimation from high-dimensional data using deep neural networks. There exist several methods to improve the probabilities generated by these models but they mostly focus on model (epistemic) uncertainty. For problems with inherent uncertainty, it is challenging to evaluate performance without access to ground-truth probabilities. To address this, we build a synthetic dataset to study and compare different computable metrics. We evaluate existing methods on the synthetic data as well as on three real-world probability estimation tasks, all of which involve inherent uncertainty: precipitation forecasting from radar images, predicting cancer patient survival from histopathology images, and predicting car crashes from dashcam videos. We also give a theoretical analysis of a model for high-dimensional probability estimation which reproduces several of the phenomena evinced in our experiments. Finally, we propose a new method for probability estimation using neural networks, which modifies the training process to promote output probabilities that are consistent with empirical probabilities computed from the data. The method outperforms existing approaches on most metrics on the simulated as well as real-world data.

updated: Tue Oct 11 2022 19:59:57 GMT+0000 (UTC)

published: Sun Nov 21 2021 03:55:50 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト