Positive-Congruent Training: Towards Regression-Free Model Updates

Sijie Yan; Yuanjun Xiong; Kaustav Kundu; Shuo Yang; Siqi Deng; Meng Wang; Wei Xia; Stefano Soatto

ポジティブ-一致トレーニング：回帰のないモデルの更新に向けて

AIシステムのさまざまなバージョンの動作の不整合を減らすことは、実際には、全体的なエラーを減らすことと同じくらい重要です。画像分類では、サンプルごとの不一致は「ネガティブフリップ」として表示されます。新しいモデルは、古い（参照）モデルによって正しく分類されたテストサンプルの出力を誤って予測します。正の合同（PC）トレーニングは、エラー率を減らすと同時に負の反転を減らすことを目的としているため、モデルの蒸留とは異なり、正の予測でのみ参照モデルとの一致を最大化します。 PCトレーニングの簡単なアプローチであるFocalDistillationを提案します。これは、正しく分類されたサンプルにより多くの重みを与えることにより、参照モデルとの一致を強制します。また、参照モデル自体を複数のディープニューラルネットワークのアンサンブルとして選択できる場合、新しいモデルの精度に影響を与えることなく、負のフリップをさらに減らすことができることもわかりました。

Reducing inconsistencies in the behavior of different versions of an AI system can be as important in practice as reducing its overall error. In image classification, sample-wise inconsistencies appear as "negative flips:" A new model incorrectly predicts the output for a test sample that was correctly classified by the old (reference) model. Positive-congruent (PC) training aims at reducing error rate while at the same time reducing negative flips, thus maximizing congruency with the reference model only on positive predictions, unlike model distillation. We propose a simple approach for PC training, Focal Distillation, which enforces congruence with the reference model by giving more weights to samples that were correctly classified. We also found that, if the reference model itself can be chosen as an ensemble of multiple deep neural networks, negative flips can be further reduced without affecting the new model's accuracy.

updated: Wed Nov 18 2020 09:00:44 GMT+0000 (UTC)

published: Wed Nov 18 2020 09:00:44 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト