Learning Stable Classifiers by Transferring Unstable Features

Yujia Bao; Shiyu Chang; Regina Barzilay

不安定な特徴を転送することによる安定した分類器の学習

偏りのない機械学習モデルは多くのアプリケーションに不可欠ですが、偏りは人間が定義した概念であり、タスクによって異なる可能性があります。入力とラベルのペアのみが与えられた場合、アルゴリズムには、安定した（原因となる）機能と不安定な（偽の）機能を区別するための十分な情報が不足している可能性があります。ただし、関連するタスクは多くの場合、同様のバイアスを共有します。これは、転送設定で安定した分類子を開発するために利用できる観察結果です。この作業では、ソースタスクの不安定な機能についてターゲット分類子に明示的に通知します。具体的には、ソースタスクのさまざまなデータ環境を対比することにより、不安定な機能をエンコードする表現を導き出します。この表現に従ってターゲットタスクのデータをクラスタリングし、これらのクラスター全体で最悪の場合のリスクを最小限に抑えることで、堅牢性を実現します。テキストと画像の両方の分類でメソッドを評価します。経験的な結果は、私たちのアルゴリズムが、合成的に生成された環境と実際の環境の両方で、ターゲットタスクの堅牢性を維持できることを示しています。

While unbiased machine learning models are essential for many applications, bias is a human-defined concept that can vary across tasks. Given only input-label pairs, algorithms may lack sufficient information to distinguish stable (causal) features from unstable (spurious) features. However, related tasks often share similar biases -- an observation we may leverage to develop stable classifiers in the transfer setting. In this work, we explicitly inform the target classifier about unstable features in the source tasks. Specifically, we derive a representation that encodes the unstable features by contrasting different data environments in the source task. We achieve robustness by clustering data of the target task according to this representation and minimizing the worst-case risk across these clusters. We evaluate our method on both text and image classifications. Empirical results demonstrate that our algorithm is able to maintain robustness on the target task for both synthetically generated environments and real-world environments.

updated: Tue Jan 25 2022 16:15:06 GMT+0000 (UTC)

published: Tue Jun 15 2021 02:41:12 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト