Two4Two: Evaluating Interpretable Machine Learning - A Synthetic Dataset For Controlled Experiments

Martin Schuessler; Philipp Weiß; Leon Sixt

Two4Two：解釈可能な機械学習の評価-制御された実験のための合成データセット

画像分類の説明を生成するためのアプローチが増えています。ただし、これらのアプローチのいくつかは、研究者の制御の及ばない本質的な要因を残しているため、自然画像データセットを使用して制御された実験を設計することが難しいため、人体評価の対象になります。私たちのアプローチでは、研究者はわずかなパラメータで目的のデータセットを記述することができます。これらに基づいて、私たちのライブラリは2つの3D抽象動物の合成画像データを生成します。結果として得られるデータは、アルゴリズムによる評価だけでなく、人間を対象とした評価にも適しています。私たちのユーザー調査結果は、私たちの方法が分類器にとって十分に予測的で、データを視覚的に検査する2人に1人の参加者だけが気付くほど微妙なバイアスを作成できることを示しています。私たちのアプローチは、人間の被験者の評価を実施するための障壁を大幅に下げ、それによって解釈可能な機械学習のより厳密な調査を容易にします。ライブラリとデータセットについては、https：//github.com/mschuessler/two4two/を参照してください。

A growing number of approaches exist to generate explanations for image classification. However, few of these approaches are subjected to human-subject evaluations, partly because it is challenging to design controlled experiments with natural image datasets, as they leave essential factors out of the researcher's control. With our approach, researchers can describe their desired dataset with only a few parameters. Based on these, our library generates synthetic image data of two 3D abstract animals. The resulting data is suitable for algorithmic as well as human-subject evaluations. Our user study results demonstrate that our method can create biases predictive enough for a classifier and subtle enough to be noticeable only to every second participant inspecting the data visually. Our approach significantly lowers the barrier for conducting human subject evaluations, thereby facilitating more rigorous investigations into interpretable machine learning. For our library and datasets see, https://github.com/mschuessler/two4two/

updated: Thu May 06 2021 17:14:39 GMT+0000 (UTC)

published: Thu May 06 2021 17:14:39 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト