Improving Explainability of Disentangled Representations using Multipath-Attribution Mappings

Lukas Klein; João B. S. Carvalho; Mennatallah El-Assady; Paolo Penna; Joachim M. Buhmann; Paul F. Jaeger

マルチパス属性マッピングを使用したもつれ解除表現の説明可能性の向上

Explainable AI は、モデルの動作を人間が理解できるようにすることを目的としています。これは、相関パターンから因果関係を抽出する中間ステップと見なすことができます。画像ベースの臨床診断では致命的な意思決定が発生する可能性が高いため、これらの安全性が重要なシステムに説明可能な AI を統合する必要があります。現在の説明方法は通常、入力画像内のピクセル領域に属性スコアを割り当て、モデルの決定に対するその重要性を示します。しかし、視覚的な特徴が使用される理由を説明するには不十分です。我々は、下流タスク予測のために解釈可能なもつれ解除表現を利用するフレームワークを提案します。解きほぐされた表現を視覚化することで、専門家が専門分野の知識を活用して考えられる因果関係を調査できるようになります。さらに、説明を充実させ検証するためにマルチパスアトリビューションマッピングを導入します。合成ベンチマークスイートと 2 つの医療データセットに対するアプローチの有効性を実証します。このフレームワークは因果関係抽出の触媒として機能するだけでなく、分布シフト下でのテストを必要とせずにショートカット検出を可能にすることでモデルの堅牢性も強化することを示します。

Explainable AI aims to render model behavior understandable by humans, which can be seen as an intermediate step in extracting causal relations from correlative patterns. Due to the high risk of possible fatal decisions in image-based clinical diagnostics, it is necessary to integrate explainable AI into these safety-critical systems. Current explanatory methods typically assign attribution scores to pixel regions in the input image, indicating their importance for a model's decision. However, they fall short when explaining why a visual feature is used. We propose a framework that utilizes interpretable disentangled representations for downstream-task prediction. Through visualizing the disentangled representations, we enable experts to investigate possible causation effects by leveraging their domain knowledge. Additionally, we deploy a multi-path attribution mapping for enriching and validating explanations. We demonstrate the effectiveness of our approach on a synthetic benchmark suite and two medical datasets. We show that the framework not only acts as a catalyst for causal relation extraction but also enhances model robustness by enabling shortcut detection without the need for testing under distribution shifts.

updated: Thu Jun 15 2023 10:52:29 GMT+0000 (UTC)

published: Thu Jun 15 2023 10:52:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト