Global explainability in aligned image modalities

Justin Engelmann; Amos Storkey; Miguel O. Bernabeu

整列された画像モダリティにおけるグローバルな説明可能性

ディープラーニング（DL）モデルは、多くのコンピュータービジョンの問題に非常に効果的であり、重要なアプリケーションでますます使用されています。それらは本質的にブラックボックスでもあります。開業医が特定の画像のモデル予測を理解および検証できるようにする画像ごとの説明を生成するための多くの方法が存在します。それを超えて、DLモデルが一般的に賢明な方法で機能すること、つまりドメイン知識と一致し、望ましくないデータアーティファクトに依存しないことを検証することが望ましいでしょう。この目的のために、モデルはグローバルに説明される必要があります。この作業では、医用画像で一般的であるように、各ピクセル位置が画像化されたオブジェクト上の同様の相対位置を表すように自然に位置合わせされる画像モダリティに焦点を当てます。ラベルごとの全体的なグローバルな説明を取得するための簡単な方法として、画像ごとの説明のピクセルごとの集約を提案します。これらは、モデルの検証、知識の発見、および画像ごとの説明の検査から導き出された定性的な結論を伝達するための効率的な方法として使用できます。さらに、これらのグローバルな説明がモデルの予測方法に忠実であることを定量的に検証する方法として、Progressive Erasing Plus Progressive Restoration（PEPPR）を提案します。次に、これらの方法を、自然に整列したモダリティである超広視野網膜画像に適用します。グローバルな説明はドメイン知識と一致しており、モデルの動作を忠実に反映していることがわかります。

Deep learning (DL) models are very effective on many computer vision problems and increasingly used in critical applications. They are also inherently black box. A number of methods exist to generate image-wise explanations that allow practitioners to understand and verify model predictions for a given image. Beyond that, it would be desirable to validate that a DL model generally works in a sensible way, i.e. consistent with domain knowledge and not relying on undesirable data artefacts. For this purpose, the model needs to be explained globally. In this work, we focus on image modalities that are naturally aligned such that each pixel position represents a similar relative position on the imaged object, as is common in medical imaging. We propose the pixel-wise aggregation of image-wise explanations as a simple method to obtain label-wise and overall global explanations. These can then be used for model validation, knowledge discovery, and as an efficient way to communicate qualitative conclusions drawn from inspecting image-wise explanations. We further propose Progressive Erasing Plus Progressive Restoration (PEPPR) as a method to quantitatively validate that these global explanations are faithful to how the model makes its predictions. We then apply these methods to ultra-widefield retinal images, a naturally aligned modality. We find that the global explanations are consistent with domain knowledge and faithfully reflect the model's workings.

updated: Fri Dec 17 2021 16:05:11 GMT+0000 (UTC)

published: Fri Dec 17 2021 16:05:11 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト