Interpreting Face Inference Models using Hierarchical Network Dissection

Divyang Teotia; Agata Lapedriza; Sarah Ostadabbas

This paper presents Hierarchical Network Dissection, a general pipeline for interpreting the internal representations of face-centric inference models. Using a probabilistic formulation, our pipeline pairs units of the model with concepts in our "Face Dictionary", a collection of facial concepts with corresponding sample images. Our pipeline is inspired by Network Dissection, a popular interpretability method for object-centric and scene-centric models. However, our formulation makes it possible to handle two important challenges of face-centric models that Network Dissection cannot address: (1) spatial overlap of concepts: different facial concepts can occur simultaneously in the same region of the image, such as "nose" (facial part) and "pointy nose" (facial attribute); and (2) global concepts: some units have an affinity to concepts that do not refer to a specific location on the face (e.g., apparent age). We use Hierarchical Network Dissection to dissect different face-centric inference models trained on widely used facial datasets. The results show that models trained for different tasks learn different internal representations. Furthermore, the interpretability results can reveal some biases in the training data and some interesting characteristics of the face-centric inference tasks. Finally, we conduct controlled experiments on biased data to showcase the potential of Hierarchical Network Dissection for bias discovery. The results illustrate how Hierarchical Network Dissection can be used to discover and quantify bias in the training data that is also encoded in the model.
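
To make the spatial-overlap challenge concrete, the following is a minimal sketch of the kind of overlap score used by the original Network Dissection, which the abstract names as the inspiration: a unit's activation maps are thresholded at a high quantile and compared with a concept's segmentation masks via intersection over union (IoU). It is not the probabilistic formulation of Hierarchical Network Dissection, which the abstract does not spell out. The function name `unit_concept_iou`, the toy arrays, and the quantile value used in the demo are assumptions made for illustration, and activation maps are assumed to be already resized to the mask resolution.

```python
import numpy as np

def unit_concept_iou(activations, concept_masks, top_quantile=0.005):
    """IoU between a unit's thresholded activations and a concept's masks.

    activations:   float array of shape (N, H, W), one map per image
    concept_masks: bool array of shape (N, H, W), True where the concept is
    top_quantile:  fraction of activation values treated as "unit is on"
    (all names and shapes are illustrative assumptions)
    """
    # Threshold chosen over the whole dataset, as in Network Dissection.
    threshold = np.quantile(activations, 1.0 - top_quantile)
    unit_masks = activations > threshold
    intersection = np.logical_and(unit_masks, concept_masks).sum()
    union = np.logical_or(unit_masks, concept_masks).sum()
    return float(intersection) / float(union) if union > 0 else 0.0

# Toy data: 4 images of 8x8; the hypothetical unit fires on a 2x2 patch.
acts = np.zeros((4, 8, 8))
acts[:, 2:4, 2:4] = 1.0

# "nose" (facial part) is annotated on that patch in every image.
nose = np.zeros((4, 8, 8), dtype=bool)
nose[:, 2:4, 2:4] = True

# "pointy nose" (facial attribute) covers the same patch, but only in
# the two images where the attribute is present.
pointy_nose = np.zeros((4, 8, 8), dtype=bool)
pointy_nose[:2, 2:4, 2:4] = True

# A looser quantile than the default is used because the toy data is tiny.
iou_nose = unit_concept_iou(acts, nose, top_quantile=0.1)
iou_pointy = unit_concept_iou(acts, pointy_nose, top_quantile=0.1)
print(iou_nose, iou_pointy)
```

In this toy setting both concepts occupy the same pixels, so a purely spatial score cannot tell whether the unit responds to the part or to the attribute; it only reflects how often each annotation is present. A global concept such as apparent age has no mask at all, so this score cannot be computed for it.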

updated: Tue Mar 29 2022 03:06:23 GMT+0000 (UTC)

published: Mon Aug 23 2021 18:52:47 GMT+0000 (UTC)

arXiv
