CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks

Tuomas Oikarinen; Tsui-Wei Weng


In this paper, we propose CLIP-Dissect, a new technique to automatically describe the function of individual hidden neurons inside vision networks. CLIP-Dissect leverages recent advances in multimodal vision/language models to label internal neurons with open-ended concepts without the need for any labeled data or human examples. We show that CLIP-Dissect provides more accurate descriptions than existing methods for last-layer neurons, where ground truth is available, as well as qualitatively good descriptions for hidden-layer neurons. In addition, our method is very flexible: it is model-agnostic, can easily handle new concepts, and can be extended to take advantage of better multimodal models in the future. Finally, CLIP-Dissect is computationally efficient and can label all neurons from five layers of ResNet-50 in just 4 minutes, which is more than 10 times faster than existing methods. Our code is available at https://github.com/Trustworthy-ML-Lab/CLIP-dissect.
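The abstract does not spell out how a neuron is matched to a concept, so the following is only a toy sketch of the general idea of labeling a neuron with an open-ended concept. It assumes that a vision/language model has already scored each probe image against each candidate concept, and it uses plain Pearson correlation as a stand-in matching rule, which is an assumption here and not necessarily the similarity function used by CLIP-Dissect. The names `pearson`, `label_neuron`, and `concept_scores`, and all numbers, are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    # Constant sequences carry no signal; treat them as uncorrelated.
    return cov / (sx * sy) if sx and sy else 0.0


def label_neuron(activations, concept_scores):
    """Return the concept whose per-image scores best track the neuron.

    activations: the neuron's activation on each probe image.
    concept_scores: concept -> image/text similarity for each probe image
    (assumed to come from a multimodal model; made up in this sketch).
    """
    return max(concept_scores,
               key=lambda c: pearson(activations, concept_scores[c]))


# Toy data for 4 probe images (illustrative values only).
concept_scores = {
    "dog":    [0.9, 0.8, 0.1, 0.2],
    "stripe": [0.1, 0.2, 0.9, 0.7],
}
neuron_activations = [0.2, 0.1, 3.5, 2.9]  # fires on images 3 and 4
best = label_neuron(neuron_activations, concept_scores)
```

Because the concept set is just a dictionary of words, adding a new candidate concept only means adding another row of scores, which is the kind of flexibility the abstract describes; no labeled data for the neuron itself is needed.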

updated: Sat Mar 25 2023 22:31:42 GMT+0000 (UTC)

published: Sat Apr 23 2022 00:40:02 GMT+0000 (UTC)

arXiv
