Route, Interpret, Repeat: Blurring the Line Between Post hoc Explainability and Interpretable Models

Shantanu Ghosh; Ke Yu; Forough Arabshahi; Kayhan Batmanghelich

ルーティング、解釈、繰り返し: 事後説明可能性と解釈可能なモデルの間の境界線を曖昧にする

ML モデルの設計に対する現在のアプローチは、柔軟なブラックボックスモデルを選択して事後的に説明するか、解釈可能なモデルから始めるかのいずれかです。ブラックボックスモデルは柔軟ですが、説明が困難ですが、解釈可能なモデルは説明できるように設計されています。ただし、解釈可能なモデルを開発するには広範な ML の知識が必要であり、結果として得られるモデルは柔軟性に欠ける傾向があり、Blackbox の同等物と比較して標準以下のパフォーマンスを提供する可能性があります。この論文の目的は、BlackBox の事後的な説明と解釈可能なモデルの構築との区別をあいまいにすることです。柔軟な BlackBox モデルから始めて、解釈可能なモデルと残差ネットワークの混合物を徐々に切り出すことを提案します。私たちの設計は、サンプルのサブセットを識別し、それらを解釈可能なモデルにルーティングします。残りのサンプルは、柔軟な残余ネットワークを介してルーティングされます。解釈可能なモデルのバックボーンとして、BlackBox モデルから取得した概念に関する基本的な推論を提供する First Order Logic (FOL) を採用しています。残差ネットワークでは、残差ネットワークによって説明されるデータの割合が目的のしきい値を下回るまで、この方法を繰り返します。私たちのアプローチにはいくつかの利点があります。まず、解釈可能で柔軟な残差ネットワークの混合により、パフォーマンスがほとんど妥協されません。第 2 に、ルート、解釈、繰り返しのアプローチにより、非常に柔軟な解釈可能なモデルが得られます。私たちの広範な実験は、さまざまなデータセットでのモデルのパフォーマンスを示しています。 FOL モデルを編集することで、元の BlackBox モデルによって学習されたショートカットを修正できることを示します。最後に、私たちの方法は、トレーニングが簡単で、多くのアプリケーションに適応できるハイブリッドシンボリックコネクショニストネットワークのフレームワークを提供します。

The current approach to ML model design is either to choose a flexible Blackbox model and explain it post hoc or to start with an interpretable model. Blackbox models are flexible but difficult to explain, whereas interpretable models are designed to be explainable. However, developing interpretable models necessitates extensive ML knowledge, and the resulting models tend to be less flexible, offering potentially subpar performance compared to their Blackbox equivalents. This paper aims to blur the distinction between a post hoc explanation of a BlackBox and constructing interpretable models. We propose beginning with a flexible BlackBox model and gradually carving out a mixture of interpretable models and a residual network. Our design identifies a subset of samples and routes them through the interpretable models. The remaining samples are routed through a flexible residual network. We adopt First Order Logic (FOL) as the interpretable model's backbone, which provides basic reasoning on concepts retrieved from the BlackBox model. On the residual network, we repeat the method until the proportion of data explained by the residual network falls below a desired threshold. Our approach offers several advantages. First, the mixture of interpretable and flexible residual networks results in almost no compromise in performance. Second, the route, interpret, and repeat approach yields a highly flexible interpretable model. Our extensive experiment demonstrates the performance of the model on various datasets. We show that by editing the FOL model, we can fix the shortcut learned by the original BlackBox model. Finally, our method provides a framework for a hybrid symbolic-connectionist network that is simple to train and adaptable to many applications.

updated: Mon Feb 20 2023 20:25:41 GMT+0000 (UTC)

published: Mon Feb 20 2023 20:25:41 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト