On the Post-hoc Explainability of Deep Echo State Networks for Time Series Forecasting, Image and Video Classification

Alejandro Barredo Arrieta; Sergio Gil-Lopez; Ibai Laña; Miren Nekane Bilbao; Javier Del Ser

Since their inception, learning techniques under the Reservoir Computing paradigm have shown a great modeling capability for recurrent systems without the computing overheads required by other approaches. Among them, different flavors of Echo State Networks have attracted considerable attention over time, mainly due to the simplicity and computational efficiency of their learning algorithm. However, these advantages do not compensate for the fact that Echo State Networks remain black-box models whose decisions cannot be easily explained to a general audience. This work addresses this issue by conducting an explainability study of Echo State Networks when applied to learning tasks with time series, image and video data. Specifically, the study proposes three different techniques capable of eliciting understandable information about the knowledge grasped by these recurrent models, namely, potential memory, temporal patterns and pixel absence effect. Potential memory addresses questions related to the effect of the reservoir size on the capability of the model to store temporal information, whereas temporal patterns unveil the recurrent relationships captured by the model over time. Finally, pixel absence effect attempts to evaluate the effect of the absence of a given pixel when the Echo State Network model is used for image and video classification. We showcase the benefits of our proposed suite of techniques over three different domains of applicability: time series modeling, image classification and, for the first time in the related literature, video classification. Our results reveal that the proposed techniques not only allow for an informed understanding of the way these models work, but also serve as diagnostic tools capable of detecting issues inherited from data (e.g., the presence of hidden bias).
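To make the pixel absence idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes a single-layer leaky Echo State Network in NumPy, a ridge-regression readout trained on the final reservoir state, and images fed row by row as a sequence. "Absence" is approximated here by setting one pixel to zero and recording the change in the readout output, which is a generic occlusion-style probe. All function names and parameter values are hypothetical.

```python
import numpy as np

def make_reservoir(n_inputs, n_units, spectral_radius=0.9, seed=0):
    """Random input and recurrent weights (assumed uniform initialization)."""
    rng = np.random.default_rng(seed)
    w_in = rng.uniform(-0.5, 0.5, size=(n_units, n_inputs))
    w = rng.uniform(-0.5, 0.5, size=(n_units, n_units))
    # Rescale so the largest eigenvalue magnitude equals spectral_radius.
    w *= spectral_radius / np.max(np.abs(np.linalg.eigvals(w)))
    return w_in, w

def final_state(w_in, w, sequence, leak=1.0):
    """Run the reservoir over a sequence and return the last state."""
    x = np.zeros(w.shape[0])
    for u in sequence:
        x = (1.0 - leak) * x + leak * np.tanh(w_in @ u + w @ x)
    return x

def fit_readout(states, targets, ridge=1e-2):
    """Ridge regression readout: solve (S^T S + ridge * I) W = S^T Y."""
    a = states.T @ states + ridge * np.eye(states.shape[1])
    return np.linalg.solve(a, states.T @ targets)

def absence_effect(w_in, w, w_out, image):
    """Change in readout output when each pixel is zeroed, one at a time.

    Assumption: each image row is one time step, each column one input.
    Returns an array of shape image.shape + (n_outputs,).
    """
    base = final_state(w_in, w, image) @ w_out
    effect = np.zeros(image.shape + base.shape)
    for r in range(image.shape[0]):
        for c in range(image.shape[1]):
            occluded = image.copy()
            occluded[r, c] = 0.0
            effect[r, c] = final_state(w_in, w, occluded) @ w_out - base
    return effect

# Toy usage on random 4x4 "images" with two arbitrary classes.
images = np.random.default_rng(1).random((10, 4, 4))
labels = np.eye(2)[np.arange(10) % 2]
w_in, w = make_reservoir(n_inputs=4, n_units=20)
states = np.stack([final_state(w_in, w, im) for im in images])
w_out = fit_readout(states, labels)
effect_map = absence_effect(w_in, w, w_out, images[0])
```

In this sketch, large magnitudes in `effect_map` mark pixels whose removal shifts the class scores most. A pixel that is already zero yields an effect of exactly zero, since the occluded input is identical to the original.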

updated: Wed Feb 17 2021 08:56:33 GMT+0000 (UTC)

published: Wed Feb 17 2021 08:56:33 GMT+0000 (UTC)

arXiv
