Thought Flow Nets: From Single Predictions to Trains of Model Thought

Hendrik Schuff; Heike Adel; Ngoc Thang Vu

When humans solve complex problems, they rarely come up with a decision right away. Instead, they start with an intuitive decision, reflect upon it, spot mistakes, resolve contradictions and jump between different hypotheses. Thus, they create a sequence of ideas and follow a train of thought that ultimately reaches a conclusive decision. In contrast, today's neural classification models are mostly trained to map an input to a single, fixed output. In this paper, we investigate how we can give models the opportunity of a second, third and k-th thought. We take inspiration from Hegel's dialectics and propose a method that turns an existing classifier's class prediction (such as the image class forest) into a sequence of predictions (such as forest → tree → mushroom). Concretely, we propose a correction module that is trained to estimate the model's correctness, as well as an iterative prediction update based on the prediction's gradient. Our approach results in a dynamic system over class probability distributions: the thought flow. We evaluate our method on diverse datasets and tasks from computer vision and natural language processing. We observe surprisingly complex but intuitive behavior and demonstrate that our method (i) can correct misclassifications, (ii) strengthens model performance, (iii) is robust to high levels of adversarial attacks, (iv) can increase accuracy by up to 4% in a label-distribution-shift setting and (v) provides a tool for model interpretability that uncovers model knowledge which otherwise remains invisible in a single distribution prediction.
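The update described in the abstract can be illustrated with a toy sketch: take a classifier's logits, score them with a correctness estimator, and repeatedly move the logits along the gradient of that score. This is not the authors' implementation. The estimator below (a sigmoid over a linear function of the class probabilities), its `weights` and `bias`, the step size, and the number of steps are all hypothetical stand-ins chosen so the example runs without any trained model.

```python
import math


def softmax(logits):
    """Turn logits into a class probability distribution."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def correctness_score(probs, weights, bias):
    """Hypothetical correction module: sigmoid of a linear function of probs.

    In the paper this role is played by a trained module; the fixed
    weights here are an assumption for illustration only.
    """
    s = sum(w * p for w, p in zip(weights, probs)) + bias
    return 1.0 / (1.0 + math.exp(-s))


def score_gradient(logits, weights, bias):
    """Analytic gradient of the correctness score with respect to the logits."""
    probs = softmax(logits)
    score = correctness_score(probs, weights, bias)
    mean_w = sum(w * p for w, p in zip(weights, probs))
    # Chain rule: d(sigmoid)/ds = score * (1 - score),
    # and ds/dz_j = p_j * (w_j - sum_i w_i * p_i) for a softmax input.
    return [score * (1.0 - score) * p * (w - mean_w)
            for w, p in zip(weights, probs)]


def thought_flow(logits, weights, bias, step_size=1.0, steps=10):
    """Return the sequence of class distributions produced by gradient ascent."""
    z = list(logits)
    flow = [softmax(z)]
    for _ in range(steps):
        grad = score_gradient(z, weights, bias)
        z = [zi + step_size * gi for zi, gi in zip(z, grad)]
        flow.append(softmax(z))
    return flow


def argmax(values):
    """Index of the largest entry."""
    return max(range(len(values)), key=lambda i: values[i])


# Toy example with three classes and made-up estimator parameters.
initial_logits = [2.0, 1.5, 0.1]
toy_weights = [-2.0, 3.0, 0.0]
toy_bias = 0.0
flow = thought_flow(initial_logits, toy_weights, toy_bias)
print([argmax(dist) for dist in flow])
```

Each entry of `flow` is one "thought": a full class distribution, not just a label. Reading off the argmax at every step gives a prediction sequence of the kind the abstract describes, while the intermediate distributions show how probability mass shifts between classes.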

updated: Mon Jul 26 2021 13:56:37 GMT+0000 (UTC)

published: Mon Jul 26 2021 13:56:37 GMT+0000 (UTC)

arXiv
