Reinforcement Based Learning on Classification Task Could Yield Better Generalization and Adversarial Accuracy

Shashi Kant Gupta

分類タスクに関する強化ベースの学習は、より良い一般化と敵対的精度をもたらす可能性があります

ディープラーニングは、コンピュータービジョンで興味深い人気を博しており、主にさまざまなビジョンタスクで人間レベルのパフォーマンスに近いかそれ以上のパフォーマンスを達成しています。しかし、最近の研究では、これらのディープニューラルネットワークが敵対的な例（敵対的な例-元のデータに自然に類似しているが、モデルを間違ったクラスに分類することでモデルをだます）に対して非常に脆弱であることも示されています。人間はそのような摂動に対して非常に頑強です。考えられる理由の1つは、人間が「ターゲットラベル」と「予測ラベル」の間のエラーに基づいて分類することを学習しないが、おそらく予測で受けた強化のためである可能性があります。この作業では、画像分類タスクで深層学習モデルをトレーニングするための新しい方法を提案しました。従来のクロスエントロピー損失の代わりに、強化学習で使用されるバニラポリシー勾配法と同様の報酬ベースの最適化関数を使用してモデルをトレーニングしました。 cifar10データセットの経験的評価は、私たちの方法が、クロスエントロピー損失関数（敵対的トレーニング）を使用してトレーニングされた同じモデルアーキテクチャよりも堅牢な分類器を学習することを示しました。同時に、私たちの方法は、ほとんどの場合、差が2％を超えるクロスエントロピーと比較して、ほとんどの場合、テスト精度とトレイン精度の差が2％未満であるというより良い一般化を示しています。

Deep Learning has become interestingly popular in computer vision, mostly attaining near or above human-level performance in various vision tasks. But recent work has also demonstrated that these deep neural networks are very vulnerable to adversarial examples (adversarial examples - inputs to a model which are naturally similar to original data but fools the model in classifying it into a wrong class). Humans are very robust against such perturbations; one possible reason could be that humans do not learn to classify based on an error between "target label" and "predicted label" but possibly due to reinforcements that they receive on their predictions. In this work, we proposed a novel method to train deep learning models on an image classification task. We used a reward-based optimization function, similar to the vanilla policy gradient method used in reinforcement learning, to train our model instead of conventional cross-entropy loss. An empirical evaluation on the cifar10 dataset showed that our method learns a more robust classifier than the same model architecture trained using cross-entropy loss function (on adversarial training). At the same time, our method shows a better generalization with the difference in test accuracy and train accuracy < 2% for most of the time compared to the cross-entropy one, whose difference most of the time remains > 2%.

updated: Tue Dec 08 2020 11:03:17 GMT+0000 (UTC)

published: Tue Dec 08 2020 11:03:17 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト