Robust Few-shot Learning Without Using any Adversarial Samples

Gaurav Kumar Nayak; Ruchit Rawal; Inder Khatri; Anirban Chakraborty

The high cost of acquiring and annotating samples has made the 'few-shot' learning problem one of prime importance. Existing works mainly focus on improving performance on clean data and overlook robustness on data perturbed with adversarial noise. Recently, a few efforts have been made to combine the few-shot problem with the robustness objective using sophisticated meta-learning techniques. These methods rely on generating adversarial samples in every training episode, which adds a further computational burden. To avoid such time-consuming and complicated procedures, we propose a simple but effective alternative that does not require any adversarial samples. Inspired by the cognitive decision-making process in humans, we enforce high-level feature matching between the base-class data and their corresponding low-frequency samples in the pretraining stage via self-distillation. The model is then fine-tuned on the samples of novel classes, where we additionally improve the discriminability of low-frequency query-set features via cosine similarity. On the 1-shot setting of the CIFAR-FS dataset, our method yields a massive improvement of 60.55% and 62.05% in adversarial accuracy against PGD and the state-of-the-art Auto Attack, respectively, with a minor drop in clean accuracy compared to the baseline. Moreover, our method takes only 1.69× the standard training time while being ≈ 5× faster than state-of-the-art adversarial meta-learning methods. The code is available at https://github.com/vcl-iisc/robust-few-shot-learning.
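
The abstract does not say how the low-frequency samples are built or which loss is used for feature matching, so the following is only a minimal sketch under stated assumptions. It assumes a circular low-pass mask in the Fourier domain with a hypothetical `radius` cutoff, and it uses one minus cosine similarity as a stand-in matching loss. The function names are illustrative and are not taken from the authors' repository.

```python
import numpy as np


def low_frequency_image(image: np.ndarray, radius: int) -> np.ndarray:
    """Keep only spatial frequencies within `radius` of the DC component.

    `image` is an (H, W) or (H, W, C) float array. The circular FFT mask and
    the `radius` cutoff are assumptions made for illustration.
    """
    h, w = image.shape[:2]
    # After fftshift the zero frequency sits at index (h // 2, w // 2).
    yy, xx = np.ogrid[:h, :w]
    mask = (yy - h // 2) ** 2 + (xx - w // 2) ** 2 <= radius ** 2
    if image.ndim == 3:
        mask = mask[..., None]  # broadcast the mask over channels
    spectrum = np.fft.fftshift(np.fft.fft2(image, axes=(0, 1)), axes=(0, 1))
    filtered = np.fft.ifft2(
        np.fft.ifftshift(spectrum * mask, axes=(0, 1)), axes=(0, 1)
    )
    return filtered.real


def cosine_similarity(a: np.ndarray, b: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Row-wise cosine similarity between two (N, D) feature matrices."""
    a = a / (np.linalg.norm(a, axis=-1, keepdims=True) + eps)
    b = b / (np.linalg.norm(b, axis=-1, keepdims=True) + eps)
    return np.sum(a * b, axis=-1)


def feature_matching_loss(feat_clean: np.ndarray, feat_low: np.ndarray) -> float:
    """Assumed stand-in loss: mean of (1 - cosine similarity) over the batch."""
    return float(np.mean(1.0 - cosine_similarity(feat_clean, feat_low)))
```

In such a setup, `feat_clean` and `feat_low` would be the features a network produces for an image and for its low-pass filtered copy; driving the loss toward zero pulls the two representations together. A small `radius` removes more high-frequency detail, and how aggressive the cutoff should be is left open here.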

updated: Thu Nov 03 2022 05:58:26 GMT+0000 (UTC)

published: Thu Nov 03 2022 05:58:26 GMT+0000 (UTC)

arXiv
