RobustART: Benchmarking Robustness on Architecture Design and Training Techniques

Shiyu Tang; Ruihao Gong; Yan Wang; Aishan Liu; Jiakai Wang; Xinyun Chen; Fengwei Yu; Xianglong Liu; Dawn Song; Alan Yuille; Philip H. S. Torr; Dacheng Tao

Deep neural networks (DNNs) are vulnerable to adversarial noise, which motivates benchmarking model robustness. Existing benchmarks mainly focus on evaluating defenses, but there are no comprehensive studies of how architecture design and training techniques affect robustness. Comprehensively benchmarking these relationships is beneficial for better understanding and developing robust DNNs. Thus, we propose RobustART, the first comprehensive Robustness investigation benchmark on ImageNet regarding ARchitecture design (49 human-designed off-the-shelf architectures and 1200+ networks from neural architecture search) and Training techniques (10+ techniques, e.g., data augmentation) towards diverse noises (adversarial, natural, and system noises). Extensive experiments substantiate several insights for the first time, e.g., (1) adversarial training is effective for robustness against all noise types for Transformers and MLP-Mixers; (2) given comparable model sizes and aligned training settings, CNNs > Transformers > MLP-Mixers on robustness against natural and system noises, while Transformers > MLP-Mixers > CNNs on adversarial robustness; (3) for some light-weight architectures, increasing model sizes or using extra data cannot improve robustness. Our benchmark presents: (1) an open-source platform for comprehensive robustness evaluation; (2) a variety of pre-trained models to facilitate robustness evaluation; and (3) a new view to better understand the mechanism towards designing robust DNNs. We will continue to develop this ecosystem for the community.

updated: Fri Jan 14 2022 03:19:06 GMT+0000 (UTC)

published: Sat Sep 11 2021 08:01:14 GMT+0000 (UTC)

arXiv
