Accelerated Zeroth-Order and First-Order Momentum Methods from Mini to Minimax Optimization

Feihu Huang; Shangqian Gao; Jian Pei; Heng Huang

ミニからミニマックス最適化までの加速されたゼロ次および一次運動量法

この論文では、非凸ミニ最適化とミニマックス最適化の両方のための加速されたゼロ次および一次運動量法のクラスを提案します。具体的には、ブラックボックスのミニ最適化のための新しい加速ゼロ次運動量（Acc-ZOM）法を提案します。さらに、Acc-ZOMメソッドがϵ停留点を見つけるためにO（d ^ 3 / 4ϵ ^ -3）のクエリの複雑さを低くし、O（d ^ 1の係数で最もよく知られている結果を改善することを証明します。 / 4）ここで、dは可変次元を示します。特に、Acc-ZOMは、既存の0次確率的アルゴリズムで必要とされる大きなバッチを必要としません。一方、ブラックボックスミニマックス最適化のための加速ゼロ次運動量降下上昇（Acc-ZOMDA）法を提案します。これは、O（（d_1 + d_2）^ 3 /4κ_y^4.5ϵ ^ -3）のクエリ複雑性を取得します。 ϵ停留点を見つけるための大きなバッチなし。ここで、d_1とd_2は可変次元を示し、κ_yは条件数です。さらに、ホワイトボックスミニマックス最適化のための加速一次運動量降下上昇（Acc-MDA）法を提案します。これは、ϵ停留値を見つけるための大きなバッチなしでO（κ_y^ 4.5ϵ ^ -3）の勾配複雑度を持ちます。点。特に、Acc-MDAは、バッチサイズO（κ_y^ 4）でO（κ_y^ 2.5ϵ ^ -3）のより低い勾配複雑度を取得できます。ディープニューラルネットワーク（DNN）へのブラックボックスの敵対的攻撃と中毒攻撃に関する広範な実験結果は、アルゴリズムの効率を示しています。

In the paper, we propose a class of accelerated zeroth-order and first-order momentum methods for both nonconvex mini-optimization and minimax-optimization. Specifically, we propose a new accelerated zeroth-order momentum (Acc-ZOM) method for black-box mini-optimization. Moreover, we prove that our Acc-ZOM method achieves a lower query complexity of O(d^3/4ϵ^-3) for finding an ϵ-stationary point, which improves the best known result by a factor of O(d^1/4) where d denotes the variable dimension. In particular, the Acc-ZOM does not require large batches required in the existing zeroth-order stochastic algorithms. Meanwhile, we propose an accelerated zeroth-order momentum descent ascent (Acc-ZOMDA) method for black-box minimax-optimization, which obtains a query complexity of O((d_1+d_2)^3/4κ_y^4.5ϵ^-3) without large batches for finding an ϵ-stationary point, where d_1 and d_2 denote variable dimensions and κ_y is condition number. Moreover, we propose an accelerated first-order momentum descent ascent (Acc-MDA) method for white-box minimax optimization, which has a gradient complexity of O(κ_y^4.5ϵ^-3) without large batches for finding an ϵ-stationary point. In particular, our Acc-MDA can obtain a lower gradient complexity of O(κ_y^2.5ϵ^-3) with a batch size O(κ_y^4). Extensive experimental results on the black-box adversarial attack to deep neural networks (DNNs) and poisoning attack demonstrate efficiency of our algorithms.

updated: Tue Jan 04 2022 04:35:08 GMT+0000 (UTC)

published: Tue Aug 18 2020 22:19:29 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト