Cyclic Differentiable Architecture Search

Hongyuan Yu; Houwen Peng; Yan Huang; Jianlong Fu; Hao Du; Liang Wang; Haibin Ling

循環微分可能アーキテクチャ検索

微分可能なARchiTectureSearch、つまりDARTSは、ニューラルアーキテクチャ検索で大きな注目を集めています。浅い検索ネットワークで最適なアーキテクチャを見つけようとし、深い評価ネットワークでそのパフォーマンスを測定します。ただし、検索ネットワークと評価ネットワークを独立して最適化すると、2つのネットワーク間の相互作用が可能になるため、改善の余地があります。問題のある最適化問題に対処するために、新しい共同最適化目標と、CDARTSと呼ばれる新しい循環微分可能ARchiTecture検索フレームワークを提案します。構造の違いを考慮して、CDARTSは、内省的な蒸留を使用して、検索ネットワークと評価ネットワークの間に循環フィードバックメカニズムを構築します。まず、検索ネットワークが評価用の初期アーキテクチャを生成し、評価ネットワークの重みが最適化されます。第2に、検索ネットワークのアーキテクチャの重みは、分類におけるラベルの監視と、評価ネットワークから特徴の蒸留による正則化によってさらに最適化されます。上記のサイクルを繰り返すと、検索ネットワークと評価ネットワークが共同で最適化され、最終的な評価ネットワークに合わせてアーキテクチャを進化させることができます。 CIFAR、ImageNet、NAS-Bench-201での実験と分析は、最先端のアプローチに対する提案されたアプローチの有効性を示しています。具体的には、DARTS検索スペースでは、CIFAR10で97.52％のトップ1精度、ImageNetで76.3％のトップ1精度を達成しています。チェーン構造の検索スペースでは、ImageNetで78.2％のトップ1精度を達成します。これは、EfficientNet-B0よりも1.1％高くなります。コードとモデルは、https：//github.com/microsoft/Creamで公開されています。

Differentiable ARchiTecture Search, i.e., DARTS, has drawn great attention in neural architecture search. It tries to find the optimal architecture in a shallow search network and then measures its performance in a deep evaluation network. The independent optimization of the search and evaluation networks, however, leaves room for potential improvement by allowing interaction between the two networks. To address the problematic optimization issue, we propose new joint optimization objectives and a novel Cyclic Differentiable ARchiTecture Search framework, dubbed CDARTS. Considering the structure difference, CDARTS builds a cyclic feedback mechanism between the search and evaluation networks with introspective distillation. First, the search network generates an initial architecture for evaluation, and the weights of the evaluation network are optimized. Second, the architecture weights in the search network are further optimized by the label supervision in classification, as well as the regularization from the evaluation network through feature distillation. Repeating the above cycle results in joint optimization of the search and evaluation networks and thus enables the evolution of the architecture to fit the final evaluation network. The experiments and analysis on CIFAR, ImageNet and NAS-Bench-201 demonstrate the effectiveness of the proposed approach over the state-of-the-art ones. Specifically, in the DARTS search space, we achieve 97.52% top-1 accuracy on CIFAR10 and 76.3% top-1 accuracy on ImageNet. In the chain-structured search space, we achieve 78.2% top-1 accuracy on ImageNet, which is 1.1% higher than EfficientNet-B0. Our code and models are publicly available at https://github.com/microsoft/Cream.

updated: Tue Feb 15 2022 05:44:37 GMT+0000 (UTC)

published: Thu Jun 18 2020 17:55:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト