DASS: Differentiable Architecture Search for Sparse neural networks

Hamid Mousavi; Mohammad Loni; Mina Alibeigi; Masoud Daneshtalab

The deployment of Deep Neural Networks (DNNs) on edge devices is hindered by the substantial gap between performance requirements and available processing power. While recent research has made significant strides in developing pruning methods that build sparse networks to reduce the computing overhead of DNNs, considerable accuracy loss remains, especially at high pruning ratios. We find that architectures designed for dense networks by differentiable architecture search methods are ineffective when pruning mechanisms are applied to them. The main reason is that current methods do not support sparse architectures in their search space and use a search objective that is designed for dense networks and ignores sparsity. In this paper, we propose a new method to search for sparsity-friendly neural architectures. We do this by adding two new sparse operations to the search space and modifying the search objective. Specifically, we propose two novel parametric operations, SparseConv and SparseLinear, to expand the search space to include sparse operations. These operations make the search space flexible because they use sparse parametric versions of the linear and convolution operations. The proposed search objective lets us train the architecture based on the sparsity of the search space operations. Quantitative analyses demonstrate that our searched architectures outperform those used in state-of-the-art sparse networks on the CIFAR-10 and ImageNet datasets. In terms of performance and hardware effectiveness, DASS increases the accuracy of the sparse version of MobileNet-v2 from 73.44% to 81.35% (+7.91% improvement) with 3.87x faster inference time.
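To make the idea of a sparse linear operation inside a differentiable search space concrete, here is a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not specify how SparseLinear is parameterized, so the sketch assumes a fixed binary mask chosen by weight magnitude, and it assumes the usual DARTS-style softmax mixing of candidate operations. The names `magnitude_mask`, `sparse_linear`, and `mixed_op` are hypothetical and chosen for illustration only.

```python
import math


def magnitude_mask(weights, sparsity):
    """Return a 0/1 mask that zeroes the smallest-magnitude weights.

    Assumption: magnitude-based pruning; the abstract does not state
    which criterion DASS uses.
    """
    positions = [
        (abs(w), i, j)
        for i, row in enumerate(weights)
        for j, w in enumerate(row)
    ]
    positions.sort()  # smallest magnitudes first
    n_prune = int(sparsity * len(positions))
    mask = [[1] * len(row) for row in weights]
    for _, i, j in positions[:n_prune]:
        mask[i][j] = 0
    return mask


def sparse_linear(x, weights, bias, mask):
    """Masked linear map: y = (W * M) x + b, with M a 0/1 mask."""
    return [
        sum(w * m * xi for w, m, xi in zip(w_row, m_row, x)) + b
        for w_row, m_row, b in zip(weights, mask, bias)
    ]


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def mixed_op(x, candidate_ops, alphas):
    """DARTS-style relaxation: softmax-weighted sum of candidate outputs.

    Assumption: every candidate op returns a vector of the same length.
    """
    probs = softmax(alphas)
    outputs = [op(x) for op in candidate_ops]
    return [
        sum(p * out[k] for p, out in zip(probs, outputs))
        for k in range(len(outputs[0]))
    ]


if __name__ == "__main__":
    W = [[0.9, -0.1], [0.05, -0.7]]
    b = [0.0, 0.5]
    M = magnitude_mask(W, sparsity=0.5)  # prune half of the weights
    dense = lambda v: sparse_linear(v, W, b, [[1, 1], [1, 1]])
    sparse = lambda v: sparse_linear(v, W, b, M)
    # Equal architecture parameters weight both candidates equally.
    print(M)
    print(mixed_op([1.0, 2.0], [dense, sparse], alphas=[0.0, 0.0]))
```

In a search of this kind, the architecture parameters (`alphas` here) would be learned jointly with the weights, so a sparse candidate competes directly with its dense counterpart instead of being pruned only after the architecture is fixed.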

updated: Tue Sep 12 2023 12:14:56 GMT+0000 (UTC)

published: Thu Jul 14 2022 14:53:50 GMT+0000 (UTC)

arXiv
