FNA++: Fast Network Adaptation via Parameter Remapping and Architecture Search

Jiemin Fang; Yuzhu Sun; Qian Zhang; Kangjian Peng; Yuan Li; Wenyu Liu; Xinggang Wang

FNA ++：パラメータの再マッピングとアーキテクチャ検索による高速ネットワーク適応

ディープニューラルネットワークは、多くのコンピュータービジョンタスクで驚くべきパフォーマンスを実現します。ほとんどの最先端（SOTA）セマンティックセグメンテーションおよびオブジェクト検出アプローチは、イメージ分類用に設計されたニューラルネットワークアーキテクチャをバックボーンとして再利用し、通常はImageNetで事前トレーニングされています。ただし、最近の検出とセグメンテーションに関するニューラルアーキテクチャ検索（NAS）の調査で示されているように、パフォーマンスの向上は、検出とセグメンテーション専用のネットワークアーキテクチャを設計することで実現できます。ただし、大きな課題の1つは、検索空間表現（別名スーパーネットワーク）または検索されたネットワークのImageNet事前トレーニングには、莫大な計算コストがかかることです。このホワイトペーパーでは、シードネットワーク（ImageNetの事前トレーニング済みネットワークなど）のアーキテクチャとパラメータの両方を適応させて、さまざまな深さ、幅、またはカーネルサイズのネットワークにすることができる高速ネットワーク適応（FNA ++）メソッドを提案します。パラメータの再マッピング手法。NASをセグメンテーションおよび検出タスクにはるかに効率的に使用できるようにします。私たちの実験では、MobileNetV2にFNA ++を適用して、手動およびNASの両方で設計された既存のネットワークを明らかに上回る、セマンティックセグメンテーション、オブジェクト検出、および人間の姿勢推定のための新しいネットワークを取得します。また、ResNetおよびNASネットワークにFNA ++を実装しているため、優れた一般化機能が実証されています。 FNA ++の総計算コストは、SOTAセグメンテーションおよび検出NASアプローチよりも大幅に低く、DPCより1737倍、Auto-DeepLabより6.8倍、DetNASより8.0倍低くなっています。一連のアブレーション研究が有効性を実証するために実行され、詳細な分析が機能メカニズムへのより多くの洞察のために提供されます。コードはhttps://github.com/JaminFong/FNAで入手できます。

Deep neural networks achieve remarkable performance in many computer vision tasks. Most state-of-the-art (SOTA) semantic segmentation and object detection approaches reuse neural network architectures designed for image classification as the backbone, commonly pre-trained on ImageNet. However, performance gains can be achieved by designing network architectures specifically for detection and segmentation, as shown by recent neural architecture search (NAS) research for detection and segmentation. One major challenge though is that ImageNet pre-training of the search space representation (a.k.a. super network) or the searched networks incurs huge computational cost. In this paper, we propose a Fast Network Adaptation (FNA++) method, which can adapt both the architecture and parameters of a seed network (e.g. an ImageNet pre-trained network) to become a network with different depths, widths, or kernel sizes via a parameter remapping technique, making it possible to use NAS for segmentation and detection tasks a lot more efficiently. In our experiments, we apply FNA++ on MobileNetV2 to obtain new networks for semantic segmentation, object detection, and human pose estimation that clearly outperform existing networks designed both manually and by NAS. We also implement FNA++ on ResNets and NAS networks, which demonstrates a great generalization ability. The total computation cost of FNA++ is significantly less than SOTA segmentation and detection NAS approaches: 1737x less than DPC, 6.8x less than Auto-DeepLab, and 8.0x less than DetNAS. A series of ablation studies are performed to demonstrate the effectiveness, and detailed analysis is provided for more insights into the working mechanism. Codes are available at https://github.com/JaminFong/FNA.

updated: Wed Dec 16 2020 03:57:51 GMT+0000 (UTC)

published: Sun Jun 21 2020 10:03:34 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト