ME R-CNN: Multi-Expert R-CNN for Object Detection

Hyungtae Lee; Sungmin Eum; Heesung Kwon

ME R-CNN：オブジェクト検出のためのマルチエキスパートR-CNN

複数の専門家（ME）を備えたマルチエキスパート地域ベースの畳み込みニューラルネットワーク（ME R-CNN）を紹介します。このネットワークでは、各専門家が特定のタイプの関心領域（RoI）を処理する方法を学習します。このアーキテクチャは、さまざまな形状、ポーズ、および表示角度によって引き起こされるRoIの外観の変化をより適切にキャプチャします。各RoIを適切な専門家に向けるために、専門家割り当てネットワーク（EAN）と呼ばれる新しい「学習可能な」ネットワークを考案します。 EANは、専門家の割り当てを監督しなくても、最適なRoIと専門家の関係を自動的に学習します。 ME R-CNNの主要コンポーネントであるMEとEANは、共有ネットワークに接続されている間、相互に影響を及ぼし合うため、交互の最適化も単純なエンドツーエンドの最適化も失敗する可能性はありません。この問題に対処するために、ME、EAN、および共有ネットワークをエンドツーエンドで最適化するように調整された実践的なトレーニング戦略を紹介します。両方のアーキテクチャが、PASCAL VOC 07、12、およびMSCOCOデータセットのベースラインよりも大幅なパフォーマンスの向上を提供することを示します。

We introduce Multi-Expert Region-based Convolutional Neural Network (ME R-CNN) which is equipped with multiple experts (ME) where each expert is learned to process a certain type of regions of interest (RoIs). This architecture better captures the appearance variations of the RoIs caused by different shapes, poses, and viewing angles. In order to direct each RoI to the appropriate expert, we devise a novel "learnable" network, which we call, expert assignment network (EAN). EAN automatically learns the optimal RoI-expert relationship even without any supervision of expert assignment. As the major components of ME R-CNN, ME and EAN, are mutually affecting each other while tied to a shared network, neither an alternating nor a naive end-to-end optimization is likely to fail. To address this problem, we introduce a practical training strategy which is tailored to optimize ME, EAN, and the shared network in an end-to-end fashion. We show that both of the architectures provide considerable performance increase over the baselines on PASCAL VOC 07, 12, and MS COCO datasets.

updated: Wed Apr 06 2022 13:22:27 GMT+0000 (UTC)

published: Tue Apr 04 2017 15:35:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト