Multi-adversarial Faster-RCNN for Unrestricted Object Detection

Zhenwei He; Lei Zhang

Conventional object detection methods assume that the training and test data are collected from a restricted target domain, where labeling is expensive. To alleviate domain dependency and the burden of labeling, this paper proposes to detect objects in an unrestricted environment by leveraging domain knowledge learned from an auxiliary source domain with sufficient labels. Specifically, we propose a multi-adversarial Faster-RCNN (MAF) framework for unrestricted object detection, which addresses domain adaptation by minimizing domain disparity in the feature representation. The contributions of the paper are threefold: 1) Based on the observation that object detectors often become domain-incompatible when differences in image distribution cause domain disparity, we propose a hierarchical domain feature alignment module, in which multiple adversarial domain classifier submodules are designed for layer-wise domain feature confusion; 2) An information-invariant scale reduction module (SRM) for resizing hierarchical feature maps is proposed to improve the training efficiency of adversarial domain adaptation; 3) To improve domain adaptability, the aggregated proposal features, together with the detection results, are fed into a proposed weighted gradient reversal layer (WGRL) for characterizing hard confused domain samples. We evaluate MAF on unrestricted tasks, including Cityscapes, KITTI, Sim10k, etc., and the experiments show state-of-the-art performance over existing detectors.
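
The abstract names a weighted gradient reversal layer but does not define it. The sketch below only illustrates the general idea of gradient reversal with per-sample weights in plain Python: the forward pass is the identity, and the backward pass flips the sign of the gradient coming from the domain classifier and scales it per sample. The function names, the `lam` parameter, and the weighting rule in `confidence_weights` are all hypothetical and are not taken from the paper.

```python
import math


def sigmoid(z):
    """Logistic function mapping a logit to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def grl_forward(features):
    """Forward pass of a gradient reversal layer: the identity."""
    return list(features)


def weighted_grl_backward(upstream_grads, weights, lam=1.0):
    """Backward pass: reverse each gradient and scale it per sample.

    upstream_grads: gradients arriving from the domain classifier.
    weights: one non-negative weight per sample (assumed given).
    lam: global reversal strength (hypothetical parameter).
    """
    return [-lam * w * g for g, w in zip(upstream_grads, weights)]


def confidence_weights(logits, domain_labels):
    """Hypothetical weighting rule, NOT the paper's definition.

    A sample gets a larger weight when the domain classifier still
    assigns high probability to its true domain (label 1 or 0), i.e.
    when its features have not yet been confused across domains.
    """
    weights = []
    for z, d in zip(logits, domain_labels):
        p = sigmoid(z)  # predicted probability of domain label 1
        weights.append(p if d == 1 else 1.0 - p)
    return weights
```

With these assumptions, a sample the domain classifier cannot tell apart (probability 0.5) receives half the reversed gradient of a sample it classifies with full confidence. In an actual detector this logic would normally live in a custom autograd function of the training framework rather than in list comprehensions.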

updated: Sat Sep 07 2019 02:08:35 GMT+0000 (UTC)

published: Wed Jul 24 2019 10:12:36 GMT+0000 (UTC)

arXiv
