Should All Proposals be Treated Equally in Object Detection?

Yunsheng Li; Yinpeng Chen; Xiyang Dai; Dongdong Chen; Mengchen Liu; Pei Yu; Jing Yin; Lu Yuan; Zicheng Liu; Nuno Vasconcelos

オブジェクト検出では、すべての提案を同等に扱う必要がありますか？

オブジェクト検出器の複雑さと精度のトレードオフは、リソースに制約のあるビジョンタスクにとって重大な問題です。以前の作品は、効率的なバックボーンで実装された検出器を強調してきました。この作業では、検出ヘッドによる提案処理のこのトレードオフへの影響を調査します。検出効率の向上には、提案の不平等な処理に向けたパラダイムシフトが必要であり、貧弱な提案よりも優れた提案により多くの計算を割り当てることが必要であると仮定されています。これにより、利用可能な計算バジェットの利用率が向上し、同じFLOPSでより高い精度が可能になります。これを学習問題として定式化し、検出ヘッドで演算子を提案に割り当てることで、総計算コストを抑制し、精度を最大化します。重要な発見は、そのようなマッチングは、各提案の埋め込みを演算子上のワンホットコードにマッピングする関数として学習できることです。この機能は複雑な動的ネットワークルーティングメカニズムを誘発しますが、単純なMLPで実装し、既製のオブジェクト検出器でエンドツーエンドで学習することができます。この「動的提案処理」（DPP）は、最先端のエンドツーエンドオブジェクト検出器（DETR、スパースR-CNN）を、特定の計算の複雑さに対して明確なマージンで上回ることが示されています。

The complexity-precision trade-off of an object detector is a critical problem for resource constrained vision tasks. Previous works have emphasized detectors implemented with efficient backbones. The impact on this trade-off of proposal processing by the detection head is investigated in this work. It is hypothesized that improved detection efficiency requires a paradigm shift, towards the unequal processing of proposals, assigning more computation to good proposals than poor ones. This results in better utilization of available computational budget, enabling higher accuracy for the same FLOPS. We formulate this as a learning problem where the goal is to assign operators to proposals, in the detection head, so that the total computational cost is constrained and the precision is maximized. The key finding is that such matching can be learned as a function that maps each proposal embedding into a one-hot code over operators. While this function induces a complex dynamic network routing mechanism, it can be implemented by a simple MLP and learned end-to-end with off-the-shelf object detectors. This 'dynamic proposal processing' (DPP) is shown to outperform state-of-the-art end-to-end object detectors (DETR, Sparse R-CNN) by a clear margin for a given computational complexity.

updated: Thu Jul 07 2022 18:26:32 GMT+0000 (UTC)

published: Thu Jul 07 2022 18:26:32 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト