Discriminative Semantic Feature Pyramid Network with Guided Anchoring for Logo Detection

Baisong Zhang; Weiqing Min; Jing Wang; Sujuan Hou; Qiang Hou; Yuanjie Zheng; Shuqiang Jiang

ロゴ検出のためのガイド付きアンカーを備えた識別セマンティック機能ピラミッドネットワーク

最近、ロゴ検出は、知的財産保護、製品ブランド管理、ロゴ持続時間監視など、マルチメディア分野での幅広いアプリケーションでますます注目を集めています。一般的なオブジェクトの検出とは異なり、ロゴの検出は、特に実際のシナリオでの小さなロゴオブジェクトと大きなアスペクト比のロゴオブジェクトの場合、困難な作業です。この論文では、セマンティック情報を集約し、さまざまなアスペクト比のアンカーボックスを生成することでこれらの課題に対処できる、ガイド付きアンカー付きの識別セマンティック機能ピラミッドネットワーク（DSFP-GA）という新しいアプローチを提案します。より具体的には、私たちのアプローチは、主に識別セマンティック機能ピラミッド（DSFP）とガイド付きアンカー（GA）で構成されています。小さなロゴオブジェクトの検出に使用される低レベルの特徴マップには意味情報が不足していることを考慮して、低レベルの特徴マップのより識別力のある意味特徴を強化し、小さなロゴオブジェクトでより優れたパフォーマンスを実現できるDSFPを提案します。さらに、プリセットアンカーボックスは、アスペクト比の大きいロゴオブジェクトを検出するのに効率的ではありません。したがって、GAをメソッドに統合して、この問題を軽減するために大きなアスペクト比のアンカーボックスを生成します。 4つのベンチマークに関する広範な実験結果は、提案されたDSFP-GAの有効性を示しています。さらに、視覚分析とアブレーション研究をさらに実施して、大小のアスペクトのロゴオブジェクトを検出する際の私たちの方法の利点を説明します。コードとモデルはhttps://github.com/Zhangbaisong/DSFP-GAにあります。

Recently, logo detection has received more and more attention for its wide applications in the multimedia field, such as intellectual property protection, product brand management, and logo duration monitoring. Unlike general object detection, logo detection is a challenging task, especially for small logo objects and large aspect ratio logo objects in the real-world scenario. In this paper, we propose a novel approach, named Discriminative Semantic Feature Pyramid Network with Guided Anchoring (DSFP-GA), which can address these challenges via aggregating the semantic information and generating different aspect ratio anchor boxes. More specifically, our approach mainly consists of Discriminative Semantic Feature Pyramid (DSFP) and Guided Anchoring (GA). Considering that low-level feature maps that are used to detect small logo objects lack semantic information, we propose the DSFP, which can enrich more discriminative semantic features of low-level feature maps and can achieve better performance on small logo objects. Furthermore, preset anchor boxes are less efficient for detecting large aspect ratio logo objects. We therefore integrate the GA into our method to generate large aspect ratio anchor boxes to mitigate this issue. Extensive experimental results on four benchmarks demonstrate the effectiveness of our proposed DSFP-GA. Moreover, we further conduct visual analysis and ablation studies to illustrate the advantage of our method in detecting small and large aspect logo objects. The code and models can be found at https://github.com/Zhangbaisong/DSFP-GA.

updated: Tue Aug 31 2021 11:59:00 GMT+0000 (UTC)

published: Tue Aug 31 2021 11:59:00 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト