Focus for Free in Density-Based Counting

Zenglin Shi; Pascal Mettes; Cees G. M. Snoek

密度ベースの計数に無料で集中

この研究では、画像とそれに対応する点の注釈から数を数える教師あり学習を検討しています。密度ベースの計数方法では、通常、監視信号として機能するガウス密度マップを作成するためにのみポイントアノテーションが使用されますが、この研究の出発点は、ポイントアノテーションには密度マップの生成を超えた計数の可能性があるということです。利用可能なポイントの注釈を再利用して計数パフォーマンスを向上させる 2 つの方法を紹介します。 1 つ目は、ポイントアノテーションを利用して入力画像と密度画像の両方でオクルージョンされたオブジェクトをシミュレートし、オクルージョンに対するネットワークの堅牢性を強化する、カウント固有の拡張です。 2 番目の方法である前景蒸留では、ポイントアノテーションから前景マスクを生成し、そこから背景が黒く塗りつぶされた画像で補助ネットワークをトレーニングします。そうすることで、バックグラウンドからの干渉なしに前景のカウント知識を抽出する方法を学習します。これらの方法は、既存の計数の進歩とシームレスに統合でき、さまざまな損失関数に適応できます。私たちはこれらのアプローチの相補的な効果を実証し、背景のクラッター、オクルージョン、さまざまな群集密度などの困難なシナリオでも堅牢な計数結果を達成できるようにします。私たちが提案したアプローチは、ShanghaiTech Part\_A および Part\_B、UCF\_QNRF、JHU-Crowd++、NWPU-Crowd を含む複数のデータセットで強力な計数結果を達成します。

This work considers supervised learning to count from images and their corresponding point annotations. Where density-based counting methods typically use the point annotations only to create Gaussian-density maps, which act as the supervision signal, the starting point of this work is that point annotations have counting potential beyond density map generation. We introduce two methods that repurpose the available point annotations to enhance counting performance. The first is a counting-specific augmentation that leverages point annotations to simulate occluded objects in both input and density images to enhance the network's robustness to occlusions. The second method, foreground distillation, generates foreground masks from the point annotations, from which we train an auxiliary network on images with blacked-out backgrounds. By doing so, it learns to extract foreground counting knowledge without interference from the background. These methods can be seamlessly integrated with existing counting advances and are adaptable to different loss functions. We demonstrate complementary effects of the approaches, allowing us to achieve robust counting results even in challenging scenarios such as background clutter, occlusion, and varying crowd densities. Our proposed approach achieves strong counting results on multiple datasets, including ShanghaiTech Part\_A and Part\_B, UCF\_QNRF, JHU-Crowd++, and NWPU-Crowd.

updated: Thu Jun 08 2023 11:54:37 GMT+0000 (UTC)

published: Thu Jun 08 2023 11:54:37 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト