A Solution to Product Detection in Densely Packed Scenes

Tianze Rong; Yanjia Zhu; Hongxiang Cai; Yichao Xiong


This work presents a solution for the densely packed scenes dataset SKU-110k. Our detector is a modified Cascade R-CNN. In contrast to regular random cropping, we propose a random crop strategy that keeps both the sampling rate and the input scale sufficiently large. We also adopt several tricks and optimize the hyper-parameters. To grasp the essential features of densely packed scenes, we analyze the stages of the detector and investigate the bottleneck that limits its performance. As a result, our method obtains 58.7 mAP on the test set of SKU-110k.

updated: Tue Aug 10 2021 07:45:52 GMT+0000 (UTC)

published: Thu Jul 23 2020 11:58:45 GMT+0000 (UTC)

arXiv
