Detecting and Matching Related Objects with One Proposal Multiple Predictions

Yang Liu; Luiz G. Hafemann; Michael Jamieson; Mehrsan Javan

1つの提案で関連するオブジェクトを検出して照合する複数の予測

スポーツビデオでのプレーヤーの追跡は、通常、検出による追跡フレームワークで行われ、最初に各フレームでプレーヤーを検出し、次に時間の経過とともに関連付けを実行します。一部のスポーツ追跡プレーヤーはゲーム分析に十分ですが、ホッケー、テニス、ポロなどのスポーツでは、プレーヤーが保持しているオブジェクト（ラケット、スティックなど）を含む追加の検出が必要になる場合があります。この問題のベースラインソリューションには、これらのオブジェクトを個別のクラスとして検出し、それらをユニオンオーバー交差（IoU）に基づくプレーヤー検出と照合することが含まれます。ただし、このアプローチは、プレーヤーとオブジェクトの関係をモデル化していないため、混雑した状況ではマッチングパフォーマンスが低下します。この論文では、同じ提案ボックスを介して複数のオブジェクトを予測するための暗黙的な関連付けを検討することにより、追加コストなしでプレーヤーと関連オブジェクトを一度に検出して照合するシンプルで効率的な方法を提案します。放送されたアイスホッケービデオのデータセットと、COCO + Torsoと呼ばれる新しい公開データセットでこの方法を評価します。アイスホッケーのデータセットでは、提案された方法により、マッチングパフォーマンスが57.1％から81.4％に向上すると同時に、プレーヤーとスティックの検出のmeanAPが68.4％から88.3％に向上します。 COCO + Torsoデータセットでは、マッチングが47.9％から65.2％に向上していることがわかります。 COCO + Torsoデータセット、コード、および事前トレーニング済みモデルは、https：//github.com/foreverYoungGitHub/detect-and-match-related-objectsでリリースされます。

Tracking players in sports videos is commonly done in a tracking-by-detection framework, first detecting players in each frame, and then performing association over time. While for some sports tracking players is sufficient for game analysis, sports like hockey, tennis and polo may require additional detections, that include the object the player is holding (e.g. racket, stick). The baseline solution for this problem involves detecting these objects as separate classes, and matching them to player detections based on the intersection over union (IoU). This approach, however, leads to poor matching performance in crowded situations, as it does not model the relationship between players and objects. In this paper, we propose a simple yet efficient way to detect and match players and related objects at once without extra cost, by considering an implicit association for prediction of multiple objects through the same proposal box. We evaluate the method on a dataset of broadcast ice hockey videos, and also a new public dataset we introduce called COCO +Torso. On the ice hockey dataset, the proposed method boosts matching performance from 57.1% to 81.4%, while also improving the meanAP of player+stick detections from 68.4% to 88.3%. On the COCO +Torso dataset, we see matching improving from 47.9% to 65.2%. The COCO +Torso dataset, code and pre-trained models will be released at https://github.com/foreverYoungGitHub/detect-and-match-related-objects.

updated: Fri Apr 23 2021 14:37:10 GMT+0000 (UTC)

published: Fri Apr 23 2021 14:37:10 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト