DeepDarts: Modeling Keypoints as Objects for Automatic Scorekeeping in Darts using a Single Camera

William McNally; Pascale Walters; Kanav Vats; Alexander Wong; John McPhee

DeepDarts：単一のカメラを使用したダーツでの自動スコアキーピングのオブジェクトとしてのキーポイントのモデリング

スチールチップダーツの自動スコアキーピング用の既存のマルチカメラソリューションは非常に高価であるため、ほとんどのプレーヤーがアクセスできません。よりアクセスしやすい低コストのソリューションを開発することを目的として、キーポイント検出への新しいアプローチを提示し、それを適用して、任意のカメラアングルから撮影した単一の画像からダーツスコアを予測します。この問題には、同じクラスであり、互いに近接して配置されている可能性のある複数のキーポイントの検出が含まれます。ヒートマップを使用してキーポイントを回帰するために広く採用されているフレームワークは、このタスクには適していません。この問題に対処するために、代わりにキーポイントをオブジェクトとしてモデル化することを提案します。このアイデアを中心に深い畳み込みニューラルネットワークを開発し、それを使用して、DeepDartsと呼ばれる自動ダーツスコアリングのパイプライン全体内のダーツの位置とダーツボードのキャリブレーションポイントを予測します。さらに、メソッドの一般化を改善するために、いくつかのタスク固有のデータ拡張戦略を提案します。概念実証として、2つの異なるダーツボードセットアップから発生した16k画像を含む2つのデータセットを手動で収集し、システムを評価するために注釈を付けました。スマートフォンを使用してダーツボードの正面からキャプチャされた15,000枚の画像を含むプライマリデータセットでは、DeepDartsはテスト画像の94.7％で合計スコアを正しく予測しました。限られたトレーニングデータ（830枚の画像）とさまざまなカメラアングルを含む2番目のより挑戦的なデータセットでは、転送学習と広範なデータ拡張を利用して、84.0％のテスト精度を達成します。 DeepDartsは単一の画像のみに依存しているため、エッジデバイスに展開できる可能性があり、スマートフォンを持っている人なら誰でも、スチールチップダーツの自動ダーツスコアリングシステムにアクセスできます。コードとデータセットが利用可能です。

Existing multi-camera solutions for automatic scorekeeping in steel-tip darts are very expensive and thus inaccessible to most players. Motivated to develop a more accessible low-cost solution, we present a new approach to keypoint detection and apply it to predict dart scores from a single image taken from any camera angle. This problem involves detecting multiple keypoints that may be of the same class and positioned in close proximity to one another. The widely adopted framework for regressing keypoints using heatmaps is not well-suited for this task. To address this issue, we instead propose to model keypoints as objects. We develop a deep convolutional neural network around this idea and use it to predict dart locations and dartboard calibration points within an overall pipeline for automatic dart scoring, which we call DeepDarts. Additionally, we propose several task-specific data augmentation strategies to improve the generalization of our method. As a proof of concept, two datasets comprising 16k images originating from two different dartboard setups were manually collected and annotated to evaluate the system. In the primary dataset containing 15k images captured from a face-on view of the dartboard using a smartphone, DeepDarts predicted the total score correctly in 94.7% of the test images. In a second more challenging dataset containing limited training data (830 images) and various camera angles, we utilize transfer learning and extensive data augmentation to achieve a test accuracy of 84.0%. Because DeepDarts relies only on single images, it has the potential to be deployed on edge devices, giving anyone with a smartphone access to an automatic dart scoring system for steel-tip darts. The code and datasets are available.

updated: Thu May 20 2021 16:25:57 GMT+0000 (UTC)

published: Thu May 20 2021 16:25:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト