CamTuner: Reinforcement-Learning based System for Camera Parameter Tuning to enhance Analytics

Sibendu Paul; Kunal Rao; Giuseppe Coviello; Murugan Sankaradas; Oliver Po; Y. Charlie Hu; Srimat T. Chakradhar

CamTuner：分析を強化するためのカメラパラメータ調整のための強化学習ベースのシステム

ビデオ分析システムは、高品質のビデオフレームをキャプチャするビデオカメラに大きく依存して、高い分析精度を実現しています。最近のビデオカメラは、エンドユーザーが設定できる数十の構成可能なパラメーター設定を公開することがよくありますが、エンドユーザーがこれらのパラメーターを再構成するスキルや理解が不足しているため、今日の監視カメラの展開では、パラメーター設定の固定セットが使用されることがよくあります。この論文では、最初に、典型的な監視カメラの展開において、環境条件の変化が、人物検出、顔検出、顔認識などの分析ユニットの精度に大きく影響する可能性があること、およびカメラ設定を動的に調整することによってそのような悪影響を軽減する方法を示します。。次に、既存のビデオ分析パイプライン（VAP）に簡単に適用できるフレームワークであるCAMTUNERを提案します。これにより、複雑なカメラ設定を変化する環境条件に自動的かつ動的に適応させ、VAP内の分析ユニット（AU）の精度を自律的に最適化できます。。 CAMTUNERはSARSA強化学習（RL）に基づいており、軽量の分析品質推定器と仮想カメラという2つの新しいコンポーネントが組み込まれています。 CAMTUNERは、AXIS監視カメラといくつかのVAP（さまざまなAUを含む）を備えたシステムに実装されており、空港の入り口でキャプチャされた1日中の顧客のビデオを処理します。私たちの評価は、CAMTUNERが変化する環境に迅速に適応できることを示しています。 CAMTUNERを、静的カメラ設定を使用する2つの代替アプローチ、またはカメラ設定を1時間ごとに手動で変更するストローマンアプローチ（人間の品質認識に基づく）と比較しました。顔検出と人物検出のAUの場合、CAMTUNERは、2つのアプローチの最良のものと比較して、それぞれ最大13.8％と9.2％高い精度を達成できることがわかりました（両方のAUで平均8％の改善）。

Video analytics systems critically rely on video cameras, which capture high-quality video frames, to achieve high analytics accuracy. Although modern video cameras often expose tens of configurable parameter settings that can be set by end-users, deployment of surveillance cameras today often uses a fixed set of parameter settings because the end-users lack the skill or understanding to reconfigure these parameters. In this paper, we first show that in a typical surveillance camera deployment, environmental condition changes can significantly affect the accuracy of analytics units such as person detection, face detection and face recognition, and how such adverse impact can be mitigated by dynamically adjusting camera settings. We then propose CAMTUNER, a framework that can be easily applied to an existing video analytics pipeline (VAP) to enable automatic and dynamic adaptation of complex camera settings to changing environmental conditions, and autonomously optimize the accuracy of analytics units (AUs) in the VAP. CAMTUNER is based on SARSA reinforcement learning (RL) and it incorporates two novel components: a light-weight analytics quality estimator and a virtual camera. CAMTUNER is implemented in a system with AXIS surveillance cameras and several VAPs (with various AUs) that processed day-long customer videos captured at airport entrances. Our evaluations show that CAMTUNER can adapt quickly to changing environments. We compared CAMTUNER with two alternative approaches where either static camera settings were used, or a strawman approach where camera settings were manually changed every hour (based on human perception of quality). We observed that for the face detection and person detection AUs, CAMTUNER is able to achieve up to 13.8% and 9.2% higher accuracy, respectively, compared to the best of the two approaches (average improvement of 8% for both AUs).

updated: Fri Dec 24 2021 14:34:51 GMT+0000 (UTC)

published: Thu Jul 08 2021 16:43:02 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト