Large-Scale Video Analytics through Object-Level Consolidation

Daniel Rivas; Francesc Guim; Jordà Polo; David Carrera

As the number of installed cameras grows, so do the compute resources required to process and analyze the images they capture. Video analytics enables new use cases, such as smart cities and autonomous driving. At the same time, it pushes service providers to install additional compute resources to cope with demand, while strict latency requirements push compute towards the edge of the network, forming a geographically distributed, heterogeneous set of compute locations that are shared and resource-constrained. Such a landscape of shared and distributed locations forces us to design new techniques that can optimize and distribute work among all available locations and, ideally, make compute requirements grow sublinearly with the number of cameras installed. In this paper, we present FoMO (Focus on Moving Objects). This method optimizes multi-camera deployments by preprocessing the images of each scene, filtering out empty regions, and composing regions of interest from multiple cameras into a single image that serves as input for a pre-trained object detection model. Results show that overall system performance can be increased by 8x while accuracy improves by 40% as a by-product of the methodology, all using an off-the-shelf pre-trained model with no additional training or fine-tuning.
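The filter-and-compose idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how moving regions are found or how they are arranged, so the background-differencing step, the shelf-packing layout, and the names `moving_region` and `compose` are all assumptions made for illustration. Frames are treated as 2-D grayscale NumPy arrays.

```python
import numpy as np


def moving_region(frame, background, threshold=25):
    """Bounding box (top, left, bottom, right) of pixels that differ from the
    background, or None when the scene is empty. Differencing against a static
    background is an assumption, not the method from the paper."""
    diff = np.abs(frame.astype(np.int16) - background.astype(np.int16))
    mask = diff > threshold
    if not mask.any():
        return None
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    return int(rows[0]), int(cols[0]), int(rows[-1]) + 1, int(cols[-1]) + 1


def compose(crops, canvas_shape):
    """Pack crops left to right, then top to bottom (simple shelf packing).
    Returns the canvas and one (top, left) offset per crop, or None for a
    crop that did not fit."""
    height, width = canvas_shape
    canvas = np.zeros(canvas_shape, dtype=np.uint8)
    placements = []
    x = y = shelf_height = 0
    for crop in crops:
        h, w = crop.shape
        if h > height or w > width:
            placements.append(None)      # larger than the whole canvas
            continue
        if x + w > width:                # row is full: start a new shelf
            x, y, shelf_height = 0, y + shelf_height, 0
        if y + h > height:
            placements.append(None)      # canvas is full
            continue
        canvas[y:y + h, x:x + w] = crop
        placements.append((y, x))
        x += w
        shelf_height = max(shelf_height, h)
    return canvas, placements


# Three synthetic cameras sharing the same empty background.
background = np.zeros((40, 60), dtype=np.uint8)
cam_a = background.copy()
cam_a[5:15, 10:30] = 200                 # 10x20 object
cam_b = background.copy()
cam_b[20:28, 40:52] = 180                # 8x12 object
cam_c = background.copy()                # nothing moving: filtered out

crops = []
for frame in (cam_a, cam_b, cam_c):
    box = moving_region(frame, background)
    if box is not None:
        top, left, bottom, right = box
        crops.append(frame[top:bottom, left:right])

canvas, placements = compose(crops, (16, 40))
```

In this toy case a single 16x40 canvas stands in for three 40x60 frames, so a detector would run once on far fewer pixels. A real pipeline would also keep each crop's source camera and original offset alongside `placements`, so that detections on the composite image can be mapped back to the camera they came from.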

updated: Tue Nov 30 2021 14:48:54 GMT+0000 (UTC)

published: Tue Nov 30 2021 14:48:54 GMT+0000 (UTC)

arXiv
