Adaptive Streaming Perception using Deep Reinforcement Learning

Anurag Ghosh; Akshay Nambi; Aditya Singh; Harish YVS; Tanuja Ganu

深層強化学習を使用した適応ストリーミング知覚

ストリーミングビジュアルデータまたはストリーミング知覚でコンピュータービジョンモデルを実行することは、自動運転、具体化されたエージェント、および拡張/仮想現実のアプリケーションで新たに発生している問題です。このようなシステムの開発は、処理パイプラインの精度と待ち時間によって大きく左右されます。過去の研究では多数の近似実行フレームワークが提案されていますが、それらの決定機能は、待ち時間、精度、エネルギーなどの最適化にのみ焦点を当てています。これにより、決定が最適化されず、システム全体のパフォーマンスに影響します。ストリーミング知覚システムは、システム全体のパフォーマンスを全体的に最大化する必要があると主張します（つまり、精度と遅延の両方を同時に考慮する必要があります）。この目的のために、ストリーミング知覚の実行時にこれらのトレードオフを学習するための深層強化学習に基づく新しいアプローチについて説明します。このトレードオフの最適化は、新しいディープコンテキストバンディット問題として定式化され、レイテンシと精度を単一のメトリックに全体的に統合する新しい報酬関数を設計します。私たちのエージェントは、複数の意思決定の側面にわたって競争力のあるポリシーを学習できることを示しています。これは、公開データセットの最先端のポリシーよりも優れています。

Executing computer vision models on streaming visual data, or streaming perception is an emerging problem, with applications in self-driving, embodied agents, and augmented/virtual reality. The development of such systems is largely governed by the accuracy and latency of the processing pipeline. While past work has proposed numerous approximate execution frameworks, their decision functions solely focus on optimizing latency, accuracy, or energy, etc. This results in sub-optimum decisions, affecting the overall system performance. We argue that the streaming perception systems should holistically maximize the overall system performance (i.e., considering both accuracy and latency simultaneously). To this end, we describe a new approach based on deep reinforcement learning to learn these tradeoffs at runtime for streaming perception. This tradeoff optimization is formulated as a novel deep contextual bandit problem and we design a new reward function that holistically integrates latency and accuracy into a single metric. We show that our agent can learn a competitive policy across multiple decision dimensions, which outperforms state-of-the-art policies on public datasets.

updated: Thu Jun 10 2021 11:28:10 GMT+0000 (UTC)

published: Thu Jun 10 2021 11:28:10 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト