VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building

Maureen Daum; Enhao Zhang; Dong He; Stephen Mussmann; Brandon Haynes; Ranjay Krishna; Magdalena Balazinska

VOCALExplore: 従量課金制のビデオデータ探索とモデル構築

ビデオデータセット上でドメイン固有のモデルを構築するユーザーをサポートするために設計されたシステム、VOCALExplore を紹介します。 VOCALExplore は、インタラクティブなラベル付けセッションをサポートし、ユーザー提供のラベルを使用してモデルをトレーニングします。 VOCALExplore は、収集されたラベルで観察されたスキューに基づいてサンプルを選択する方法を自動的に決定することにより、モデルの品質を最大化します。また、機能の選択を上昇中のバンディット問題としてキャストすることにより、モデルをトレーニングするときに使用する最適なビデオ表現を選択します。最後に、VOCALExplore は最適化を実装して、モデルのパフォーマンスを犠牲にすることなく低レイテンシを実現します。 VOCALExplore が、候補取得関数と特徴抽出器を考慮して、可能な限り最高のモデル品質に近いものを達成することを実証します。これは、目に見えるレイテンシが低く (反復ごとに約 1 秒)、高価な前処理が不要であることを示しています。

We introduce VOCALExplore, a system designed to support users in building domain-specific models over video datasets. VOCALExplore supports interactive labeling sessions and trains models using user-supplied labels. VOCALExplore maximizes model quality by automatically deciding how to select samples based on observed skew in the collected labels. It also selects the optimal video representations to use when training models by casting feature selection as a rising bandit problem. Finally, VOCALExplore implements optimizations to achieve low latency without sacrificing model performance. We demonstrate that VOCALExplore achieves close to the best possible model quality given candidate acquisition functions and feature extractors, and it does so with low visible latency (~1 second per iteration) and no expensive preprocessing.

updated: Tue Mar 07 2023 17:26:04 GMT+0000 (UTC)

published: Tue Mar 07 2023 17:26:04 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト