VidHarm: A Clip Based Dataset for Harmful Content Detection

Johan Edstedt; Amanda Berg; Michael Felsberg; Johan Karlsson; Francisca Benavente; Anette Novak; Gustav Grund Pihlgren

VidHarm: 有害なコンテンツを検出するためのクリップベースのデータセット

ビデオ内の有害なコンテンツを自動的に識別することは、さまざまなアプリケーションで重要なタスクです。ただし、専門的にラベル付けされた利用可能なオープンデータセットが不足しています。この作品では、専門家によって注釈が付けられた映画の予告編からの 3589 のビデオクリップのオープンデータセットである VidHarm が提示されています。データセットの分析が実行され、特にクリップと予告編レベルの注釈の関係が明らかになります。視聴覚モデルはデータセットでトレーニングされ、モデリングの選択に関する詳細な調査が行われます。結果は、ビジュアルとオーディオのモダリティ、大規模なビデオ認識データセットでの事前トレーニング、およびクラスバランスサンプリングを組み合わせることで、パフォーマンスが大幅に向上することを示しています。最後に、訓練されたモデルのバイアスは、識別プロービングを使用して調査されます。 VidHarm は公開されており、詳細については https://vidharm.github.io をご覧ください。

Automatically identifying harmful content in video is an important task with a wide range of applications. However, there is a lack of professionally labeled open datasets available. In this work VidHarm, an open dataset of 3589 video clips from film trailers annotated by professionals, is presented. An analysis of the dataset is performed, revealing among other things the relation between clip and trailer level annotations. Audiovisual models are trained on the dataset and an in-depth study of modeling choices conducted. The results show that performance is greatly improved by combining the visual and audio modality, pre-training on large-scale video recognition datasets, and class balanced sampling. Lastly, biases of the trained models are investigated using discrimination probing. VidHarm is openly available, and further details are available at: https://vidharm.github.io

updated: Fri Sep 02 2022 15:16:09 GMT+0000 (UTC)

published: Tue Jun 15 2021 17:57:12 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト