Is this Harmful? Learning to Predict Harmfulness Ratings from Video

Johan Edstedt; Johan Karlsson; Francisca Benavente; Anette Novak; Amanda Berg; Michael Felsberg

これは有害ですか？ビデオから有害性評価を予測することを学ぶ

ビデオ内の有害なコンテンツを自動的に識別することは、さまざまなアプリケーションで重要なタスクです。ただし、高品質のラベルを収集することは困難であり、計算要件も厳しいため、このタスクには満足のいく一般的なアプローチがありませんでした。通常、暴力的なコンテンツの特定など、問題の小さなサブセットのみが考慮されます。一般的な問題に取り組む場合、ラベルの欠如と計算の複雑さに対処するために、大まかな近似と簡略化が行われます。この作業では、2つの主要な障害を特定して取り組みます。まず、この分野の専門家によって注釈が付けられた約4000のビデオクリップのデータセットを作成します。次に、ビデオ認識の進歩により、シーンの完全なコンテキストを考慮したデータセットのトレーニングモデルが可能になることを示します。モデリングの選択について詳細な調査を行ったところ、視覚と音声のモダリティを組み合わせることで大きなメリットが得られ、大規模なビデオ認識データセットとクラスバランスサンプリングの事前トレーニングによってパフォーマンスがさらに向上することがわかりました。さらに、データセットの非常にマルチモーダルな性質を明らかにする定性的調査を実行します。私たちのデータセットは公開時に利用可能になります。

Automatically identifying harmful content in video is an important task with a wide range of applications. However, due to the difficulty of collecting high-quality labels as well as demanding computational requirements, the task has not had a satisfying general approach. Typically, only small subsets of the problem are considered, such as identifying violent content. In cases where the general problem is tackled, rough approximations and simplifications are made to deal with the lack of labels and computational complexity. In this work, we identify and tackle the two main obstacles. First, we create a dataset of approximately 4000 video clips, annotated by professionals in the field. Secondly, we demonstrate that advances in video recognition enable training models on our dataset that consider the full context of the scene. We conduct an in-depth study on our modeling choices and find that we greatly benefit from combining the visual and audio modality and that pretraining on large-scale video recognition datasets and class balanced sampling further improves performance. We additionally perform a qualitative study that reveals the heavily multi-modal nature of our dataset. Our dataset will be made available upon publication.

updated: Tue Jun 15 2021 17:57:12 GMT+0000 (UTC)

published: Tue Jun 15 2021 17:57:12 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト