Overlooked Video Classification in Weakly Supervised Video Anomaly Detection

Weijun Tan; Qi Yao; Jingfeng Liu

弱い教師ありビデオ異常検出における見落とされたビデオ分類

現在の監視が弱いビデオ異常検出アルゴリズムは、ほとんどの場合、複数インスタンス学習 (MIL) またはそのバリエーションを使用しています。最近のほとんどすべてのアプローチは、パフォーマンスを改善するためのトレーニング用の正しいスニペットを選択する方法に焦点を当てています。彼らは、異常検出のパフォーマンスを向上させるビデオ分類の力を見過ごしているか、認識していません。このホワイトペーパーでは、BERT または LSTM を使用したビデオ分類監視の能力を明示的に検討します。この BERT または LSTM を使用すると、ビデオのすべてのスニペットの CNN 機能を単一の機能に集約して、ビデオの分類に使用できます。 MIL フレームワークに組み込まれたこのシンプルかつ強力なビデオ分類監視は、3 つの主要なビデオ異常検出データセットすべてで驚異的なパフォーマンス向上をもたらします。特に、XD-Violence の平均精度 (mAP) が SOTA 78.84% から新しい 82.10% に向上します。ソースコードは https://github.com/wjtan99/BERT_Anomaly_Video_Classification で入手できます。

Current weakly supervised video anomaly detection algorithms mostly use multiple instance learning (MIL) or their varieties. Almost all recent approaches focus on how to select the correct snippets for training to improve the performance. They overlook or do not realize the power of video classification in boosting the performance of anomaly detection. In this paper, we study explicitly the power of video classification supervision using a BERT or LSTM. With this BERT or LSTM, CNN features of all snippets of a video can be aggregated into a single feature which can be used for video classification. This simple yet powerful video classification supervision, combined into the MIL framework, brings extraordinary performance improvement on all three major video anomaly detection datasets. Particularly it improves the mean average precision (mAP) on the XD-Violence from SOTA 78.84% to new 82.10%. The source code is available at https://github.com/wjtan99/BERT_Anomaly_Video_Classification.

updated: Thu Oct 13 2022 03:00:22 GMT+0000 (UTC)

published: Thu Oct 13 2022 03:00:22 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト