An audiovisual and contextual approach for categorical and continuous emotion recognition in-the-wild

Panagiotis Antoniadis; Ioannis Pikoulis; Panagiotis P. Filntisis; Petros Maragos

In this work we tackle video-based audio-visual emotion recognition in the context of the 2nd Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW). Poor illumination, head/body orientation, and low image resolution can hinder the performance of methods that rely solely on extracting and analyzing facial features. To alleviate this problem, we leverage bodily and contextual features as part of a broader emotion recognition framework. We use a standard CNN-RNN cascade as the backbone of our proposed model for sequence-to-sequence (seq2seq) learning. Apart from learning through the RGB input modality, we construct an aural stream that operates on sequences of extracted mel-spectrograms. Our extensive experiments on the challenging and newly assembled Affect-in-the-wild-2 (Aff-Wild2) dataset verify the superiority of our methods over existing approaches, and by incorporating all of the aforementioned modules in a network ensemble we surpass the previous best published recognition scores on the official validation set. All code was implemented using PyTorch (https://pytorch.org/) and is publicly available at https://github.com/PanosAntoniadis/NTUA-ABAW2021.
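As a rough illustration of what a CNN-RNN cascade for seq2seq learning can look like in PyTorch, the sketch below runs a small per-frame CNN and feeds the resulting feature sequence to a GRU that emits one prediction per frame. It is not the authors' architecture: the class name `CnnRnnCascade`, the layer sizes, the choice of a GRU, the seven-class categorical head, and the two-dimensional continuous head are all assumptions made for this example. The actual models are in the linked repository.

```python
import torch
import torch.nn as nn


class CnnRnnCascade(nn.Module):
    """Toy CNN-RNN cascade (hypothetical): per-frame CNN, then a GRU over time."""

    def __init__(self, num_classes=7, feat_dim=32, hidden_dim=64):
        super().__init__()
        # Small per-frame feature extractor; a real system would likely use
        # a pretrained backbone instead (assumption for illustration).
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv2d(16, feat_dim, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        # Recurrent layer that models temporal context across frames.
        self.rnn = nn.GRU(feat_dim, hidden_dim, batch_first=True)
        # One output per time step: categorical logits and continuous values.
        self.cls_head = nn.Linear(hidden_dim, num_classes)
        self.va_head = nn.Linear(hidden_dim, 2)

    def forward(self, frames):
        # frames: (batch, time, channels, height, width)
        b, t, c, h, w = frames.shape
        feats = self.cnn(frames.reshape(b * t, c, h, w)).flatten(1)
        hidden, _ = self.rnn(feats.reshape(b, t, -1))
        logits = self.cls_head(hidden)          # (batch, time, num_classes)
        va = torch.tanh(self.va_head(hidden))   # (batch, time, 2), in [-1, 1]
        return logits, va


model = CnnRnnCascade()
clip = torch.randn(2, 8, 3, 64, 64)  # two random clips of eight RGB frames
logits, va = model(clip)
print(logits.shape, va.shape)
```

Because every input frame yields one output, the model maps a sequence to a sequence of the same length. In principle a similar cascade could process a sequence of single-channel mel-spectrogram windows by changing the first convolution's input channels, though that is a suggestion of this sketch and not a description of the authors' aural stream.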

updated: Sat Jul 10 2021 13:07:15 GMT+0000 (UTC)

published: Wed Jul 07 2021 20:13:17 GMT+0000 (UTC)

arXiv
