An audiovisual and contextual approach for categorical and continuous emotion recognition in-the-wild

Panagiotis Antoniadis; Ioannis Pikoulis; Panagiotis P. Filntisis; Petros Maragos

In this work we tackle the task of video-based audio-visual emotion recognition, in the context of the 2nd Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW2). Poor illumination conditions, head/body orientation and low image resolution can hinder the performance of methods that rely solely on the extraction and analysis of facial features. To alleviate this problem, we leverage both bodily and contextual features as part of a broader emotion recognition framework. We use a standard CNN-RNN cascade as the backbone of our proposed model for sequence-to-sequence (seq2seq) learning. Apart from learning through the RGB input modality, we construct an aural stream which operates on sequences of extracted mel-spectrograms. Our extensive experiments on the challenging and newly assembled Aff-Wild2 dataset confirm the validity of our intuitive multi-stream and multi-modal approach towards emotion recognition in-the-wild. We emphasize the beneficial influence of the human body and scene context, aspects of the emotion recognition process that have been left relatively unexplored up to this point. All the code was implemented using PyTorch and is publicly available.
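To make the architecture described above more concrete, the following is a minimal PyTorch sketch of a multi-stream CNN-RNN cascade for seq2seq prediction. It is not the authors' implementation: the class names (`FrameEncoder`, `MultiStreamSeq2Seq`), the tiny convolutional encoders, the concatenation-based fusion, the GRU, the choice of 7 expression classes, the `tanh`-bounded valence/arousal head and all tensor sizes are assumptions made purely for illustration.

```python
import torch
import torch.nn as nn


class FrameEncoder(nn.Module):
    """Tiny stand-in CNN (hypothetical) mapping one image-like input to a feature vector."""

    def __init__(self, in_channels: int, feat_dim: int = 32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_channels, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # global average pool -> (N, 16, 1, 1)
            nn.Flatten(),             # -> (N, 16)
            nn.Linear(16, feat_dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)


class MultiStreamSeq2Seq(nn.Module):
    """One CNN per stream, features concatenated per time step, then a GRU.

    Fusion by concatenation and the two output heads are assumptions,
    not details taken from the abstract.
    """

    def __init__(self, num_classes: int = 7, feat_dim: int = 32, hidden: int = 64):
        super().__init__()
        self.encoders = nn.ModuleDict({
            "face": FrameEncoder(3, feat_dim),     # RGB face crop
            "body": FrameEncoder(3, feat_dim),     # RGB body crop
            "context": FrameEncoder(3, feat_dim),  # RGB full scene
            "audio": FrameEncoder(1, feat_dim),    # mel-spectrogram window
        })
        self.rnn = nn.GRU(4 * feat_dim, hidden, batch_first=True)
        self.expr_head = nn.Linear(hidden, num_classes)  # categorical emotion logits
        self.va_head = nn.Linear(hidden, 2)              # continuous valence/arousal

    def forward(self, streams: dict):
        feats = []
        for name, encoder in self.encoders.items():
            x = streams[name]                    # (B, T, C, H, W)
            b, t = x.shape[:2]
            f = encoder(x.flatten(0, 1))         # fold time into batch: (B*T, feat_dim)
            feats.append(f.reshape(b, t, -1))    # unfold: (B, T, feat_dim)
        fused = torch.cat(feats, dim=-1)         # (B, T, 4 * feat_dim)
        out, _ = self.rnn(fused)                 # one hidden state per time step
        return self.expr_head(out), torch.tanh(self.va_head(out))


# Usage with random tensors standing in for real clips (sizes are arbitrary).
B, T = 2, 5
streams = {
    "face": torch.randn(B, T, 3, 32, 32),
    "body": torch.randn(B, T, 3, 32, 32),
    "context": torch.randn(B, T, 3, 32, 32),
    "audio": torch.randn(B, T, 1, 64, 16),
}
model = MultiStreamSeq2Seq()
expr_logits, valence_arousal = model(streams)
```

In this sketch the model emits one prediction per input time step, which is what makes it sequence-to-sequence: `expr_logits` has shape `(B, T, 7)` and `valence_arousal` has shape `(B, T, 2)`. In practice the stand-in encoders would be replaced by pretrained backbones, and the mel-spectrogram windows would be extracted from the audio track aligned with each frame.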

updated: Fri Aug 13 2021 16:39:08 GMT+0000 (UTC)

published: Wed Jul 07 2021 20:13:17 GMT+0000 (UTC)

arXiv

