Hybrid Facial Expression Recognition (FER2013) Model for Real-Time Emotion Classification and Prediction

Ozioma Collins Oguine; Kaleab Alamayehu Kinfu; Kanyifeechukwu Jane Oguine; Hashim Ibrahim Bisallah; Daniel Ofuani

Facial expression recognition (FER) is an important research topic in fields ranging from artificial intelligence and gaming to human-computer interaction (HCI) and psychology. This paper proposes a hybrid model for facial expression recognition that combines a Deep Convolutional Neural Network (DCNN) with a Haar Cascade model. The objective is to classify real-time and digital facial images into one of the seven facial emotion categories considered. The DCNN employed in this research uses more convolutional layers, ReLU activation functions, and multiple kernels to enhance filtering depth and facial feature extraction. Alongside it, a Haar Cascade model is used to detect facial features in real-time images and video frames. The model is trained on grayscale images from the Kaggle repository (FER-2013), and Graphics Processing Unit (GPU) computation is exploited to expedite the training and validation process. Pre-processing and data augmentation techniques are applied to improve training efficiency and classification performance. The experimental results show significantly improved classification performance compared to state-of-the-art (SoTA) experiments and research. Compared to other conventional models, the paper also validates that the proposed architecture is superior in classification performance, with an improvement of up to 6%, reaching up to 70% accuracy, and with a shorter execution time of 2098.8 s.
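
The abstract mentions pre-processing, data augmentation, ReLU activations, and classification into seven emotion categories. As a rough illustration of those pieces only, here is a minimal pure-Python sketch. It is not the authors' implementation: the function names, the choice of horizontal flipping as the augmentation, the scaling to [0, 1], and the label order (taken from the usual FER-2013 convention) are all assumptions that the abstract does not state.

```python
import math

# Seven emotion categories, in the label order commonly used for FER-2013.
# Assumption: the abstract only says "seven categories" and does not list them.
EMOTIONS = ["angry", "disgust", "fear", "happy", "sad", "surprise", "neutral"]


def normalize(pixels):
    """Scale 8-bit grayscale values (0-255) into the range [0, 1]."""
    return [[value / 255.0 for value in row] for row in pixels]


def horizontal_flip(pixels):
    """Mirror an image left to right (one illustrative augmentation)."""
    return [list(reversed(row)) for row in pixels]


def relu(values):
    """ReLU activation: negative inputs become zero, others pass through."""
    return [max(0.0, v) for v in values]


def softmax(logits):
    """Turn raw class scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(logits):
    """Map seven network output scores to (emotion label, probability)."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return EMOTIONS[best], probs[best]
```

In a full pipeline of the kind described, a face region located by the Haar Cascade step would be cropped, normalized, and passed through the DCNN, and a function like `classify` would be applied to the network's seven output scores.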

updated: Sun Jun 19 2022 23:43:41 GMT+0000 (UTC)

published: Sun Jun 19 2022 23:43:41 GMT+0000 (UTC)

arXiv
