Recent Advances and Trends in Multimodal Deep Learning: A Review

Jabeen Summaira; Xi Li; Amin Muhammad Shoib; Songyuan Li; Jabbar Abdul

Deep learning has been applied to a wide range of applications and has become increasingly popular in recent years. The goal of multimodal deep learning is to create models that can process and link information across multiple modalities. Despite extensive progress in unimodal learning, it still cannot cover all aspects of human learning. Multimodal learning supports better understanding and analysis when multiple senses are engaged in processing information. This paper focuses on multiple types of modalities, i.e., image, video, text, audio, body gestures, facial expressions, and physiological signals. It provides a detailed analysis of past and current baseline approaches and an in-depth study of recent advances in multimodal deep learning applications. A fine-grained taxonomy of multimodal deep learning applications is proposed, and the individual applications are elaborated in more depth. The architectures and datasets used in these applications are also discussed, along with their evaluation metrics. Finally, the main issues are highlighted separately for each domain, along with possible future research directions.

updated: Mon May 24 2021 04:20:45 GMT+0000 (UTC)

published: Mon May 24 2021 04:20:45 GMT+0000 (UTC)

arXiv
