Intelligent 3D Network Protocol for Multimedia Data Classification using Deep Learning

Arslan Syed; Eman A. Aldhahri; Muhammad Munawar Iqbal; Abid Ali; Ammar Muthanna; Harun Jamil; Faisal Jamil

In videos, human actions are three-dimensional (3D) signals that carry spatiotemporal information about human behavior. 3D convolutional neural networks (CNNs) are a promising way to exploit this information. However, 3D CNNs have not yet matched the performance that their well-established two-dimensional (2D) counterparts achieve on still images. Both the memory demands of 3D convolution and spatiotemporal fusion make training difficult, which prevents 3D CNNs from achieving remarkable results. In this paper, we implement a hybrid deep learning architecture that combines STIP and 3D CNN features to effectively improve performance on 3D video. Once implemented, the architecture allows a more detailed and deeper mapping to be trained in each cycle of spatiotemporal fusion. The trained model further improves the results after handling complicated model evaluations. The implementation uses a video classification model. The Intelligent 3D Network Protocol for Multimedia Data Classification using Deep Learning is introduced to better understand spatiotemporal associations in human activities. The proposed hybrid technique is evaluated on the well-known UCF101 dataset, where it substantially outperforms the original 3D CNNs. The results, with an accuracy of 95%, are compared with state-of-the-art frameworks from the literature for action recognition on UCF101.
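The idea of combining 3D CNN features with STIP features can be illustrated with a minimal sketch. The abstract does not say how the two feature sets are combined, so the simple concatenation below is an assumption, as are all function names, kernel values, and the placeholder descriptor (STIP is usually read as space-time interest points; the abstract does not expand the acronym). The code is plain Python and only shows the general shape of such a fusion, not the authors' architecture.

```python
# Illustrative sketch only. All names, kernels, and values are hypothetical
# and are NOT taken from the paper.

def conv3d_valid(volume, kernel):
    """Naive 'valid' 3D cross-correlation over a (time, height, width) nested list."""
    T, H, W = len(volume), len(volume[0]), len(volume[0][0])
    kt, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    out = []
    for t in range(T - kt + 1):
        plane = []
        for y in range(H - kh + 1):
            row = []
            for x in range(W - kw + 1):
                s = 0.0
                # Sum over the temporal and both spatial kernel axes.
                for dt in range(kt):
                    for dy in range(kh):
                        for dx in range(kw):
                            s += volume[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
                row.append(s)
            plane.append(row)
        out.append(plane)
    return out


def global_average(feature_map):
    """Collapse a 3D feature map to one scalar by averaging all positions."""
    values = [v for plane in feature_map for row in plane for v in row]
    return sum(values) / len(values)


def fuse(cnn_features, handcrafted_descriptor):
    """Assumed fusion rule: concatenate the two feature vectors."""
    return list(cnn_features) + list(handcrafted_descriptor)


# Toy clip: 4 frames of 5x5 pixels whose brightness equals the frame index.
clip = [[[float(t) for _ in range(5)] for _ in range(5)] for t in range(4)]

# Two hand-picked kernels standing in for learned 3D filters.
temporal_diff = [[[-1.0]], [[1.0]]]  # responds to change over time
box_average = [[[0.125] * 2 for _ in range(2)] for _ in range(2)]  # 2x2x2 mean

cnn_features = [global_average(conv3d_valid(clip, k))
                for k in (temporal_diff, box_average)]

# Placeholder standing in for a STIP-based descriptor (made-up values).
stip_descriptor = [0.2, 0.5, 0.3]

fused = fuse(cnn_features, stip_descriptor)
print(fused)
```

In a real pipeline the fused vector would be passed to a classifier. Concatenation is used here only because it is the simplest way to let a downstream layer see both feature sets at once.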

updated: Sat Jul 23 2022 12:24:52 GMT+0000 (UTC)

published: Sat Jul 23 2022 12:24:52 GMT+0000 (UTC)

arXiv
