Video Content Classification using Deep Learning

Pradyumn Patil; Vishwajeet Pawar; Yashraj Pawar; Shruti Pisal

ディープラーニングを使用したビデオコンテンツの分類

ビデオコンテンツの分類は、コンピュータビジョンの重要な研究コンテンツであり、画像やビデオの検索、コンピュータビジョンなど、多くの分野で広く使用されています。このホワイトペーパーでは、畳み込みニューラルネットワーク（CNN）とリカレントニューラルネットワーク（RNN）を組み合わせたモデルを紹介します。このモデルは、ビデオコンテンツのタイプを識別し、それらを次のようなカテゴリに分類できる深層学習ネットワークを開発、トレーニング、最適化します。アニメーション、ゲーム、ナチュラルコンテンツ、フラットコンテンツなど」。モデルのパフォーマンスを向上させるために、キーフレームのみを分類するための新しいキーフレーム抽出方法が含まれているため、パフォーマンスを大幅に犠牲にすることなく、全体的な処理時間を短縮できます。

Video content classification is an important research content in computer vision, which is widely used in many fields, such as image and video retrieval, computer vision. This paper presents a model that is a combination of Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) which develops, trains, and optimizes a deep learning network that can identify the type of video content and classify them into categories such as "Animation, Gaming, natural content, flat content, etc". To enhance the performance of the model novel keyframe extraction method is included to classify only the keyframes, thereby reducing the overall processing time without sacrificing any significant performance.

updated: Sat Nov 27 2021 04:36:17 GMT+0000 (UTC)

published: Sat Nov 27 2021 04:36:17 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト