LR-Net: A Block-based Convolutional Neural Network for Low-Resolution Image Classification

Ashkan Ganj; Mohsen Ebadpour; Mahdi Darvish; Hamid Bahador

LR-Net: 低解像度画像分類のためのブロックベースの畳み込みニューラルネットワーク

特徴の学習と抽出における画像分類に関する CNN ベースのアーキテクチャの成功により、最近では人気が高まっていますが、最先端のモデルを適用してノイズの多い低品質の画像を分類すると、画像分類のタスクはより困難になります。モデルがこのタイプの画像から意味のある特徴を抽出することは、解像度が低く、意味のある全体的な特徴がないため、依然として困難です。さらに、高解像度の画像をトレーニングするにはより多くのレイヤーが必要です。つまり、トレーニングにより多くの時間と計算能力が必要になります。私たちの方法は、前述のディープニューラルネットワークで層が深くなるにつれて、勾配が消失するという問題にも対処します。これらすべての問題に対処するために、ぼやけたノイズのある低解像度画像から低レベルの特徴とグローバルな特徴の両方を学習するように設計されたブロックで構成される、新しい画像分類アーキテクチャを開発しました。ブロックの設計は、パフォーマンスを向上させ、パラメーターサイズを縮小するために、Residual Connections および Inception モジュールの影響を強く受けました。また、MNISTファミリーのデータセットを使用して作業を評価します。特に、低品質でノイズの多い画像のために分類が最も難しいOracle-MNISTデータセットに重点を置いています。提示されたアーキテクチャが既存の最先端の畳み込みニューラルネットワークよりも高速で正確であることを示す詳細なテストを実行しました。さらに、私たちのモデルのユニークな特性により、より少ないパラメーターでより良い結果を生み出すことができます。

The success of CNN-based architecture on image classification in learning and extracting features made them so popular these days, but the task of image classification becomes more challenging when we apply state of art models to classify noisy and low-quality images. It is still difficult for models to extract meaningful features from this type of image due to its low-resolution and the lack of meaningful global features. Moreover, high-resolution images need more layers to train which means they take more time and computational power to train. Our method also addresses the problem of vanishing gradients as the layers become deeper in deep neural networks that we mentioned earlier. In order to address all these issues, we developed a novel image classification architecture, composed of blocks that are designed to learn both low level and global features from blurred and noisy low-resolution images. Our design of the blocks was heavily influenced by Residual Connections and Inception modules in order to increase performance and reduce parameter sizes. We also assess our work using the MNIST family datasets, with a particular emphasis on the Oracle-MNIST dataset, which is the most difficult to classify due to its low-quality and noisy images. We have performed in-depth tests that demonstrate the presented architecture is faster and more accurate than existing cutting-edge convolutional neural networks. Furthermore, due to the unique properties of our model, it can produce a better result with fewer parameters.

updated: Sun May 28 2023 20:59:42 GMT+0000 (UTC)

published: Tue Jul 19 2022 20:01:11 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト