Rethinking Recurrent Neural Networks and Other Improvements for Image Classification

Nguyen Huu Phong; Bernardete Ribeiro

リカレントニューラルネットワークの再考と画像分類のための他の改善

数十年にまでさかのぼる機械学習の長い歴史の中で、リカレントニューラルネットワーク（RNN）は、主にシーケンシャルデータと時系列または一般的に1D情報に使用されてきました。 2D画像に関するいくつかのまれな研究でも、ネットワークは画像の認識ではなく、データを順次学習して生成するだけです。この研究では、画像認識のモデルを設計する際の追加レイヤーとしてRNNを統合することを提案します。さらに、複数のモデルから専門家の予測を学習できるエンドツーエンドのアンサンブルマルチモデルを開発しています。さらに、トレーニング戦略とソフトマックスプルーニングを拡張し、全体として、複数のデータセットの上位モデルと同等のパフォーマンスを発揮するように設計を導きます。この記事で提供されているメソッドのソースコードは、https：//github.com/leonlha/e2e-3mおよびhttp://nguyenhuuphong.meで入手できます。

For a long history of Machine Learning which dates back to several decades, Recurrent Neural Networks (RNNs) have been mainly used for sequential data and time series or generally 1D information. Even in some rare researches on 2D images, the networks merely learn and generate data sequentially rather than for recognition of images. In this research, we propose to integrate RNN as an additional layer in designing image recognition's models. Moreover, we develop End-to-End Ensemble Multi-models that are able to learn experts' predictions from several models. Besides, we extend training strategy and softmax pruning which overall leads our designs to perform comparably to top models on several datasets. The source code of the methods provided in this article is available in https://github.com/leonlha/e2e-3m and http://nguyenhuuphong.me.

updated: Wed Mar 03 2021 02:47:41 GMT+0000 (UTC)

published: Thu Jul 30 2020 00:40:50 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト