Can Attention Enable MLPs To Catch Up With CNNs?

Meng-Hao Guo; Zheng-Ning Liu; Tai-Jiang Mu; Dun Liang; Ralph R. Martin; Shi-Min Hu

In the first week of May 2021, researchers from four institutions (Google, Tsinghua University, Oxford University, and Facebook) shared their latest work [16, 7, 12, 17] on arXiv.org almost simultaneously. Each proposed a new learning architecture consisting mainly of linear layers and claimed it to be comparable, or even superior, to convolution-based models. This sparked immediate debate in both the academic and industrial communities as to whether MLPs are sufficient, with many thinking that learning architectures are returning to MLPs. Is this true? In this perspective, we give a brief history of learning architectures, including multilayer perceptrons (MLPs), convolutional neural networks (CNNs), and transformers. We then examine what the four newly proposed architectures have in common. Finally, we give our views on challenges and directions for new learning architectures, hoping to inspire future research.

updated: Mon May 31 2021 16:08:46 GMT+0000 (UTC)

published: Mon May 31 2021 16:08:46 GMT+0000 (UTC)

arXiv
