Augmentations: An Insight into their Effectiveness on Convolution Neural Networks

Sabeesh Ethiraj; Bharath Kumar Bolla

Augmentations are a key factor in the performance of any neural network, as they give a model a critical edge. Their ability to boost a model's robustness depends on two factors, namely the model architecture and the type of augmentations. Augmentations are very specific to a dataset, and not every kind of augmentation necessarily has a positive effect on a model's performance. Hence there is a need to identify augmentations that perform consistently well across a variety of datasets and remain invariant to the type of architecture, the convolutions, and the number of parameters used. This paper evaluates the effect of parameter count, using 3x3 and depth-wise separable convolutions, on different augmentation techniques on the MNIST, FMNIST, and CIFAR10 datasets. Statistical evidence shows that techniques such as Cutout and random horizontal flip were consistent on both parametrically low and high architectures. Depth-wise separable convolutions outperformed 3x3 convolutions at higher parameters due to their ability to create deeper networks. Augmentations bridged the accuracy gap between the 3x3 and depth-wise separable convolutions, thus establishing their role in model generalization. A higher number of augmentations did not produce a significant change in performance. The synergistic effect of multiple augmentations at higher parameters, with an antagonistic effect at lower parameters, was also evaluated. The work shows that a delicate balance between architectural supremacy and augmentations needs to be achieved to enhance a model's performance in any given deep learning task.
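The abstract contrasts 3x3 and depth-wise separable convolutions by parameter count and names Cutout and random horizontal flip as the consistent augmentations. The sketch below illustrates both ideas in plain Python. It is not the authors' code: the function names, the 64 and 128 channel sizes, the 8x8 image, and the single-square Cutout variant are all assumptions chosen for illustration.

```python
import random


def conv_params(c_in, c_out, k=3, bias=False):
    """Weight count of a standard k x k convolution."""
    return k * k * c_in * c_out + (c_out if bias else 0)


def depthwise_separable_params(c_in, c_out, k=3, bias=False):
    """One k x k filter per input channel, followed by a 1 x 1 pointwise convolution."""
    depthwise = k * k * c_in + (c_in if bias else 0)
    pointwise = c_in * c_out + (c_out if bias else 0)
    return depthwise + pointwise


def horizontal_flip(image):
    """Mirror a 2D image (a list of rows) left to right; returns a new image."""
    return [row[::-1] for row in image]


def cutout(image, size, rng, fill=0):
    """Mask one size x size square centred at a random pixel, clipped at the borders.

    Simplified single-square variant (an assumption); returns a new image.
    """
    h, w = len(image), len(image[0])
    cy, cx = rng.randrange(h), rng.randrange(w)
    top, bottom = max(0, cy - size // 2), min(h, cy + size // 2)
    left, right = max(0, cx - size // 2), min(w, cx + size // 2)
    out = [row[:] for row in image]
    for y in range(top, bottom):
        for x in range(left, right):
            out[y][x] = fill
    return out


# Illustrative layer sizes (assumed, not taken from the paper).
print(conv_params(64, 128))                 # 73728
print(depthwise_separable_params(64, 128))  # 8768

# Apply both augmentations to a toy 8 x 8 image of ones.
rng = random.Random(0)
image = [[1] * 8 for _ in range(8)]
augmented = cutout(horizontal_flip(image), size=4, rng=rng)
```

For the same channel sizes, the depth-wise separable layer needs roughly an eighth of the weights, which is consistent with the abstract's remark that such layers allow deeper networks at a given parameter budget.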

updated: Mon May 09 2022 06:36:40 GMT+0000 (UTC)

published: Mon May 09 2022 06:36:40 GMT+0000 (UTC)

arXiv
