Advanced Data Augmentation Approaches: A Comprehensive Survey and Future Directions

Teerath Kumar; Alessandra Mileo; Rob Brennan; Malika Bendechache

Deep learning (DL) algorithms have shown strong performance in various computer vision tasks. However, limited labelled data leads to network overfitting, where the network performs worse on unseen data than on training data, which in turn limits performance improvement. To cope with this problem, various techniques have been proposed, such as dropout, normalization and advanced data augmentation. Among these, data augmentation, which aims to enlarge the dataset by adding sample diversity, has been a hot topic in recent times. In this article, we focus on advanced data augmentation techniques. We provide a background of data augmentation, a novel and comprehensive taxonomy of the reviewed data augmentation techniques, and the strengths and weaknesses (wherever possible) of each technique. We also provide comprehensive results on the effect of data augmentation on three popular computer vision tasks: image classification, object detection and semantic segmentation. For reproducibility of results, we compiled the available code for all data augmentation techniques. Finally, we discuss the challenges and difficulties, and possible future directions for the research community. We believe this survey provides several benefits: i) readers will understand how data augmentation works to fix overfitting problems; ii) the results will save researchers time when searching for comparisons; iii) code for the mentioned data augmentation techniques is available at https://github.com/kmr2017/Advanced-Data-augmentation-codes; iv) the future work will spark interest in the research community.
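As a rough illustration of what "enlarging the dataset by adding sample diversity" can mean in practice, the sketch below applies two simple label-preserving transforms (a horizontal flip and a cutout-style random erase) to a toy image stored as nested lists. The function names, the toy image and the patch size are assumptions made for this example only; they are not taken from the survey or from its code repository, and real pipelines would typically operate on image tensors instead.

```python
import random


def horizontal_flip(image):
    """Return a left-right mirrored copy of a 2D image (list of rows)."""
    return [row[::-1] for row in image]


def cutout(image, size, rng, fill=0):
    """Return a copy with one randomly placed size x size square set to fill."""
    height, width = len(image), len(image[0])
    top = rng.randrange(height - size + 1)
    left = rng.randrange(width - size + 1)
    out = [row[:] for row in image]  # copy rows so the input stays untouched
    for r in range(top, top + size):
        for c in range(left, left + size):
            out[r][c] = fill
    return out


def augment_dataset(samples, rng):
    """Keep each (image, label) pair and add two label-preserving variants."""
    augmented = []
    for image, label in samples:
        augmented.append((image, label))
        augmented.append((horizontal_flip(image), label))
        augmented.append((cutout(image, 2, rng), label))
    return augmented


# Hypothetical 4x4 "image" with a made-up label, for illustration only.
rng = random.Random(0)
image = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
dataset = augment_dataset([(image, "cat")], rng)
print(len(dataset))  # one original sample plus two augmented variants
```

Each original sample yields three training samples that share a label, which is the basic sense in which augmentation increases the effective dataset size without new annotation.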

updated: Thu Mar 02 2023 16:49:52 GMT+0000 (UTC)

published: Sat Jan 07 2023 11:37:32 GMT+0000 (UTC)

arXiv
