Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement

Fartash Faghri; Hadi Pouransari; Sachin Mehta; Mehrdad Farajtabar; Ali Farhadi; Mohammad Rastegari; Oncel Tuzel

We propose Dataset Reinforcement, a strategy to improve a dataset once such that the accuracy of any model architecture trained on the reinforced dataset improves at no additional training cost for users. Our strategy is based on data augmentation and knowledge distillation. It is generic: we designed it from an extensive analysis across CNN- and transformer-based models and a large-scale study of distillation using state-of-the-art models with various data augmentations. We create a reinforced version of the ImageNet training dataset, called ImageNet+, as well as the reinforced datasets CIFAR-100+, Flowers-102+, and Food-101+. Models trained on ImageNet+ are more accurate, more robust, and better calibrated, and they transfer well to downstream tasks (e.g., segmentation and detection). As an example, the accuracy of ResNet-50 improves by 1.7% on the ImageNet validation set, 3.5% on ImageNetV2, and 10.0% on ImageNet-R. Expected Calibration Error (ECE) on the ImageNet validation set is also reduced by 9.9%. Using this backbone with Mask-RCNN for object detection on MS-COCO improves the mean average precision by 0.8%. We reach similar gains for MobileNets, ViTs, and Swin-Transformers. For MobileNetV3 and Swin-Tiny, robustness on ImageNet-R/A/C improves by up to 10%. Models pretrained on ImageNet+ and fine-tuned on CIFAR-100+, Flowers-102+, and Food-101+ reach up to 3.4% higher accuracy.
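
The abstract says only that the strategy combines data augmentation with knowledge distillation and that the dataset is improved once. The following is a minimal sketch of one way such a pipeline could look, not the authors' implementation. It assumes that "reinforcing" means running a teacher on augmented samples a single time and storing the augmentation parameters together with the teacher's soft labels, so that later training needs no teacher. All names here (`toy_teacher`, `augment`, `reinforce_dataset`, `sample_training_pair`) are hypothetical, and the teacher and augmentation are toy stand-ins.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def augment(x, params):
    """Toy augmentation (assumption): optional flip plus a constant shift."""
    out = list(reversed(x)) if params["flip"] else list(x)
    return [v + params["shift"] for v in out]


def toy_teacher(x):
    """Hypothetical stand-in for a strong pretrained teacher (3 classes)."""
    logits = [sum(x), x[0] - x[-1], -sum(x)]
    return softmax(logits)


def reinforce_dataset(samples, teacher, num_augs=2, seed=0):
    """One-time pass: store augmentation parameters and teacher soft labels."""
    rng = random.Random(seed)
    reinforced = []
    for x, label in samples:
        entries = []
        for _ in range(num_augs):
            params = {"flip": rng.random() < 0.5, "shift": rng.uniform(-0.1, 0.1)}
            entries.append((params, teacher(augment(x, params))))
        reinforced.append({"x": x, "label": label, "reinforcements": entries})
    return reinforced


def sample_training_pair(record, rng):
    """Training time: replay a stored augmentation; no teacher call is needed."""
    params, soft_label = rng.choice(record["reinforcements"])
    return augment(record["x"], params), soft_label


def soft_cross_entropy(student_probs, teacher_probs, eps=1e-12):
    """Distillation-style loss of student predictions against stored soft labels."""
    return -sum(t * math.log(s + eps) for s, t in zip(student_probs, teacher_probs))


# Toy data: (features, hard label) pairs.
samples = [([0.2, 0.5, 0.1], 0), ([-0.3, 0.0, 0.4], 2)]
reinforced = reinforce_dataset(samples, toy_teacher)
x_aug, soft_label = sample_training_pair(reinforced[0], random.Random(1))
```

In this sketch the teacher's cost is paid once inside `reinforce_dataset`. A student would then train on `(x_aug, soft_label)` pairs with `soft_cross_entropy`, which is one reading of the "no additional training cost for users" claim.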

updated: Wed Mar 15 2023 23:10:17 GMT+0000 (UTC)

published: Wed Mar 15 2023 23:10:17 GMT+0000 (UTC)

arXiv
