Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images

Mehdi Cherti; Jenia Jitsev

自然および医療用X線胸部画像のドメイン内およびドメイン間の完全および少数ショット転送学習に対する事前トレーニングスケールの影響

事前トレーニングでモデル、データ、および計算予算の規模を拡大すると、モデルの一般化が大幅に改善され、言語モデリングと自然画像認識で行われる膨大な作業で学習が転移することが示されています。ただし、大規模なプラスの効果に関するほとんどの研究は、ソースデータとターゲットデータが近接しているドメイン内設定の範囲で行われました。フルショット転送と少数ショット転送を実行するときのドメイン内設定とドメイン外設定の両方でより大きなスケールの影響を調べるために、ここで初めて大規模で公開されている医療用 X 線胸部画像データセットを組み合わせて、次のスケールに到達します。 ImageNet-1k に匹敵する医用画像ドメイン。自然画像ドメインでの事前トレーニングに日常的に使用されます。次に、ネットワークのサイズとソースデータの規模とドメインを変化させながら、監視付きの事前トレーニングを実施し、大規模な自然 (ImageNet-1k/21k) または大規模な医療用胸部 X 線データセットのいずれかであり、事前トレーニング済みのモデルを別の自然または医療に転送します。ターゲット。ドメイン内の自然から自然への移行、および医療から医療への移行の事前トレーニングの規模が大きくなったため、大幅な改善が見られました。ドメイン間の自然医療の移行については、フルショット体制での大きな X 線ターゲットでの事前トレーニングスケールの拡大による改善が見られますが、小さなターゲットや少数ショット体制では、改善は見られません。驚くべきことに、非常に大規模な自然 ImageNet-21k で事前トレーニングされた大規模ネットワークは、大規模な X 線ターゲットへの転送を実行する場合、利用可能な最大の医療用 X 線データで事前トレーニングされたネットワークと同等かそれ以上です。事前トレーニングでモデルと一般的な医療分野に依存しない自然画像ソースデータの規模を大幅に拡大すると、医療分野固有のターゲットへの高品質のドメイン外転送が可能になり、多くの場合、大規模な医療分野固有のソースデータへの依存がなくなります。実務では利用できません。

Increasing model, data and compute budget scale in the pre-training has been shown to strongly improve model generalization and transfer learning in vast line of work done in language modeling and natural image recognition. However, most studies on the positive effect of larger scale were done in scope of in-domain setting, with source and target data being in close proximity. To study effect of larger scale for both in-domain and out-of-domain setting when performing full and few-shot transfer, we combine here for the first time large, openly available medical X-Ray chest imaging datasets to reach a scale for medical imaging domain comparable to ImageNet-1k, routinely used for pre-training in natural image domain. We then conduct supervised pre-training, while varying network size and source data scale and domain, being either large natural (ImageNet-1k/21k) or large medical chest X-Ray datasets, and transfer pre-trained models to different natural or medical targets. We observe strong improvement due to larger pre-training scale for intra-domain natural-natural and medical-medical transfer. For inter-domain natural-medical transfer, we find improvements due to larger pre-training scale on larger X-Ray targets in full shot regime, while for smaller targets and for few-shot regime the improvement is not visible. Remarkably, large networks pre-trained on very large natural ImageNet-21k are as good or better than networks pre-trained on largest available medical X-Ray data when performing transfer to large X-Ray targets. We conclude that substantially increasing model and generic, medical domain-agnostic natural image source data scale in the pre-training can enable high quality out-of-domain transfer to medical domain specific targets, removing dependency on large medical domain-specific source data often not available in the practice.

updated: Mon Dec 19 2022 00:47:51 GMT+0000 (UTC)

published: Mon May 31 2021 21:55:56 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト