FoPro-KD: Fourier Prompted Effective Knowledge Distillation for Long-Tailed Medical Image Recognition

Marawan Elbatel; Robert Martí; Xiaomeng Li

FoPro-KD: ロングテール医療画像認識のためのフーリエプロンプトによる効果的な知識の蒸留

転移学習は、医療画像分類、特にロングテールデータセットにとって有望な技術です。ただし、医療画像分野ではデータが不足しているため、公開されている大規模な事前トレーニング済みモデルを微調整する際に過剰なパラメーター化が発生することがよくあります。さらに、これらの大規模なモデルは、計算コストがかかるため、臨床現場での展開には効果的ではありません。これらの課題に対処するために、私たちは FoPro-KD を提案します。これは、公開されている凍結済みの事前トレーニング済みモデルから学習した周波数パターンの力を解き放ち、その伝達性と圧縮を強化する新しいアプローチです。 FoPro-KD は、フーリエプロンプトジェネレーター (FPG)、有効知識蒸留 (EKD)、および敵対的知識蒸留 (AKD) の 3 つのモジュールで構成されます。 FPG モジュールは、自然画像でトレーニングされた、フリーズされた事前トレーニング済みモデルの表現を探索しながら、ターゲットデータセットを条件としたターゲット摂動を生成する方法を学習します。 EKD モジュールは、より小さなターゲットモデルへの蒸留を通じてこれらの一般化可能な表現を活用し、AKD モジュールは蒸留プロセスをさらに強化します。これらのモジュールを通じて、FoPro-KD はロングテール医用画像分類ベンチマークのパフォーマンスの大幅な向上を達成し、事前トレーニングされたモデルから学習した周波数パターンを活用して、実現可能な展開に向けて大規模な事前トレーニングされたモデルの転移学習と圧縮を強化できる可能性を示しています。。

Transfer learning is a promising technique for medical image classification, particularly for long-tailed datasets. However, the scarcity of data in medical imaging domains often leads to overparameterization when fine-tuning large publicly available pre-trained models. Moreover, these large models are ineffective in deployment in clinical settings due to their computational expenses. To address these challenges, we propose FoPro-KD, a novel approach that unleashes the power of frequency patterns learned from frozen publicly available pre-trained models to enhance their transferability and compression. FoPro-KD comprises three modules: Fourier prompt generator (FPG), effective knowledge distillation (EKD), and adversarial knowledge distillation (AKD). The FPG module learns to generate targeted perturbations conditional on a target dataset, exploring the representations of a frozen pre-trained model, trained on natural images. The EKD module exploits these generalizable representations through distillation to a smaller target model, while the AKD module further enhances the distillation process. Through these modules, FoPro-KD achieves significant improvements in performance on long-tailed medical image classification benchmarks, demonstrating the potential of leveraging the learned frequency patterns from pre-trained models to enhance transfer learning and compression of large pre-trained models for feasible deployment.

updated: Sat May 27 2023 09:01:21 GMT+0000 (UTC)

published: Sat May 27 2023 09:01:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト