Memory-aware curriculum federated learning for breast cancer classification

Amelia Jiménez-Sánchez; Mickael Tardy; Miguel A. González Ballester; Diana Mateus; Gemma Piella

For early breast cancer detection, regular screening with mammography imaging is recommended. Routine examinations result in datasets dominated by negative samples. A potential solution to this class imbalance is to join forces across multiple institutions. Developing a collaborative computer-aided diagnosis system is challenging in several ways. Patient privacy and regulations need to be carefully respected. Data across institutions may be acquired with different devices or imaging protocols, leading to heterogeneous non-IID data. Also, learning-based methods require new optimization strategies that work on distributed data. Recently, federated learning has emerged as an effective tool for collaborative learning. In this setting, local models perform computation on their private data to update the global model. The order and the frequency of local updates influence the final global model. Hence, the order in which samples are locally presented to the optimizers plays an important role. In this work, we define a memory-aware curriculum learning method for the federated setting. Our curriculum controls the order of the training samples, paying special attention to those that are forgotten after the deployment of the global model. Our approach is combined with unsupervised domain adaptation to deal with domain shift while preserving data privacy. We evaluate our method with three clinical datasets from different vendors. Our results verify the effectiveness of federated adversarial learning for multi-site breast cancer classification. Moreover, we show that our proposed memory-aware curriculum method further improves classification performance. Our code is publicly available at: https://github.com/ameliajimenez/curriculum-federated-learning.
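As a rough illustration of the memory-aware ordering idea, the sketch below treats a sample as "forgotten" when the local model classified it correctly before aggregation but the newly deployed global model misclassifies it, and then presents forgotten samples first. This is only one possible reading of the abstract: the function names `forgotten_mask` and `curriculum_order`, the binary forgotten/not-forgotten split, and the shuffle within each group are assumptions made for this example, and the scoring and scheduling used in the paper and its repository may differ.

```python
import numpy as np


def forgotten_mask(labels, local_preds, global_preds):
    """Flag samples the local model got right but the global model gets wrong.

    Hypothetical helper: the definition of "forgotten" is an assumption.
    """
    labels = np.asarray(labels)
    was_correct = np.asarray(local_preds) == labels
    now_wrong = np.asarray(global_preds) != labels
    return was_correct & now_wrong


def curriculum_order(labels, local_preds, global_preds, rng=None):
    """Return a permutation of sample indices with forgotten samples first.

    Order inside each group is shuffled (an assumption for this sketch).
    """
    rng = np.random.default_rng(rng)
    forgotten = forgotten_mask(labels, local_preds, global_preds)
    idx = np.arange(len(forgotten))
    first = rng.permutation(idx[forgotten])   # forgotten samples
    rest = rng.permutation(idx[~forgotten])   # all remaining samples
    return np.concatenate([first, rest])


# Toy example with made-up predictions for five samples.
labels = [1, 0, 1, 0, 1]
local_preds = [1, 0, 0, 0, 1]    # local model before aggregation
global_preds = [0, 0, 0, 1, 1]   # global model after deployment
order = curriculum_order(labels, local_preds, global_preds, rng=0)
```

In this toy case samples 0 and 3 are flagged as forgotten, so they occupy the first two positions of `order`; a local training loop could then iterate over its private data in that order for the next round.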

updated: Fri Jan 06 2023 15:04:36 GMT+0000 (UTC)

published: Tue Jul 06 2021 09:50:20 GMT+0000 (UTC)

arXiv
