Efficient Representation Learning for Healthcare with Cross-Architectural Self-Supervision

Pranav Singh; Jacopo Cirrone

クロスアーキテクチャーの自己監視による医療向けの効率的な表現学習

ヘルスケアおよび生物医学アプリケーションでは、極端な計算要件が表現学習の導入に大きな障壁となります。表現学習は、限られた医療データから有用な事前分布を学習することで、深層学習アーキテクチャのパフォーマンスを向上させることができます。ただし、最先端の自己教師あり手法では、臨床現場でより実用的な、より小さいバッチサイズまたはより短い事前トレーニングエポックを使用すると、パフォーマンスが低下します。私たちは、この課題に応えるための Cross Architectural - Self Supervision (CASS) を提案します。この新しいシャム自己教師あり学習アプローチは、効率的な学習のためにトランスフォーマーと畳み込みニューラルネットワーク (CNN) を相乗的に活用します。私たちの経験的評価は、CASS でトレーニングされた CNN とトランスフォーマーが、4 つの多様な医療データセットにわたって既存の自己教師あり学習方法よりも優れたパフォーマンスを発揮することを示しています。 CASS は、微調整用のラベル付きデータがわずか 1% であるだけで、平均 3.8% の改善を達成します。ラベル付きデータが 10% の場合、5.9% 増加します。 100% ラベル付けされたデータでは、10.13% という驚くべき向上に達します。特に、CASS は最先端の方法と比較して事前トレーニング時間を 69% 短縮し、臨床での実装に適しています。また、CASS はバッチサイズや事前トレーニングエポックの変動に対してかなり堅牢であるため、医療アプリケーションにおける機械学習の候補として適していることも実証しました。

In healthcare and biomedical applications, extreme computational requirements pose a significant barrier to adopting representation learning. Representation learning can enhance the performance of deep learning architectures by learning useful priors from limited medical data. However, state-of-the-art self-supervised techniques suffer from reduced performance when using smaller batch sizes or shorter pretraining epochs, which are more practical in clinical settings. We present Cross Architectural - Self Supervision (CASS) in response to this challenge. This novel siamese self-supervised learning approach synergistically leverages Transformer and Convolutional Neural Networks (CNN) for efficient learning. Our empirical evaluation demonstrates that CASS-trained CNNs and Transformers outperform existing self-supervised learning methods across four diverse healthcare datasets. With only 1% labeled data for finetuning, CASS achieves a 3.8% average improvement; with 10% labeled data, it gains 5.9%; and with 100% labeled data, it reaches a remarkable 10.13% enhancement. Notably, CASS reduces pretraining time by 69% compared to state-of-the-art methods, making it more amenable to clinical implementation. We also demonstrate that CASS is considerably more robust to variations in batch size and pretraining epochs, making it a suitable candidate for machine learning in healthcare applications.

updated: Sat Aug 19 2023 15:57:19 GMT+0000 (UTC)

published: Sat Aug 19 2023 15:57:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト