Unsupervised Local Discrimination for Medical Images

Huai Chen; Renzhen Wang; Xiuying Wang; Jieyu Li; Qu Fang; Hui Li; Jianhao Bai; Qing Peng; Deyu Meng; Lisheng Wang

医用画像の教師なし局所識別

ラベル付けされていない画像から一般的な表現をキャプチャして医療分析モデルを初期化することを目的とする対照学習は、高価な注釈に対する高い需要を軽減するのに効果的であることが証明されています。現在の方法は、主にインスタンスごとの比較に焦点を当てて、グローバルな識別機能を学習しますが、小さな解剖学的構造、病変、および組織を区別するためにローカルの詳細を先送りします。この課題に対処するために、この論文では、局所弁別 (LD) と呼ばれる一般的な教師なし表現学習フレームワークを提案し、意味的に類似したピクセルを密接に埋め込み、異なる画像間で類似構造の領域を識別することによって、医療画像の局所弁別機能を学習します。具体的には、このモデルには、ピクセル単位の埋め込みのための埋め込みモジュールと、セグメンテーションを生成するためのクラスタリングモジュールが装備されています。そして、これら2つのモジュールは、相互に有益なメカニズムで新しい領域弁別損失関数を最適化することで統合されます。これにより、モデルは構造情報を反映し、ピクセルごとおよび領域ごとの類似性を測定できます。さらに、LD に基づいて、中心に敏感なワンショットランドマークローカリゼーションアルゴリズムと、モデルの一般化可能性を促進する形状ガイド付きクロスモダリティセグメンテーションモデルを提案します。ダウンストリームタスクに転送されると、私たちの方法で学習した表現は、18 の最先端 (SOTA) メソッドからの表現よりも優れた一般化を示し、12 のダウンストリームタスクすべてから 9 つを獲得しました。特に挑戦的な病変セグメンテーションタスクの場合、提案された方法は大幅に優れたパフォーマンスを実現します。ソースコードは、https://github.com/HuaiChen-1994/LDLearning で公開されています。

Contrastive learning, which aims to capture general representation from unlabeled images to initialize the medical analysis models, has been proven effective in alleviating the high demand for expensive annotations. Current methods mainly focus on instance-wise comparisons to learn the global discriminative features, however, pretermitting the local details to distinguish tiny anatomical structures, lesions, and tissues. To address this challenge, in this paper, we propose a general unsupervised representation learning framework, named local discrimination (LD), to learn local discriminative features for medical images by closely embedding semantically similar pixels and identifying regions of similar structures across different images. Specifically, this model is equipped with an embedding module for pixel-wise embedding and a clustering module for generating segmentation. And these two modules are unified through optimizing our novel region discrimination loss function in a mutually beneficial mechanism, which enables our model to reflect structure information as well as measure pixel-wise and region-wise similarity. Furthermore, based on LD, we propose a center-sensitive one-shot landmark localization algorithm and a shape-guided cross-modality segmentation model to foster the generalizability of our model. When transferred to downstream tasks, the learned representation by our method shows a better generalization, outperforming representation from 18 state-of-the-art (SOTA) methods and winning 9 out of all 12 downstream tasks. Especially for the challenging lesion segmentation tasks, the proposed method achieves significantly better performances. The source codes are publicly available at https://github.com/HuaiChen-1994/LDLearning.

updated: Wed Aug 17 2022 07:54:02 GMT+0000 (UTC)

published: Sat Aug 21 2021 04:53:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト