TransNuSeg: A Lightweight Multi-Task Transformer for Nuclei Segmentation

Zhenqi He; Mathias Unberath; Jing Ke; Yiqing Shen

TransNuSeg: 核分割用の軽量マルチタスクトランスフォーマー

核はサイズが小さいように見えますが、実際の臨床現場では、全体的な空間情報と、核と背景の間の色や明るさのコントラストの相関関係が、正確な核セグメンテーションにとって重要な要素と考えられています。しかし、自動原子核セグメンテーションの分野は畳み込みニューラルネットワーク (CNN) によって独占されており、一方、ローカルとグローバルの相関関係を捕捉するのに強力な、最近普及しているトランスフォーマーの可能性は十分に研究されていません。この目的を達成するために、我々は、TransNuSeg と呼ばれる、核セグメンテーションのための純粋な Transformer フレームワークに初めての試みを行います。以前の研究とは異なり、我々は、困難な核セグメンテーションタスクを固有のマルチタスク学習タスクに分離し、核インスタンス、核エッジ、クラスター化エッジのセグメンテーションにそれぞれトライデコーダ構造を採用しています。以前の研究での異なるブランチからの発散予測を排除するために、新しい自己蒸留損失が導入され、ブランチ間の整合性規制が明示的に課されます。さらに、ブランチ間の高い相関を定式化し、パラメータの数を減らすために、トリデコーダ間でセルフアテンションヘッドを部分的に共有することによって効率的なアテンション共有スキームを提案します。最後に、過剰にパラメータ化された Transformer のボトルネックがトークン MLP ボトルネックに置き換わることで、モデルの複雑さがさらに軽減されます。 MoNuSeg を含む、異なるモダリティの 2 つのデータセットでの実験により、私たちの手法が CA2.5-Net などの最先端の対応物よりも 30% 少ないパラメータで 2 ～ 3% の Dice で優れたパフォーマンスを発揮できることが示されました。結論として、TransNuSeg は、核セグメンテーションのコンテキストにおける Transformer の強みを裏付けており、実際の臨床現場での効率的なソリューションとして機能します。コードは https://github.com/zhenqi-he/transnuseg で入手できます。

Nuclei appear small in size, yet, in real clinical practice, the global spatial information and correlation of the color or brightness contrast between nuclei and background, have been considered a crucial component for accurate nuclei segmentation. However, the field of automatic nuclei segmentation is dominated by Convolutional Neural Networks (CNNs), meanwhile, the potential of the recently prevalent Transformers has not been fully explored, which is powerful in capturing local-global correlations. To this end, we make the first attempt at a pure Transformer framework for nuclei segmentation, called TransNuSeg. Different from prior work, we decouple the challenging nuclei segmentation task into an intrinsic multi-task learning task, where a tri-decoder structure is employed for nuclei instance, nuclei edge, and clustered edge segmentation respectively. To eliminate the divergent predictions from different branches in previous work, a novel self distillation loss is introduced to explicitly impose consistency regulation between branches. Moreover, to formulate the high correlation between branches and also reduce the number of parameters, an efficient attention sharing scheme is proposed by partially sharing the self-attention heads amongst the tri-decoders. Finally, a token MLP bottleneck replaces the over-parameterized Transformer bottleneck for a further reduction in model complexity. Experiments on two datasets of different modalities, including MoNuSeg have shown that our methods can outperform state-of-the-art counterparts such as CA2.5-Net by 2-3% Dice with 30% fewer parameters. In conclusion, TransNuSeg confirms the strength of Transformer in the context of nuclei segmentation, which thus can serve as an efficient solution for real clinical practice. Code is available at https://github.com/zhenqi-he/transnuseg.

updated: Sun Jul 16 2023 14:12:54 GMT+0000 (UTC)

published: Sun Jul 16 2023 14:12:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト