Dilated-UNet: A Fast and Accurate Medical Image Segmentation Approach using a Dilated Transformer and U-Net Architecture

Davoud Saadati; Omid Nejati Manzari; Sattar Mirzakuchaki

Dilated-UNet: Dilated Transformer と U-Net アーキテクチャを使用した高速で正確な医用画像セグメンテーションアプローチ

医用画像のセグメンテーションは、コンピュータ支援の診断および治療システムの開発に不可欠ですが、依然として多くの困難に直面しています。近年、一般的に使用されている CNN に基づくエンコーダー/デコーダーアーキテクチャは、医療画像のセグメンテーションに効果的に適用されていますが、グローバルコンテキストと空間関係の学習に関しては限界があります。一部の研究者は、トランスフォーマーをデコーダーとエンコーダーの両方のコンポーネントに組み込むことを試みており、有望な結果が得られていますが、このアプローチは計算が非常に複雑であるため、さらに改善する必要があります。このホワイトペーパーでは、Dilated Transformer ブロックを U-Net アーキテクチャと組み合わせて、正確かつ高速な医用画像セグメンテーションを実現する Dilated-UNet について説明します。画像パッチはトークンに変換され、U 字型のエンコーダー/デコーダーアーキテクチャに供給されます。ローカルとグローバルのセマンティックな特徴を学習するためのスキップ接続が使用されます。エンコーダーは、Neighborhood Attention と Dilated Neighborhood Attention Transformer を組み合わせた階層型 Dilated Transformer を使用して、ローカルおよびスパースグローバルアテンションを抽出します。私たちの実験結果は、ISIC や Synapse などのいくつかの困難な医療画像セグメンテーションデータセットで、Dilated-UNet が他のモデルよりも優れていることを示しています。

Medical image segmentation is crucial for the development of computer-aided diagnostic and therapeutic systems, but still faces numerous difficulties. In recent years, the commonly used encoder-decoder architecture based on CNNs has been applied effectively in medical image segmentation, but has limitations in terms of learning global context and spatial relationships. Some researchers have attempted to incorporate transformers into both the decoder and encoder components, with promising results, but this approach still requires further improvement due to its high computational complexity. This paper introduces Dilated-UNet, which combines a Dilated Transformer block with the U-Net architecture for accurate and fast medical image segmentation. Image patches are transformed into tokens and fed into the U-shaped encoder-decoder architecture, with skip-connections for local-global semantic feature learning. The encoder uses a hierarchical Dilated Transformer with a combination of Neighborhood Attention and Dilated Neighborhood Attention Transformer to extract local and sparse global attention. The results of our experiments show that Dilated-UNet outperforms other models on several challenging medical image segmentation datasets, such as ISIC and Synapse.

updated: Sat Apr 22 2023 17:20:13 GMT+0000 (UTC)

published: Sat Apr 22 2023 17:20:13 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト