SAR-U-Net: squeeze-and-excitation block and atrous spatial pyramid pooling based residual U-Net for automatic liver segmentation in Computed Tomography

Jinke Wang; Peiqing Lv; Haiying Wang; Changfa Shi

SAR-U-Net: コンピューター断層撮影における自動肝臓セグメンテーションのための、圧搾および励起ブロックおよびアトラス空間ピラミッドプーリングベースの残差 U-Net

背景と目的: このホワイトペーパーでは、Squeeze-and-Excitation (SE) ブロック、Atrous Spatial Pyramid Pooling (ASPP)、および正確で堅牢な肝臓 CT セグメンテーションのための残差学習の手法を活用する、修正された U-Net ベースのフレームワークが提示されます。提案された方法の有効性は、2 つの公開データセット LiTS17 と SLiver07 でテストされました。方法: SAR-U-Net と呼ばれる新しいネットワークアーキテクチャが設計されました。まず、SE ブロックが導入され、U-Net エンコーダーの各畳み込み後に画像の特徴を適応的に抽出し、無関係な領域を抑制し、特定のセグメンテーションタスクの特徴を強調します。次に、ASPP を使用して遷移層と出力層を置き換え、さまざまな受容野を介してマルチスケール画像情報を取得しました。第三に、劣化の問題を軽減するために、従来の畳み込みブロックが残差ブロックに置き換えられ、ネットワークが大幅に深さを増すことで精度が向上するようになりました。結果: LiTS17 実験では、ダイス、VOE、RVD、ASD、MSD の平均値は、それぞれ 95.71、9.52、-0.84、1.54、29.14 でした。他の密接に関連する 2D ベースのモデルと比較して、提案された方法は最高の精度を達成しました。 SLiver07 の実験では、ダイス、VOE、RVD、ASD、MSD の平均値は、それぞれ 97.31、5.37、-1.08、1.85、27.45 でした。他の密接に関連するモデルと比較して、提案された方法は、RVD を除いて最高のセグメンテーション精度を達成しました。結論: 提案されたモデルは、2D ベースのモデルと比較して精度が大幅に向上し、小さな肝臓領域、不連続な肝臓領域、ぼやけた肝臓境界などの困難な問題を回避する際の堅牢性も十分に実証され、検証されています。

Background and objective: In this paper, a modified U-Net based framework is presented, which leverages techniques from Squeeze-and-Excitation (SE) block, Atrous Spatial Pyramid Pooling (ASPP) and residual learning for accurate and robust liver CT segmentation, and the effectiveness of the proposed method was tested on two public datasets LiTS17 and SLiver07. Methods: A new network architecture called SAR-U-Net was designed. Firstly, the SE block is introduced to adaptively extract image features after each convolution in the U-Net encoder, while suppressing irrelevant regions, and highlighting features of specific segmentation task; Secondly, ASPP was employed to replace the transition layer and the output layer, and acquire multi-scale image information via different receptive fields. Thirdly, to alleviate the degradation problem, the traditional convolution block was replaced with the residual block and thus prompt the network to gain accuracy from considerably increased depth. Results: In the LiTS17 experiment, the mean values of Dice, VOE, RVD, ASD and MSD were 95.71, 9.52, -0.84, 1.54 and 29.14, respectively. Compared with other closely related 2D-based models, the proposed method achieved the highest accuracy. In the experiment of the SLiver07, the mean values of Dice, VOE, RVD, ASD and MSD were 97.31, 5.37, -1.08, 1.85 and 27.45, respectively. Compared with other closely related models, the proposed method achieved the highest segmentation accuracy except for the RVD. Conclusion: The proposed model enables a great improvement on the accuracy compared to 2D-based models, and its robustness in circumvent challenging problems, such as small liver regions, discontinuous liver regions, and fuzzy liver boundaries, is also well demonstrated and validated.

updated: Sat Jul 17 2021 03:40:40 GMT+0000 (UTC)

published: Thu Mar 11 2021 02:32:59 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト