MUVF-YOLOX: A Multi-modal Ultrasound Video Fusion Network for Renal Tumor Diagnosis

Junyu Li; Han Huang; Dong Ni; Wufeng Xue; Dongmei Zhu; Jun Cheng

Early diagnosis of renal cancer can greatly improve patient survival. Contrast-enhanced ultrasound (CEUS) is a cost-effective, non-invasive imaging technique that is increasingly used for renal tumor diagnosis. However, classifying renal tumors as benign or malignant remains very challenging because of the highly heterogeneous appearance of cancer and imaging artifacts. Our aim is to detect and classify renal tumors by integrating B-mode and CEUS-mode ultrasound videos. To this end, we propose a novel multi-modal ultrasound video fusion network that performs multi-modal feature fusion and video classification for renal tumor diagnosis. The attention-based multi-modal fusion module uses cross-attention and self-attention to extract modality-invariant and modality-specific features in parallel. In addition, we design an object-level temporal aggregation (OTA) module that automatically filters low-quality features and efficiently integrates temporal information from multiple frames to improve the accuracy of tumor diagnosis. Experimental results on a multicenter dataset show that the proposed framework outperforms both single-modal models and competing methods. Furthermore, our OTA module achieves higher classification accuracy than frame-level predictions. Our code is available at https://github.com/JeunyuLi/MUAF.
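The two ideas named in the abstract, attention-based fusion of two modalities and aggregation of per-frame results over time, can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation: the function names (`attention`, `fuse`, `aggregate_frames`), the concatenation step, and the `min_conf` threshold are all hypothetical. In particular, the confidence-weighted average below is a simple stand-in for the OTA module, whose actual design is not described in the abstract.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    # Scaled dot-product attention over lists of equal-length vectors.
    scale = math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[i] for w, v in zip(weights, values))
                    for i in range(len(values[0]))])
    return out

def fuse(bmode, ceus):
    # Assumption: cross-attention (B-mode queries, CEUS keys/values) and
    # self-attention (B-mode only) are computed side by side, then the two
    # feature vectors of each token are concatenated.
    cross = attention(bmode, ceus, ceus)
    self_feats = attention(bmode, bmode, bmode)
    return [c + s for c, s in zip(cross, self_feats)]

def aggregate_frames(frame_probs, frame_conf, min_conf=0.5):
    # Stand-in for temporal aggregation: drop frames whose confidence is
    # below min_conf, then take a confidence-weighted mean of the rest.
    kept = [(p, c) for p, c in zip(frame_probs, frame_conf) if c >= min_conf]
    if not kept:
        kept = list(zip(frame_probs, frame_conf))  # keep all if none pass
    total = sum(c for _, c in kept)
    n_classes = len(kept[0][0])
    if total == 0:
        # All confidences are zero: fall back to an unweighted mean.
        return [sum(p[i] for p, _ in kept) / len(kept) for i in range(n_classes)]
    return [sum(p[i] * c for p, c in kept) / total for i in range(n_classes)]

fused = fuse([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
video_probs = aggregate_frames([[0.9, 0.1], [0.2, 0.8], [0.8, 0.2]],
                               [0.9, 0.1, 0.7])
```

In the usage lines, the second frame has confidence 0.1 and is excluded, so the video-level probabilities come only from the first and third frames. A learned module would replace both the fixed threshold and the fixed weighting.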

updated: Sat Jul 15 2023 14:15:42 GMT+0000 (UTC)

published: Sat Jul 15 2023 14:15:42 GMT+0000 (UTC)

arXiv

