An Arbitrary Scale Super-Resolution Approach for 3D MR Images via Implicit Neural Representation

Qing Wu; Yuwei Li; Yawen Sun; Yan Zhou; Hongjiang Wei; Jingyi Yu; Yuyao Zhang

High-resolution (HR) medical images provide rich anatomical detail that facilitates early and accurate diagnosis. In MRI, because of limits on hardware capacity, scan time, and patient cooperation, acquiring isotropic 3D HR images typically requires long scan times and results in small spatial coverage and a low signal-to-noise ratio (SNR). Recent studies have shown that deep convolutional neural networks can recover isotropic HR MR images from low-resolution (LR) input via single image super-resolution (SISR) algorithms. However, most existing SISR methods learn a scale-specific mapping between LR and HR images, so they can handle only a fixed up-sampling rate. Supporting different up-sampling rates requires building a separate SR network for each rate, which is time-consuming and resource-intensive. In this paper, we propose ArSSR, an Arbitrary Scale Super-Resolution approach for recovering 3D HR MR images. In the ArSSR model, the reconstruction of HR images at different up-sampling rates is formulated as learning a continuous implicit voxel function from the observed LR images. The SR task then becomes representing this implicit voxel function with deep neural networks trained on a set of paired HR-LR examples. The ArSSR model consists of an encoder network and a decoder network: the convolutional encoder extracts feature maps from the LR input images, and the fully-connected decoder approximates the implicit voxel function. Because the learned function is continuous, a single trained ArSSR model can reconstruct HR images at an arbitrary up-sampling rate from any input LR image. Experimental results on three datasets show that the ArSSR model achieves state-of-the-art SR performance for 3D HR MR image reconstruction while using a single trained model for arbitrary up-sampling scales.
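The encoder/decoder idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `encode` function is a hypothetical stand-in that simply wraps each voxel intensity in a one-channel feature vector, the decoder is a tiny untrained MLP with random weights, and trilinear interpolation of features is an assumption made here for illustration. All function names are invented. The sketch only shows why a continuous voxel function permits any up-sampling rate: the HR grid is just a set of continuous coordinates at which the function is queried.

```python
import math
import random

def encode(lr):
    """Stand-in for the convolutional encoder (hypothetical): wraps each LR
    voxel intensity in a 1-channel feature vector."""
    return [[[[v] for v in row] for row in plane] for plane in lr]

def sample_features(feat, point):
    """Trilinearly interpolate feature vectors at a continuous LR index.
    Assumes every spatial dimension has at least 2 voxels."""
    dims = (len(feat), len(feat[0]), len(feat[0][0]))
    channels = len(feat[0][0][0])
    # Clamp the query so border voxels of the HR grid stay inside the volume.
    point = [min(max(float(c), 0.0), d - 1.0) for c, d in zip(point, dims)]
    base = [min(int(math.floor(c)), d - 2) for c, d in zip(point, dims)]
    frac = [c - b for c, b in zip(point, base)]
    out = [0.0] * channels
    for dx in (0, 1):
        for dy in (0, 1):
            for dz in (0, 1):
                weight = 1.0
                for offset, f in zip((dx, dy, dz), frac):
                    weight *= f if offset else 1.0 - f
                corner = feat[base[0] + dx][base[1] + dy][base[2] + dz]
                for c in range(channels):
                    out[c] += weight * corner[c]
    return out

def make_decoder(in_dim, hidden=8, seed=0):
    """Tiny fully-connected decoder with random (untrained) weights."""
    rng = random.Random(seed)
    w1 = [[rng.uniform(-1, 1) for _ in range(in_dim)] for _ in range(hidden)]
    w2 = [rng.uniform(-1, 1) for _ in range(hidden)]

    def decoder(vec):
        # One ReLU hidden layer followed by a linear output (voxel intensity).
        h = [max(0.0, sum(w * x for w, x in zip(row, vec))) for row in w1]
        return sum(w * x for w, x in zip(w2, h))

    return decoder

def super_resolve(lr, scale, decoder):
    """Query the implicit voxel function on an HR grid of arbitrary scale."""
    feat = encode(lr)
    dims = (len(lr), len(lr[0]), len(lr[0][0]))
    hr_dims = [max(1, round(d * scale)) for d in dims]
    # Centre of HR voxel i, expressed in LR index space.
    axes = [[(i + 0.5) / scale - 0.5 for i in range(n)] for n in hr_dims]
    return [[[decoder([x, y, z] + sample_features(feat, (x, y, z)))
              for z in axes[2]] for y in axes[1]] for x in axes[0]]

# The same decoder serves integer and non-integer scales alike.
rng = random.Random(0)
lr = [[[rng.random() for _ in range(4)] for _ in range(4)] for _ in range(4)]
decoder = make_decoder(in_dim=4)  # 3 coordinates + 1 feature channel
for scale in (2.0, 2.5, 3.2):
    hr = super_resolve(lr, scale, decoder)
    print(scale, (len(hr), len(hr[0]), len(hr[0][0])))
```

In a trained model the decoder weights would be fitted on paired HR-LR examples, as the abstract describes. Here they are random, so the output intensities are meaningless and only the querying mechanism is demonstrated.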

updated: Wed Nov 30 2022 03:52:44 GMT+0000 (UTC)

published: Wed Oct 27 2021 14:48:54 GMT+0000 (UTC)

arXiv
