A microstructure estimation Transformer inspired by sparse representation for diffusion MRI

Tianshu Zheng; Cong Sun; Weihao Zheng; Wen Shi; Haotian Li; Yi Sun; Yi Zhang; Guangbin Wang; Chuyang Ye; Dan Wu

Diffusion magnetic resonance imaging (dMRI) is an important tool for characterizing tissue microstructure based on biophysical models, which are complex and highly non-linear. Resolving microstructure with optimization techniques is prone to estimation errors and requires dense sampling in q-space. Deep-learning-based approaches have been proposed to overcome these limitations. Motivated by the superior performance of the Transformer, we present a Transformer-based learning framework, the Microstructure Estimation Transformer with Sparse Coding (METSC), for dMRI-based microstructure estimation from downsampled q-space data. To take advantage of the Transformer while addressing its need for large amounts of training data, we explicitly introduce an inductive bias (a model bias) into the Transformer using a sparse coding technique, which facilitates training. The METSC is thus composed of three stages: an embedding stage, a sparse representation stage, and a mapping stage. The embedding stage is a Transformer-based structure that encodes the signal so that each voxel is represented effectively. In the sparse representation stage, a dictionary is constructed by solving a sparse reconstruction problem with an unfolded Iterative Hard Thresholding (IHT) process. The mapping stage is essentially a decoder that computes the microstructural parameters from the output of the second stage as a weighted sum of normalized dictionary coefficients, where the weights are also learned. We tested our framework on two dMRI models with downsampled q-space data: the intravoxel incoherent motion (IVIM) model and the neurite orientation dispersion and density imaging (NODDI) model. The proposed method achieved up to 11.25-fold acceleration in scan time and outperformed other state-of-the-art learning-based methods.
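
For readers unfamiliar with Iterative Hard Thresholding, the following is a minimal sketch of the classical, non-learned algorithm with a fixed dictionary. It is not the METSC implementation: the abstract describes an unfolded version in which the dictionary is learned, and gives no further detail. The function names `hard_threshold` and `iht`, the parameters `step` and `n_iter`, and the toy dictionary are all assumptions made for illustration.

```python
def hard_threshold(x, s):
    """Keep the s largest-magnitude entries of x and set the rest to zero."""
    keep = sorted(range(len(x)), key=lambda i: abs(x[i]), reverse=True)[:s]
    out = [0.0] * len(x)
    for i in keep:
        out[i] = x[i]
    return out


def iht(D, y, s, step, n_iter=100):
    """Classical IHT for: minimize ||y - D x||^2 subject to ||x||_0 <= s.

    D is an m x n dictionary given as a list of rows; y has length m.
    step is assumed to be at most 1 / ||D||^2 (spectral norm squared).
    """
    m, n = len(D), len(D[0])
    x = [0.0] * n
    for _ in range(n_iter):
        # Residual r = y - D x
        r = [y[i] - sum(D[i][j] * x[j] for j in range(n)) for i in range(m)]
        # Gradient step x + step * D^T r, then projection onto s-sparse vectors
        g = [x[j] + step * sum(D[i][j] * r[i] for i in range(m)) for j in range(n)]
        x = hard_threshold(g, s)
    return x


# Hypothetical toy problem: the signal is exactly 2 times the third atom.
D_toy = [[1.0, 0.0, 0.6],
         [0.0, 1.0, 0.8]]
y_toy = [1.2, 1.6]
print(iht(D_toy, y_toy, s=1, step=0.5))  # coefficients approach [0, 0, 2]
```

In an unfolded network, each pass through the loop would typically become one layer whose matrices and thresholds are trained from data rather than fixed in advance; how METSC does this is not specified in the abstract.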

updated: Fri May 13 2022 05:14:22 GMT+0000 (UTC)

published: Fri May 13 2022 05:14:22 GMT+0000 (UTC)

arXiv
