A robust and interpretable deep learning framework for multi-modal registration via keypoints

Alan Q. Wang; Evan M. Yu; Adrian V. Dalca; Mert R. Sabuncu

キーポイントによるマルチモーダル登録のための堅牢で解釈可能な深層学習フレームワーク

KeyMorph は、対応するキーポイントの自動検出に依存するディープラーニングベースの画像登録フレームワークです。登録のための最先端の深層学習方法は、多くの場合、大きなミスアライメントに対して堅牢ではなく、解釈できず、問題の対称性を取り入れていません。さらに、ほとんどのモデルは、テスト時に 1 つの予測のみを生成します。これらの欠点に対処する私たちの核となる洞察は、画像間の対応するキーポイントを使用して、微分可能な閉じた形式の式を介して最適な変換を取得できるということです。この観察結果を使用して、登録タスク用に調整されたキーポイントのエンドツーエンドの学習を推進しますが、グラウンドトゥルースキーポイントの知識は必要ありません。このフレームワークは、実質的により堅牢な登録につながるだけでなく、キーポイントが画像のどの部分が最終的な位置合わせを促進しているかを明らかにするため、解釈可能性も向上させます。さらに、KeyMorph は、画像の変換の下で等変になるように、および/または入力画像の順序に関して対称になるように設計できます。最後に、さまざまな変換バリアントに対応するテスト時に、複数の変形フィールドを効率的に、閉じた形式で計算する方法を示します。マルチモーダル脳MRIスキャンの3Dアフィンおよびスプラインベースの登録を解決する際に提案されたフレームワークを示します。特に、特に大きな変位のコンテキストで、現在の最先端の方法を超える登録精度を示します。私たちのコードは https://github.com/evanmy/keymorph で入手できます。

We present KeyMorph, a deep learning-based image registration framework that relies on automatically detecting corresponding keypoints. State-of-the-art deep learning methods for registration often are not robust to large misalignments, are not interpretable, and do not incorporate the symmetries of the problem. In addition, most models produce only a single prediction at test-time. Our core insight which addresses these shortcomings is that corresponding keypoints between images can be used to obtain the optimal transformation via a differentiable closed-form expression. We use this observation to drive the end-to-end learning of keypoints tailored for the registration task, and without knowledge of ground-truth keypoints. This framework not only leads to substantially more robust registration but also yields better interpretability, since the keypoints reveal which parts of the image are driving the final alignment. Moreover, KeyMorph can be designed to be equivariant under image translations and/or symmetric with respect to the input image ordering. Finally, we show how multiple deformation fields can be computed efficiently and in closed-form at test time corresponding to different transformation variants. We demonstrate the proposed framework in solving 3D affine and spline-based registration of multi-modal brain MRI scans. In particular, we show registration accuracy that surpasses current state-of-the-art methods, especially in the context of large displacements. Our code is available at https://github.com/evanmy/keymorph.

updated: Wed Apr 19 2023 19:35:25 GMT+0000 (UTC)

published: Wed Apr 19 2023 19:35:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト