Learnable Graph Matching: A Practical Paradigm for Data Association

Jiawei He; Zehao Huang; Naiyan Wang; Zhaoxiang Zhang

Data association is at the core of many computer vision tasks, e.g., multiple object tracking, image matching, and point cloud registration. Existing methods usually solve the data association problem by network flow optimization, bipartite matching, or direct end-to-end learning. Despite their popularity, we find some defects in the current solutions: they mostly ignore intra-view context information; moreover, they either train deep association models end to end and hardly exploit the advantages of optimization-based assignment methods, or only use an off-the-shelf neural network to extract features. In this paper, we propose a general learnable graph matching method to address these issues. Specifically, we model the intra-view relationships as an undirected graph, so that data association becomes a general graph matching problem between graphs. Furthermore, to make the optimization end-to-end differentiable, we relax the original graph matching problem into a continuous quadratic programming problem and then incorporate training into a deep graph neural network using the KKT conditions and the implicit function theorem. On the MOT task, our method achieves state-of-the-art performance on several MOT datasets. For image matching, our method outperforms state-of-the-art methods with half the training data and iterations on a popular indoor dataset, ScanNet. Code will be available at https://github.com/jiaweihe1996/GMTracker.
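To make the idea of casting data association as graph matching concrete, the following is a minimal toy sketch, not the authors' implementation. It assumes three detections per view, a hand-written node affinity table, and normalized pairwise distances as edge attributes; every name in it (`match_score`, `brute_force_graph_matching`, `unary`, `edges_a`, `edges_b`, `edge_weight`) is hypothetical. It solves the discrete quadratic assignment objective by brute-force enumeration, whereas the paper relaxes the problem to a continuous quadratic program and differentiates through it; that part is not shown here.

```python
from itertools import permutations


def match_score(perm, unary, edges_a, edges_b, edge_weight=1.0):
    """Score the assignment i -> perm[i] with node and edge terms."""
    # Unary term: appearance affinity between node i in view A and perm[i] in view B.
    score = sum(unary[i][perm[i]] for i in range(len(perm)))
    # Pairwise term: reward edges of A that land on edges of B with similar attributes.
    for (i, j), w_a in edges_a.items():
        key = (perm[i], perm[j])
        # Graphs are undirected, so look the edge up in both orientations.
        w_b = edges_b.get(key, edges_b.get(key[::-1]))
        if w_b is not None:
            score += edge_weight * (1.0 - abs(w_a - w_b))
    return score


def brute_force_graph_matching(unary, edges_a, edges_b, edge_weight=1.0):
    """Enumerate all permutations; feasible only for very small graphs."""
    n = len(unary)
    return max(
        permutations(range(n)),
        key=lambda p: match_score(p, unary, edges_a, edges_b, edge_weight),
    )


# Assumed toy data: nodes 1 and 2 look almost alike, so appearance alone is ambiguous.
unary = [
    [0.90, 0.10, 0.10],
    [0.10, 0.55, 0.45],
    [0.10, 0.50, 0.50],
]
# Assumed edge attributes (e.g. normalized distances); view B is view A
# with nodes 1 and 2 swapped.
edges_a = {(0, 1): 0.2, (0, 2): 0.8, (1, 2): 0.6}
edges_b = {(0, 2): 0.2, (0, 1): 0.8, (1, 2): 0.6}

with_edges = brute_force_graph_matching(unary, edges_a, edges_b)
nodes_only = brute_force_graph_matching(unary, edges_a, edges_b, edge_weight=0.0)
print("with intra-view edges:", with_edges)
print("node affinities only: ", nodes_only)
```

In this toy setup the node-only objective (equivalent to bipartite matching) picks the identity assignment `(0, 1, 2)`, while adding the edge term recovers the swapped assignment `(0, 2, 1)`, which illustrates why intra-view context can help resolve ambiguous appearance.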

updated: Mon Mar 27 2023 17:39:00 GMT+0000 (UTC)

published: Mon Mar 27 2023 17:39:00 GMT+0000 (UTC)

arXiv
