Beyond Fixed Grid: Learning Geometric Image Representation with a Deformable Grid

Jun Gao; Zian Wang; Jinchen Xuan; Sanja Fidler

In modern computer vision, images are typically represented as a fixed uniform grid with some stride and processed via a deep convolutional neural network. We argue that deforming the grid to better align with the high-frequency image content is a more effective strategy. We introduce Deformable Grid (DefGrid), a learnable neural network module that predicts location offsets of the vertices of a 2-dimensional triangular grid, such that the edges of the deformed grid align with image boundaries. We showcase DefGrid in a variety of use cases by inserting it as a module at various levels of processing. We utilize DefGrid as an end-to-end learnable geometric downsampling layer that replaces standard pooling methods for reducing feature resolution when feeding images into a deep CNN. For the task of semantic segmentation, we show significantly improved results at the same grid resolution compared to using CNNs on uniform grids. We also utilize DefGrid at the output layers for the task of object mask annotation, and show that reasoning about object boundaries on our predicted polygonal grid leads to more accurate results than existing pixel-wise and curve-based approaches. Finally, we showcase DefGrid as a standalone module for unsupervised image partitioning, showing superior performance over existing approaches. Project website: http://www.cs.toronto.edu/~jungao/def-grid
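To make the grid representation concrete, the following is a minimal NumPy sketch of a uniform 2D triangular grid whose vertices are displaced by per-vertex offsets. It is an illustration under stated assumptions, not the authors' implementation: the function names (`make_triangular_grid`, `deform_grid`, `signed_areas`), the unit-square coordinates, the `tanh` bounding, and the `max_shift` parameter are all hypothetical. In DefGrid the offsets are predicted by a neural network; here they are simply passed in as an array.

```python
import numpy as np


def make_triangular_grid(rows, cols):
    """Build a uniform triangular grid over the unit square.

    Returns vertices of shape (rows*cols, 2) as (x, y) in [0, 1] and
    triangles of shape (2*(rows-1)*(cols-1), 3) as vertex indices.
    """
    ys, xs = np.meshgrid(np.linspace(0.0, 1.0, rows),
                         np.linspace(0.0, 1.0, cols), indexing="ij")
    vertices = np.stack([xs.ravel(), ys.ravel()], axis=1)
    triangles = []
    for r in range(rows - 1):
        for c in range(cols - 1):
            v00 = r * cols + c   # top-left corner of the cell
            v01 = v00 + 1        # top-right
            v10 = v00 + cols     # bottom-left
            v11 = v10 + 1        # bottom-right
            # Split each cell into two triangles along one diagonal.
            triangles.append((v00, v01, v11))
            triangles.append((v00, v11, v10))
    return vertices, np.array(triangles, dtype=np.int64)


def deform_grid(vertices, offsets, rows, cols, max_shift=0.2):
    """Displace vertices by offsets bounded to a fraction of one cell.

    The bound is an assumption of this sketch: keeping each coordinate
    shift below a quarter of a cell keeps every triangle's orientation
    unchanged, so the grid topology is preserved.
    """
    cell = np.array([1.0 / (cols - 1), 1.0 / (rows - 1)])
    bounded = np.tanh(offsets) * max_shift * cell
    return np.clip(vertices + bounded, 0.0, 1.0)


def signed_areas(vertices, triangles):
    """Signed area of each triangle; the sign encodes orientation."""
    a, b, c = (vertices[triangles[:, i]] for i in range(3))
    cross = ((b[:, 0] - a[:, 0]) * (c[:, 1] - a[:, 1])
             - (b[:, 1] - a[:, 1]) * (c[:, 0] - a[:, 0]))
    return 0.5 * cross


if __name__ == "__main__":
    rows, cols = 4, 5
    verts, tris = make_triangular_grid(rows, cols)
    rng = np.random.default_rng(0)
    # Stand-in for network-predicted offsets.
    offsets = rng.normal(size=verts.shape)
    deformed = deform_grid(verts, offsets, rows, cols)
    print(verts.shape, tris.shape, bool((signed_areas(deformed, tris) > 0).all()))
```

Because only vertex positions change while the triangle index list stays fixed, the deformed grid keeps the same connectivity as the uniform one, which is what allows it to stand in for a regular grid at the same resolution.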

updated: Mon Apr 03 2023 23:03:54 GMT+0000 (UTC)

published: Fri Aug 21 2020 02:22:06 GMT+0000 (UTC)

arXiv
