On Point Affiliation in Feature Upsampling

Wenze Liu; Hao Lu; Yuliang Liu; Zhiguo Cao

機能アップサンプリングにおけるオンポイントアフィリエーション

ポイントの所属の概念を特徴のアップサンプリングに導入します。特徴マップを同一の意味の点によって形成される重複しない意味クラスターに抽象化することにより、特徴のアップサンプリングをポイントの所属、つまりアップサンプリングされた各点の意味クラスターを指定するものと見なすことができます。カーネルベースの動的アップサンプリングのフレームワークでは、アップサンプリングされたポイントが、近隣の低解像度デコーダーポイントと高解像度エンコーダーポイントを利用して、それらの間の相互類似性を条件に、所属を推論できることを示します。したがって、類似性を意識したアップサンプリングカーネルを生成するための一般的な定式化を提示し、そのようなカーネルが意味論的な滑らかさだけでなく境界の鮮明さも促進することを証明します。この定式化は、新規で軽量かつ汎用的なアップサンプリングソリューション、Similarity-Aware Point Affiliation (SAPA) を構成します。ウィンドウ形状のカーネルを使用した予備設計を通じて、その動作メカニズムを示します。物体検出に関する設計の限界を調査した後、動的カーネル形状を備えた SAPA につながるアップサンプリングに関する追加の洞察を明らかにしました。広範な実験により、SAPA が以前のアップサンプラーよりも優れたパフォーマンスを示し、セマンティックセグメンテーション、オブジェクト検出、インスタンスセグメンテーション、パノプティックセグメンテーション、画像マッティング、深度推定などの多数の高密度予測タスクで一貫したパフォーマンスの向上をもたらすことが実証されています。コードはhttps://github.com/tiny-smart/sapaから入手できます。

We introduce the notion of point affiliation into feature upsampling. By abstracting a feature map into non-overlapped semantic clusters formed by points of identical semantic meaning, feature upsampling can be viewed as point affiliation -- designating a semantic cluster for each upsampled point. In the framework of kernel-based dynamic upsampling, we show that an upsampled point can resort to its low-res decoder neighbors and high-res encoder point to reason the affiliation, conditioned on the mutual similarity between them. We therefore present a generic formulation for generating similarity-aware upsampling kernels and prove that such kernels encourage not only semantic smoothness but also boundary sharpness. This formulation constitutes a novel, lightweight, and universal upsampling solution, Similarity-Aware Point Affiliation (SAPA). We show its working mechanism via our preliminary designs with window-shape kernel. After probing the limitations of the designs on object detection, we reveal additional insights for upsampling, leading to SAPA with the dynamic kernel shape. Extensive experiments demonstrate that SAPA outperforms prior upsamplers and invites consistent performance improvements on a number of dense prediction tasks, including semantic segmentation, object detection, instance segmentation, panoptic segmentation, image matting, and depth estimation. Code is made available at: https://github.com/tiny-smart/sapa

updated: Mon Jul 17 2023 01:59:14 GMT+0000 (UTC)

published: Mon Jul 17 2023 01:59:14 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト