SelectionConv: Convolutional Neural Networks for Non-rectilinear Image Data

David Hart; Michael Whitney; Bryan Morse

Convolutional Neural Networks have revolutionized vision applications. There are image domains and representations, however, that cannot be handled by standard CNNs (e.g., spherical images, superpixels). Such data are usually processed using networks and algorithms specialized for each type. In this work, we show that it may not always be necessary to use specialized neural networks to operate on such spaces. Instead, we introduce a new structured graph convolution operator that can copy 2D convolution weights, transferring the capabilities of already trained traditional CNNs to our new graph network. This network can then operate on any data that can be represented as a positional graph. By converting non-rectilinear data to a graph, we can apply these convolutions on these irregular image domains without requiring training on large domain-specific datasets. Results of transferring pre-trained image networks for segmentation, stylization, and depth prediction are demonstrated for a variety of such data forms.
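The weight-copying idea can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each graph edge carries a "selection" index naming which of the nine positions of a 3x3 kernel it corresponds to, and that the graph operator applies the matching slice of the copied 2D weights to each neighbor and sums the results. The names `grid_to_graph`, `selection_conv`, and `conv2d_reference` are hypothetical and chosen only for this example. On a regular pixel grid the graph version should reproduce an ordinary zero-padded convolution, which the final check verifies.

```python
import numpy as np

# Offsets (dy, dx) of a 3x3 kernel in row-major order.
# The position in this list is the edge's "selection" index (an assumption of this sketch).
OFFSETS = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]


def grid_to_graph(height, width):
    """Return directed edges (src, dst, selection) for a regular pixel grid."""
    edges = []
    for y in range(height):
        for x in range(width):
            for sel, (dy, dx) in enumerate(OFFSETS):
                ny, nx = y + dy, x + dx
                if 0 <= ny < height and 0 <= nx < width:
                    # The neighbor (ny, nx) sends a message to the center (y, x).
                    edges.append((ny * width + nx, y * width + x, sel))
    return np.array(edges)


def selection_conv(node_feats, edges, weight):
    """Graph convolution that reuses 2D conv weights.

    node_feats: (N, C_in) node features.
    edges:      (E, 3) integer array of (src, dst, selection).
    weight:     (C_out, C_in, 3, 3) weights copied from a 2D convolution.
    """
    c_out, c_in = weight.shape[:2]
    w = weight.reshape(c_out, c_in, 9)  # one (C_out, C_in) matrix per selection
    out = np.zeros((node_feats.shape[0], c_out))
    src, dst, sel = edges[:, 0], edges[:, 1], edges[:, 2]
    for s in range(9):
        mask = sel == s
        msgs = node_feats[src[mask]] @ w[:, :, s].T  # (E_s, C_out)
        np.add.at(out, dst[mask], msgs)              # sum messages per target node
    return out


def conv2d_reference(image, weight):
    """Plain zero-padded 3x3 cross-correlation on an image of shape (C_in, H, W)."""
    c_out = weight.shape[0]
    _, h, w = image.shape
    padded = np.pad(image, ((0, 0), (1, 1), (1, 1)))
    out = np.zeros((c_out, h, w))
    for y in range(h):
        for x in range(w):
            patch = padded[:, y:y + 3, x:x + 3]
            out[:, y, x] = np.tensordot(weight, patch, axes=([1, 2, 3], [0, 1, 2]))
    return out


# Usage: on a regular grid, the graph operator matches the ordinary convolution.
rng = np.random.default_rng(0)
image = rng.standard_normal((2, 4, 5))      # (C_in, H, W)
weight = rng.standard_normal((3, 2, 3, 3))  # stands in for pre-trained conv weights
feats = image.reshape(2, -1).T              # (N, C_in), node index = y * W + x
graph_out = selection_conv(feats, grid_to_graph(4, 5), weight)
assert np.allclose(graph_out.T.reshape(3, 4, 5), conv2d_reference(image, weight))
```

For non-rectilinear data such as spherical images or superpixels, only the graph construction would change: under the same assumption, each edge would be assigned the selection index of the kernel position that its direction most closely matches, and the copied weights would be applied unchanged.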

updated: Mon Jul 18 2022 23:20:50 GMT+0000 (UTC)

published: Mon Jul 18 2022 23:20:50 GMT+0000 (UTC)

arXiv
