DISCO: accurate Discrete Scale Convolutions

Ivan Sosnovik; Artem Moskalev; Arnold Smeulders

DISCO: 正確な離散スケール畳み込み

スケールは、多くの視覚タスクにおいて、与えられた邪魔な要素と見なされることがよくあります。その際、学習中により多くのデータが必要になる理由の 1 つです。最近の作業では、畳み込みニューラルネットワークにスケールの等価性が追加されました。さまざまな作業に効果があることがわかりました。私たちは、精度の高いスケール等価畳み込みニューラルネットワーク (SE-CNN) を目指しており、スケールの粒度が高く、フィルターサイズが小さい必要がある問題に適用できます。現在の SE-CNN は、重みの共有とフィルターの再スケーリングに依存しており、後者は整数スケールに対してのみ正確です。正確なスケールの等分散に到達するために、スケールコンボリューションが離散再スケーリングと同等のままになる一般的な制約を導き出します。それが存在するすべての場合の正確なソリューションを見つけ、残りの近似を計算します。 MNIST スケールでの新しい最先端の分類と STL-10 での結果の改善に示されているように、離散スケールコンボリューションは報われます。同じ SE スキームを使用して、OTB-13 上のスケール同変のシャムトラッカーの計算作業も改善します。

Scale is often seen as a given, disturbing factor in many vision tasks. When doing so it is one of the factors why we need more data during learning. In recent work scale equivariance was added to convolutional neural networks. It was shown to be effective for a range of tasks. We aim for accurate scale-equivariant convolutional neural networks (SE-CNNs) applicable for problems where high granularity of scale and small filter sizes are required. Current SE-CNNs rely on weight sharing and filter rescaling, the latter of which is accurate for integer scales only. To reach accurate scale equivariance, we derive general constraints under which scale-convolution remains equivariant to discrete rescaling. We find the exact solution for all cases where it exists, and compute the approximation for the rest. The discrete scale-convolution pays off, as demonstrated in a new state-of-the-art classification on MNIST-scale and improving the results on STL-10. With the same SE scheme, we also improve the computational effort of a scale-equivariant Siamese tracker on OTB-13.

updated: Fri Jun 04 2021 21:48:09 GMT+0000 (UTC)

published: Fri Jun 04 2021 21:48:09 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト