Tunable Convolutions with Parametric Multi-Loss Optimization

Matteo Maggioni; Thomas Tanay; Francesca Babiloni; Steven McDonagh; Aleš Leonardis

The behavior of neural networks is irremediably determined by the specific loss and data used during training. However, it is often desirable to tune the model at inference time based on external factors such as the preferences of the user or dynamic characteristics of the data. This is especially important for balancing the perception-distortion trade-off in ill-posed image-to-image translation tasks. In this work, we propose to optimize a parametric tunable convolutional layer, which includes a number of different kernels, using a parametric multi-loss, which includes an equal number of objectives. Our key insight is to use a shared set of parameters to dynamically interpolate both the objectives and the kernels. During training, these parameters are sampled at random to explicitly optimize all possible combinations of objectives and consequently disentangle their effect into the corresponding kernels. During inference, these parameters become interactive inputs of the model, enabling reliable and consistent control over the model behavior. Extensive experimental results demonstrate that our tunable convolutions work effectively as a drop-in replacement for traditional convolutions in existing neural networks at virtually no extra computational cost, outperforming state-of-the-art control strategies in a wide range of applications, including image denoising, deblurring, super-resolution, and style transfer.
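
The shared-parameter idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses plain Python and a 1-D convolution for brevity, and every name in it (`interpolate_kernels`, `tunable_conv1d`, `multi_loss`, `sample_params`) is hypothetical. The two stand-in objectives (MSE and MAE) and the choice to sample parameters uniformly and normalize them to sum to one are assumptions made only for illustration. The gradient update that would follow each loss evaluation during training is omitted.

```python
import random


def interpolate_kernels(kernels, params):
    """Blend equally sized 1-D kernels with one weight per kernel."""
    size = len(kernels[0])
    return [sum(p * k[i] for p, k in zip(params, kernels)) for i in range(size)]


def conv1d(signal, kernel):
    """'Valid' 1-D cross-correlation (the convention of deep-learning conv layers)."""
    n = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(n))
        for i in range(len(signal) - n + 1)
    ]


def tunable_conv1d(signal, kernels, params):
    """Convolve with the kernel obtained by interpolating `kernels` with `params`."""
    return conv1d(signal, interpolate_kernels(kernels, params))


def mse(output, target):
    """Stand-in objective 1 (assumption): mean squared error."""
    return sum((o - t) ** 2 for o, t in zip(output, target)) / len(output)


def mae(output, target):
    """Stand-in objective 2 (assumption): mean absolute error."""
    return sum(abs(o - t) for o, t in zip(output, target)) / len(output)


def multi_loss(output, target, objectives, params):
    """Blend the objectives with the SAME weights used for the kernels."""
    return sum(p * f(output, target) for p, f in zip(params, objectives))


def sample_params(n, rng):
    """Draw n random non-negative weights that sum to one (assumed distribution)."""
    raw = [rng.random() for _ in range(n)]
    total = sum(raw)
    return [r / total for r in raw]


# One forward pass of a training iteration: the sampled parameters drive
# both the kernel interpolation and the loss interpolation.
rng = random.Random(0)
signal = [1.0, 2.0, 4.0, 8.0, 16.0]
target = [2.0, 4.0, 8.0]
kernels = [[0.0, 1.0, 0.0], [1 / 3, 1 / 3, 1 / 3]]  # one kernel per objective

params = sample_params(len(kernels), rng)
output = tunable_conv1d(signal, kernels, params)
loss = multi_loss(output, target, [mse, mae], params)

# At inference time the same parameters are set by the user instead.
identity_like = tunable_conv1d(signal, kernels, [1.0, 0.0])
smoothed = tunable_conv1d(signal, kernels, [0.0, 1.0])
```

Because the kernels are blended before the convolution is applied, only one convolution is computed per layer regardless of how many kernels are stored, which is consistent with the abstract's claim of virtually no extra computational cost at inference.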

updated: Mon Apr 03 2023 11:36:10 GMT+0000 (UTC)

published: Mon Apr 03 2023 11:36:10 GMT+0000 (UTC)

arXiv
