A Unified Framework for Implicit Sinkhorn Differentiation

Marvin Eisenberger; Aysim Toker; Laura Leal-Taixé; Florian Bernard; Daniel Cremers

暗黙のシンクホーン差別化のための統一されたフレームワーク

Sinkhornのオペレーターは、最近、コンピュータービジョンおよび関連分野で人気が急上昇しています。主な理由の1つは、ディープラーニングフレームワークへの統合が容易なことです。それぞれのニューラルネットワークの効率的なトレーニングを可能にするために、陰関数の微分を介してシンクホーン層の分析勾配を取得するアルゴリズムを提案します。以前の作業と比較して、私たちのフレームワークは、Sinkhornオペレーターの最も一般的な定式化に基づいています。これにより、あらゆるタイプの損失関数が可能になり、ターゲット容量とコストマトリックスの両方が共同で区別されます。さらに、近似入力に対して結果のアルゴリズムの誤差範囲を構築します。最後に、多くのアプリケーションで、自動微分をアルゴリズムに置き換えるだけで、取得した勾配の安定性と精度が直接向上することを示します。さらに、特にGPUメモリなどのリソースが不足している場合は、計算効率が高いことを示します。

The Sinkhorn operator has recently experienced a surge of popularity in computer vision and related fields. One major reason is its ease of integration into deep learning frameworks. To allow for an efficient training of respective neural networks, we propose an algorithm that obtains analytical gradients of a Sinkhorn layer via implicit differentiation. In comparison to prior work, our framework is based on the most general formulation of the Sinkhorn operator. It allows for any type of loss function, while both the target capacities and cost matrices are differentiated jointly. We further construct error bounds of the resulting algorithm for approximate inputs. Finally, we demonstrate that for a number of applications, simply replacing automatic differentiation with our algorithm directly improves the stability and accuracy of the obtained gradients. Moreover, we show that it is computationally more efficient, particularly when resources like GPU memory are scarce.

updated: Fri May 13 2022 14:45:31 GMT+0000 (UTC)

published: Fri May 13 2022 14:45:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト