Addressing Negative Transfer in Diffusion Models

Hyojun Go; JinYoung Kim; Yunsung Lee; Seunghyun Lee; Shinhyeok Oh; Hyeongdon Moon; Seungtaek Choi

拡散モデルにおける負の転移への対処

拡散ベースの生成モデルは、さまざまな分野で目覚ましい成功を収めています。これは、さまざまなノイズレベルを同時に包含するノイズ除去タスクでモデルをトレーニングし、マルチタスク学習 (MTL) の形式を表します。ただし、MTL の観点から拡散モデルを分析および改善することは、まだ研究が進んでいません。特に、MTL は、タスク間の競合により特定のタスクのパフォーマンスが低下する、よく知られたネガティブ転送現象を引き起こすことがあります。この論文では、MTL の観点から拡散トレーニングを分析することを目的としており、2 つの重要な観察結果を提示します: (O1) ノイズ除去タスク間のタスクの親和性は、ノイズレベル間のギャップが広がるにつれて減少します。(O2) コンテキスト内でも負の伝達が発生する可能性があります。普及トレーニングの様子。これらの観察に基づいて、私たちの目的は、否定的な転移を軽減することによって普及トレーニングを強化することです。これを達成するために、既存の MTL 手法を活用することを提案しますが、膨大な数のノイズ除去タスクが存在するため、必要なタスクごとの損失や勾配を計算するのに計算コストがかかります。この課題に対処するために、ノイズ除去タスクを小さなタスククラスターにクラスタリングし、それらに MTL メソッドを適用することを提案します。具体的には、(O2) に基づいて、インターバルクラスタリングを使用して、クラスタ内のノイズ除去タスク間の時間的近接性を強制します。区間クラスタリングが動的計画法で解決でき、クラスタリングの目的に信号対雑音比、タイムステップ、およびタスクの親和性を利用できることを示します。これにより、私たちのアプローチは、MTL 法の効率的な計算を可能にすることで、拡散モデルにおける負の伝達の問題に対処します。提案されたクラスタリングとその MTL 手法との統合をさまざまな実験を通じて検証し、拡散モデルのサンプル品質の向上を実証します。

Diffusion-based generative models have achieved remarkable success in various domains. It trains a model on denoising tasks that encompass different noise levels simultaneously, representing a form of multi-task learning (MTL). However, analyzing and improving diffusion models from an MTL perspective remains under-explored. In particular, MTL can sometimes lead to the well-known phenomenon of negative transfer, which results in the performance degradation of certain tasks due to conflicts between tasks. In this paper, we aim to analyze diffusion training from an MTL standpoint, presenting two key observations: (O1) the task affinity between denoising tasks diminishes as the gap between noise levels widens, and (O2) negative transfer can arise even in the context of diffusion training. Building upon these observations, our objective is to enhance diffusion training by mitigating negative transfer. To achieve this, we propose leveraging existing MTL methods, but the presence of a huge number of denoising tasks makes this computationally expensive to calculate the necessary per-task loss or gradient. To address this challenge, we propose clustering the denoising tasks into small task clusters and applying MTL methods to them. Specifically, based on (O2), we employ interval clustering to enforce temporal proximity among denoising tasks within clusters. We show that interval clustering can be solved with dynamic programming and utilize signal-to-noise ratio, timestep, and task affinity for clustering objectives. Through this, our approach addresses the issue of negative transfer in diffusion models by allowing for efficient computation of MTL methods. We validate the proposed clustering and its integration with MTL methods through various experiments, demonstrating improved sample quality of diffusion models.

updated: Thu Jun 01 2023 05:17:07 GMT+0000 (UTC)

published: Thu Jun 01 2023 05:17:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト