Deep Generative Modeling on Limited Data with Regularization by Nontransferable Pre-trained Models

Yong Zhong; Hongtao Liu; Xiaodong Liu; Fan Bao; Weiran Shen; Chongxuan Li

転送不可能な事前トレーニング済みモデルによる正規化を使用した限られたデータの深い生成モデリング

深層生成モデル (DGM) は、限られたデータで複雑なモデルを学習すると、分散が大きくなり、過適合になりやすいため、データを大量に消費します。バイアスと分散のトレードオフの古典的な視点に触発されて、正規化された深い生成モデル (Reg-DGM) を提案します。これは、転送不可能な事前トレーニング済みモデルを活用して、限られたデータで生成モデリングの分散を減らします。正式には、Reg-DGM は、特定の発散とエネルギー関数の期待値の加重和を最適化します。ここで、発散はデータとモデルの分布の間にあり、エネルギー関数はモデル分布に関する事前トレーニング済みのモデルによって定義されます。重み付けハイパーパラメーターがバイアスと分散をどのようにトレードオフするかを示すために、単純だが代表的なガウスフィッティングケースを分析します。理論的には、ノンパラメトリック設定での Reg-DGM のグローバル最小値の存在と一意性を特徴付け、勾配ベースの方法でトレーニングされたニューラルネットワークとの収束を証明します。経験的に、さまざまな事前トレーニング済みの特徴抽出器とデータ依存のエネルギー関数を使用して、Reg-DGM は限られたデータで強力な DGM の生成パフォーマンスを一貫して改善し、最先端の方法に匹敵する結果を達成します。私たちの実装は、https://github.com/ML-GSAI/Reg-ADA-APA で入手できます。

Deep generative models (DGMs) are data-eager because learning a complex model on limited data suffers from a large variance and easily overfits. Inspired by the classical perspective of the bias-variance tradeoff, we propose regularized deep generative model (Reg-DGM), which leverages a nontransferable pre-trained model to reduce the variance of generative modeling with limited data. Formally, Reg-DGM optimizes a weighted sum of a certain divergence and the expectation of an energy function, where the divergence is between the data and the model distributions, and the energy function is defined by the pre-trained model w.r.t. the model distribution. We analyze a simple yet representative Gaussian-fitting case to demonstrate how the weighting hyperparameter trades off the bias and the variance. Theoretically, we characterize the existence and the uniqueness of the global minimum of Reg-DGM in a non-parametric setting and prove its convergence with neural networks trained by gradient-based methods. Empirically, with various pre-trained feature extractors and a data-dependent energy function, Reg-DGM consistently improves the generation performance of strong DGMs with limited data and achieves competitive results to the state-of-the-art methods. Our implementation is available at https://github.com/ML-GSAI/Reg-ADA-APA.

updated: Mon Apr 10 2023 09:27:28 GMT+0000 (UTC)

published: Tue Aug 30 2022 10:28:50 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト