On Designing Good Representation Learning Models

Qinglin Li; Bin Li; Jonathan M Garibaldi; Guoping Qiu

優れた表現学習モデルの設計について

表現学習の目標は、意思決定などの機械学習の最終的な目的とは異なります。したがって、表現学習モデルをトレーニングするための明確で直接的な目的を確立することは非常に困難です。優れた表現は根本的な変動要因を解きほぐす必要があると主張されてきましたが、これをトレーニングの目的に変換する方法は不明なままです。この論文は、優れた表現学習モデルを開発するための直接的なトレーニング基準と設計原理を確立する試みを提示します。優れた表現学習モデルは、最大限に表現力がある、つまり、入力構成の最大数を区別できる必要があることを提案します。表現力を正式に定義し、一般的な学習モデルの最大表現力（MEXS）の定理を導入します。モデルの滑らかさなどの一般的な優先順位を組み込むと同時に、表現力を最大化することによってモデルをトレーニングすることを提案します。モデルがMEXSに到達するのを促進すると同時に、モデルの滑らかさを事前に順守する良心の競争力のある学習アルゴリズムを提示します。また、類似したサンプルに一貫性のあるラベルを割り当てるように促すことで、モデルの滑らかさを高めるラベル一貫性トレーニング（LCT）手法を紹介します。私たちの方法が、最先端の表現と同等またはそれ以上の表現を開発できる表現学習モデルを実際に設計できることを示すために、広範な実験結果を提示します。また、私たちの手法が計算効率が高く、さまざまなパラメーター設定に対して堅牢であり、さまざまなデータセットで効果的に機能できることも示しています。 https://github.com/qlilx/odgrlm.gitで入手可能なコード

The goal of representation learning is different from the ultimate objective of machine learning such as decision making, it is therefore very difficult to establish clear and direct objectives for training representation learning models. It has been argued that a good representation should disentangle the underlying variation factors, yet how to translate this into training objectives remains unknown. This paper presents an attempt to establish direct training criterions and design principles for developing good representation learning models. We propose that a good representation learning model should be maximally expressive, i.e., capable of distinguishing the maximum number of input configurations. We formally define expressiveness and introduce the maximum expressiveness (MEXS) theorem of a general learning model. We propose to train a model by maximizing its expressiveness while at the same time incorporating general priors such as model smoothness. We present a conscience competitive learning algorithm which encourages the model to reach its MEXS whilst at the same time adheres to model smoothness prior. We also introduce a label consistent training (LCT) technique to boost model smoothness by encouraging it to assign consistent labels to similar samples. We present extensive experimental results to show that our method can indeed design representation learning models capable of developing representations that are as good as or better than state of the art. We also show that our technique is computationally efficient, robust against different parameter settings and can work effectively on a variety of datasets. Code available at https://github.com/qlilx/odgrlm.git

updated: Fri Aug 06 2021 04:48:59 GMT+0000 (UTC)

published: Tue Jul 13 2021 09:39:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト