Label-similarity Curriculum Learning

Urun Dogan; Aniket Anand Deshmukh; Marcin Machura; Christian Igel

Curriculum learning can improve neural network training by guiding the optimization to desirable optima. We propose a novel curriculum learning approach for image classification that adapts the loss function by changing the label representation. The idea is to use a probability distribution over classes as the target label, where the class probabilities reflect the similarity to the true class. Gradually, this label representation is shifted towards the standard one-hot encoding. That is, in the beginning, minor mistakes are corrected less than large mistakes, resembling a teaching process in which broad concepts are explained first before subtle differences are taught. The class similarity can be based on prior knowledge. For the special case of the labels being natural words, we propose a generic way to automatically compute the similarities. The natural words are embedded into Euclidean space using a standard word embedding. The probability of each class is then a function of the cosine similarity between the vector representations of the class and the true label. The proposed label-similarity curriculum learning (LCL) approach was empirically evaluated using several popular deep learning architectures for image classification tasks applied to five datasets including ImageNet, CIFAR100, and AWA2. In all scenarios, LCL was able to improve the classification accuracy on the test data compared to standard training.
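
The label construction described in the abstract can be illustrated with a minimal sketch. The abstract only says that each class probability is "a function of the cosine similarity" and that the targets shift "gradually" towards one-hot; it does not give the function or the schedule. The softmax with a `temperature` parameter, the linear blend controlled by `progress`, the toy embedding vectors, and all function names below are therefore assumptions chosen for illustration, not the authors' formulation.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def similarity_targets(embeddings, true_class, temperature=0.5):
    """Soft target over all classes, based on similarity to the true class.

    ASSUMPTION: a softmax over cosine similarities is used here; the
    abstract only requires *some* function of the cosine similarity.
    """
    true_vec = embeddings[true_class]
    sims = [cosine_similarity(vec, true_vec) for vec in embeddings]
    weights = [math.exp(s / temperature) for s in sims]
    total = sum(weights)
    return [w / total for w in weights]


def curriculum_targets(embeddings, true_class, progress, temperature=0.5):
    """Blend the soft target with the one-hot target.

    ASSUMPTION: a linear blend; progress=0.0 gives the pure similarity
    target, progress=1.0 gives the standard one-hot encoding.
    """
    progress = min(max(progress, 0.0), 1.0)
    soft = similarity_targets(embeddings, true_class, temperature)
    return [
        (1.0 - progress) * p + progress * (1.0 if k == true_class else 0.0)
        for k, p in enumerate(soft)
    ]


# Made-up 3-dimensional "word embeddings" for three class names.
toy_embeddings = [
    [1.0, 0.1, 0.0],  # "cat"
    [0.9, 0.3, 0.0],  # "dog"
    [0.0, 0.1, 1.0],  # "truck"
]

if __name__ == "__main__":
    for progress in (0.0, 0.5, 1.0):
        target = curriculum_targets(toy_embeddings, true_class=0, progress=progress)
        print(progress, [round(p, 3) for p in target])
```

With the true class "cat", the early targets give "dog" more probability mass than "truck", so confusing a cat with a dog is penalized less than confusing it with a truck. As `progress` approaches 1.0, the target becomes the usual one-hot vector. In practice the embeddings would come from a pretrained word embedding rather than hand-written vectors.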

updated: Thu Jul 23 2020 00:48:48 GMT+0000 (UTC)

published: Fri Nov 15 2019 23:03:58 GMT+0000 (UTC)

arXiv
