ST-CoNAL: Consistency-Based Acquisition Criterion Using Temporal Self-Ensemble for Active Learning

Jae Soon Baik; In Young Yoon; Jun Won Choi

Modern deep learning has achieved great success in various fields. However, it requires labeling huge amounts of data, which is expensive and labor-intensive. Active learning (AL), which identifies the most informative samples to be labeled, is becoming increasingly important for maximizing the efficiency of the training process. Most existing AL methods use only a single, final, fixed model to acquire the samples to be labeled. This strategy may be insufficient because it does not account for the structural uncertainty of the model given the training data when acquiring samples. In this study, we propose a novel acquisition criterion based on a temporal self-ensemble generated by conventional stochastic gradient descent (SGD) optimization. The self-ensemble models are obtained by capturing the intermediate network weights produced during SGD iterations. Our acquisition function relies on a consistency measure between the student and teacher models. A fixed number of temporal self-ensemble models serve as the student models, and the teacher model is constructed by averaging the weights of the student models. Using the proposed acquisition criterion, we present an AL algorithm, student-teacher consistency-based AL (ST-CoNAL). Experiments on image classification tasks with the CIFAR-10, CIFAR-100, Caltech-256, and Tiny ImageNet datasets demonstrate that ST-CoNAL achieves significantly better performance than existing acquisition methods. Furthermore, extensive experiments show the robustness and effectiveness of our method.
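The acquisition idea in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not state: the consistency measure is taken here to be the KL divergence from teacher to student predictions, a linear softmax classifier stands in for the deep network, and all function names (`predict`, `average_weights`, `inconsistency_scores`, `select_for_labeling`) are hypothetical. It is not the authors' implementation.

```python
import numpy as np


def softmax(z):
    # Numerically stable softmax over the class axis.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def predict(weights, x):
    # Class probabilities of a linear softmax classifier.
    # Assumption: this is a stand-in for a deep network.
    return softmax(x @ weights)


def average_weights(student_weights):
    # Teacher weights = element-wise mean of the student snapshots.
    return np.mean(np.stack(student_weights), axis=0)


def inconsistency_scores(student_weights, x_unlabeled, eps=1e-12):
    # Mean KL(teacher || student) per unlabeled sample.
    # Assumption: KL divergence is used as the consistency measure.
    teacher = average_weights(student_weights)
    p_teacher = predict(teacher, x_unlabeled)
    scores = np.zeros(len(x_unlabeled))
    for w in student_weights:
        p_student = predict(w, x_unlabeled)
        log_ratio = np.log(p_teacher + eps) - np.log(p_student + eps)
        scores += np.sum(p_teacher * log_ratio, axis=1)
    return scores / len(student_weights)


def select_for_labeling(student_weights, x_unlabeled, budget):
    # Pick the `budget` samples on which students and teacher disagree most.
    scores = inconsistency_scores(student_weights, x_unlabeled)
    return np.argsort(-scores)[:budget]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x_pool = rng.normal(size=(100, 8))   # unlabeled pool: 100 samples, 8 features
    base = rng.normal(size=(8, 10))      # 10 classes
    # Simulated SGD snapshots: small perturbations of a common weight matrix.
    snapshots = [base + 0.1 * rng.normal(size=base.shape) for _ in range(5)]
    print(select_for_labeling(snapshots, x_pool, budget=10))
```

In a real training loop the snapshots would be the weights saved at intermediate SGD iterations rather than random perturbations; the perturbations here only make the example self-contained.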

updated: Tue Jul 05 2022 17:25:59 GMT+0000 (UTC)

published: Tue Jul 05 2022 17:25:59 GMT+0000 (UTC)

arXiv
