PAL : Pretext-based Active Learning

Shubhang Bhatnagar; Sachin Goyal; Darshan Tank; Amit Sethi

PAL：口実ベースの能動学習

プールベースのアクティブラーニングの目標は、教師あり学習者の精度を最大化するために、プールからラベルなしサンプルの固定サイズのサブセットを慎重に選択して、オラクルにラベルを照会することです。ただし、オラクルが常に正しいラベルを割り当てる必要があるという前述の要件は、ほとんどの状況で不合理です。以前に提案された手法よりも誤ラベル付けに対してロバストなディープニューラルネットワークのアクティブラーニング手法を提案します。以前の手法は、ラベルのないサンプルの新規性を推定するためにタスクネットワーク自体に依存していますが、タスクの学習（一般化）とサンプルの選択（分布外検出）は、相反する目標になる可能性があります。別のネットワークを使用して、ラベルのないサンプルをスコアリングして選択します。スコアリングネットワークは、潜在的にノイズの多いラベルへの依存を減らすために、ラベル付けされたサンプルの分布をモデル化するための自己監視に依存しています。データの不足に対処するために、マルチタスク学習による正則化のためにスコアリングネットワークに別のヘッドを展開し、通常とは異なる自己バランス型ハイブリッドスコアリング関数を使用します。さらに、ラベル付けの前に各クエリをサブクエリに分割して、クエリに多様なサンプルがあることを確認します。オラクルによるサンプルの誤ったラベル付けに対する耐性が高いことに加えて、結果として得られる手法は、ラベルノイズがない場合でも競争力のある精度を生み出します。この手法では、これらのクラスのサンプリングレートを一時的に上げることにより、新しいクラスの導入をオンザフライで適切に処理します。

The goal of pool-based active learning is to judiciously select a fixed-sized subset of unlabeled samples from a pool to query an oracle for their labels, in order to maximize the accuracy of a supervised learner. However, the unsaid requirement that the oracle should always assign correct labels is unreasonable for most situations. We propose an active learning technique for deep neural networks that is more robust to mislabeling than the previously proposed techniques. Previous techniques rely on the task network itself to estimate the novelty of the unlabeled samples, but learning the task (generalization) and selecting samples (out-of-distribution detection) can be conflicting goals. We use a separate network to score the unlabeled samples for selection. The scoring network relies on self-supervision for modeling the distribution of the labeled samples to reduce the dependency on potentially noisy labels. To counter the paucity of data, we also deploy another head on the scoring network for regularization via multi-task learning and use an unusual self-balancing hybrid scoring function. Furthermore, we divide each query into sub-queries before labeling to ensure that the query has diverse samples. In addition to having a higher tolerance to mislabeling of samples by the oracle, the resultant technique also produces competitive accuracy in the absence of label noise. The technique also handles the introduction of new classes on-the-fly well by temporarily increasing the sampling rate of these classes.

updated: Sun Mar 28 2021 21:04:37 GMT+0000 (UTC)

published: Thu Oct 29 2020 21:16:37 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト