Probing the Effect of Selection Bias on NN Generalization with a Thought Experiment

John K. Tsotsos; Jun Luo

思考実験によるNN一般化に対する選択バイアスの影響の調査

視覚認識と認知の領域で学習したネットワークは、可能な画像の全母集団よりも桁違いに小さいデータセットでトレーニングされているにもかかわらず、新しいデータやこれまでに見られなかったデータに適用できる十分な一般化を示しているため、部分的に印象的です。多くの人が一般化に関する問題をいくつかの観点から検討しましたが、ネットワークが、ある定義ドメイン属性に対応する特定のサンプルを見逃す偏ったデータセットでトレーニングされている場合、そのトレーニングデータセットが抽出された完全なドメインに一般化できるでしょうか？確かに、視覚では、現在のトレーニングセットがすべての視覚情報を完全にキャプチャするわけではなく、これが選択バイアスにつながる可能性があります。ここでは、思考実験の伝統に基づいた新しいアプローチを試みます。この思考実験は、トレーニングデータの特定のギャップと、パフォーマンス要件への影響を完全に特徴付けて確認できるビジュアルオブジェクトの実際のドメインで実行します。私たちの思考実験は、3つの結論を示しています。1つは、一般化の動作は、トレーニング中にドメインの特定の次元がどれだけ十分に表現されるかに依存するということです。第二に、一般化の効用は、許容できるシステムエラーに完全に依存していること。第3に、イメージング平面からのポーズの向きや色など、オブジェクトの特定の視覚的特徴は、トレーニングセットで十分に表現されていない場合、回復できない可能性があります。現代の深層学習ネットワークで現在観察されている一般化は、偶然の調整の結果である可能性があり、その有用性はシステムのパフォーマンス仕様に関して確認する必要があります。結果として生じるバイアスの内訳と相まって、私たちの思考実験プローブのアプローチは、バイアスの影響を理解するのに非常に有益です。

Learned networks in the domain of visual recognition and cognition impress in part because even though they are trained with datasets many orders of magnitude smaller than the full population of possible images, they exhibit sufficient generalization to be applicable to new and previously unseen data. Although many have examined issues regarding generalization from several perspectives, we wondered If a network is trained with a biased dataset that misses particular samples corresponding to some defining domain attribute, can it generalize to the full domain from which that training dataset was extracted? It is certainly true that in vision, no current training set fully captures all visual information and this may lead to Selection Bias. Here, we try a novel approach in the tradition of the Thought Experiment. We run this thought experiment on a real domain of visual objects that we can fully characterize and look at specific gaps in training data and their impact on performance requirements. Our thought experiment points to three conclusions: first, that generalization behavior is dependent on how sufficiently the particular dimensions of the domain are represented during training; second, that the utility of any generalization is completely dependent on the acceptable system error; and third, that specific visual features of objects, such as pose orientations out of the imaging plane or colours, may not be recoverable if not represented sufficiently in a training set. Any currently observed generalization in modern deep learning networks may be more the result of coincidental alignments and whose utility needs to be confirmed with respect to a system's performance specification. Our Thought Experiment Probe approach, coupled with the resulting Bias Breakdown can be very informative towards understanding the impact of biases.

updated: Thu May 20 2021 17:54:48 GMT+0000 (UTC)

published: Thu May 20 2021 17:54:48 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト