Blocks World Revisited: The Effect of Self-Occlusion on Classification by Convolutional Neural Networks

Markus D. Solbach; John K. Tsotsos

Despite recent successes in computer vision, there remain new avenues to explore. In this work, we propose a new dataset to investigate the effect of self-occlusion on deep neural networks. With TEOS (The Effect of Self-Occlusion), we propose a 3D blocks world dataset that focuses on the geometric shape of 3D objects and their omnipresent challenge of self-occlusion. We designed TEOS to investigate the role of self-occlusion in the context of object classification. Even though remarkable progress has been made in object classification, self-occlusion of 3D objects in the real world still presents a significant challenge for deep learning approaches. Humans, however, deal with it by deploying complex strategies, for instance, changing the viewpoint or manipulating the scene to gather the necessary information. With TEOS, we present a dataset of two difficulty levels (L1 and L2), containing 36 and 12 objects, respectively. We provide 738 uniformly sampled views of each object, together with their masks, the object and camera position, orientation, and amount of self-occlusion, as well as the CAD model of each object. We present baseline evaluations with five well-known deep neural networks for classification and show that TEOS poses a significant challenge for all of them. The dataset and the pre-trained models are publicly available to the scientific community at https://nvision2.data.eecs.yorku.ca/TEOS.

updated: Thu Feb 25 2021 15:02:47 GMT+0000 (UTC)

published: Thu Feb 25 2021 15:02:47 GMT+0000 (UTC)

arXiv
