Bridging Gap between Image Pixels and Semantics via Supervision: A Survey

Jiali Duan; C. -C. Jay Kuo

監視による画像ピクセルとセマンティクス間のギャップの橋渡し：調査

低レベルの機能と画像のセマンティックな意味の間にセマンティックギャップと呼ばれるギャップが存在するという事実は、何十年も前から知られています。セマンティックギャップの解決は、長年の問題です。セマンティックギャップの問題をレビューし、ギャップを埋める最近の取り組みに関する調査をこの作業で行います。最も重要なことは、セマンティックギャップは主に今日の教師あり学習によって埋められていると主張しています。この点を説明するために、2つのアプリケーションドメインから経験を引き出します。1）オブジェクト検出と2）コンテンツベースの画像検索（CBIR）のメトリック学習。まず、このペーパーでは、監督に関する歴史的な回顧展を提供し、最新のデータ駆動型手法に段階的に移行し、一般的に使用されるデータセットを紹介します。次に、オブジェクト検出とメトリック学習のコンテキストでセマンティックギャップを埋めるためのさまざまな監視方法を要約します。

The fact that there exists a gap between low-level features and semantic meanings of images, called the semantic gap, is known for decades. Resolution of the semantic gap is a long standing problem. The semantic gap problem is reviewed and a survey on recent efforts in bridging the gap is made in this work. Most importantly, we claim that the semantic gap is primarily bridged through supervised learning today. Experiences are drawn from two application domains to illustrate this point: 1) object detection and 2) metric learning for content-based image retrieval (CBIR). To begin with, this paper offers a historical retrospective on supervision, makes a gradual transition to the modern data-driven methodology and introduces commonly used datasets. Then, it summarizes various supervision methods to bridge the semantic gap in the context of object detection and metric learning.

updated: Thu Jul 29 2021 05:55:40 GMT+0000 (UTC)

published: Thu Jul 29 2021 05:55:40 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト