Exploiting Local Indexing and Deep Feature Confidence Scores for Fast Image-to-Video Search

Savas Ozkan; Gozde Bozdagi Akar

画像からビデオへの高速検索のためのローカルインデックス作成と深い特徴信頼性スコアの活用

費用効果の高い視覚的表現と高速な例によるクエリ検索は、中程度のハードウェアでのWebスケールの視覚的検索タスクのために維持する必要がある2つの挑戦的な目標です。このホワイトペーパーでは、画像からビデオへの検索シナリオで最先端のパフォーマンスを実現することにより、これらの目標の両方を保証する高速で堅牢な方法を紹介します。したがって、より速く、より良く、適度な検索パフォーマンスを促進することにより、よく知られている索引付けおよび視覚的表現技術に対する重要な機能強化を提示します。また、クエリ時にローカル記述子とグローバル記述子の個々の決定を活用することにより、いくつかの視覚的な課題に対するメソッドの優位性を高めます。たとえば、ローカルコンテンツ記述子は、スケール、方向、アフィン変換などの大きな幾何学的変形を伴うコピー/複製されたシーンを表します。対照的に、グローバルコンテンツ記述子の使用は、ほぼ重複したセマンティック検索に対してより実用的です。実験は、大規模なスタンフォードI2Vデータセットで実施されます。実験結果は、ローカル表現とグローバル表現が一緒に使用されている場合でも、私たちの方法が大規模な視覚的検索シナリオの複雑さとクエリ処理時間の点で有用であることを示しています。提案された方法は優れており、このデータセットの平均平均精度（MAP）スコアに基づいて最先端のパフォーマンスを実現します。最後に、提案手法の検索結果で明らかになった地上注釈を更新した後、追加のMAPスコアを報告し、実際の性能を示しています。

The cost-effective visual representation and fast query-by-example search are two challenging goals that should be maintained for web-scale visual retrieval tasks on moderate hardware. This paper introduces a fast and robust method that ensures both of these goals by obtaining state-of-the-art performance for an image-to-video search scenario. Hence, we present critical enhancements to well-known indexing and visual representation techniques by promoting faster, better and moderate retrieval performance. We also boost the superiority of our method for some visual challenges by exploiting individual decisions of local and global descriptors at query time. For instance, local content descriptors represent copied/duplicated scenes with large geometric deformations such as scale, orientation and affine transformation. In contrast, the use of global content descriptors is more practical for near-duplicate and semantic searches. Experiments are conducted on a large-scale Stanford I2V dataset. The experimental results show that our method is useful in terms of complexity and query processing time for large-scale visual retrieval scenarios, even if local and global representations are used together. The proposed method is superior and achieves state-of-the-art performance based on the mean average precision (MAP) score of this dataset. Lastly, we report additional MAP scores after updating the ground annotations unveiled by retrieval results of the proposed method, and it shows that the actual performance.

updated: Sat Dec 12 2020 14:42:35 GMT+0000 (UTC)

published: Fri Aug 03 2018 07:29:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト