Understanding and Predicting the Memorability of Outdoor Natural Scenes

Jiaxin Lu; Mai Xu; Ren Yang; Zulin Wang

Memorability measures how easily an image is remembered after a single glance, which may help in designing magazine covers, tourism publicity materials, and so forth. Recent works have shed light on the visual features that make generic images, object images, or face photographs memorable. However, these methods cannot effectively predict the memorability of outdoor natural scene images. To overcome this shortcoming of previous works, in this paper we attempt to answer the question: "What exactly makes outdoor natural scenes memorable?" To this end, we first establish a large-scale outdoor natural scene image memorability (LNSIM) database, containing 2,632 outdoor natural scene images with their ground-truth memorability scores and multi-label scene category annotations. Then, similar to previous works, we mine our database to investigate how low-, middle-, and high-level handcrafted features affect the memorability of outdoor natural scenes. In particular, we find that the high-level feature of scene category is considerably correlated with outdoor natural scene memorability, and that the deep features learned by a deep neural network (DNN) are also effective in predicting the memorability scores. Moreover, combining the deep features with the category feature can further boost the performance of memorability prediction. Therefore, we propose an end-to-end DNN-based outdoor natural scene memorability (DeepNSM) predictor, which takes advantage of the learned category-related features. The experimental results validate the effectiveness of our DeepNSM model, which exceeds the state-of-the-art methods. Finally, we try to understand the reasons for the good performance of our DeepNSM model, and also study the cases in which it succeeds or fails to accurately predict the memorability of outdoor natural scenes.
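The idea of combining deep features with scene-category features can be illustrated with a minimal sketch. This is not the DeepNSM architecture, which the abstract describes only as an end-to-end DNN. As a stand-in, the sketch concatenates a per-image feature vector with a multi-label category indicator vector and fits a closed-form ridge regressor to the memorability scores. The function names (`fuse_features`, `fit_ridge`, `predict`), the ridge model, and the `alpha` parameter are all assumptions made for illustration.

```python
import numpy as np


def fuse_features(deep_feats, category_labels):
    """Concatenate deep features with multi-label category indicators.

    deep_feats:      (n_images, n_deep) array, e.g. activations of a DNN layer.
    category_labels: (n_images, n_categories) array of 0/1 indicators.
    Both inputs are hypothetical placeholders, not the LNSIM data format.
    """
    return np.concatenate([deep_feats, category_labels], axis=1)


def _add_bias(X):
    # Append a constant column so the model can learn an intercept.
    return np.hstack([X, np.ones((X.shape[0], 1))])


def fit_ridge(X, y, alpha=1.0):
    """Closed-form ridge regression (an assumed stand-in for the DNN regressor)."""
    Xb = _add_bias(X)
    penalty = alpha * np.eye(Xb.shape[1])
    penalty[-1, -1] = 0.0  # leave the intercept unpenalized
    return np.linalg.solve(Xb.T @ Xb + penalty, Xb.T @ y)


def predict(X, weights):
    """Predict memorability scores for the rows of X."""
    return _add_bias(X) @ weights
```

With real data, one would fit one model on the deep features alone and another on the fused features, then compare the two predictions against the ground-truth memorability scores on held-out images.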

updated: Sat Feb 29 2020 10:19:37 GMT+0000 (UTC)

published: Tue Oct 09 2018 09:25:07 GMT+0000 (UTC)

arXiv
