ALADIN: All Layer Adaptive Instance Normalization for Fine-grained Style Similarity

Dan Ruta; Saeid Motiian; Baldo Faieta; Zhe Lin; Hailin Jin; Alex Filipkowski; Andrew Gilbert; John Collomosse

ALADIN：きめ細かいスタイルの類似性のための全層適応インスタンス正規化

ALADIN（All Layer AdaIN）を紹介します。それらの芸術的なスタイルの類似性に基づいて画像を検索するための新しいアーキテクチャ。表現学習は視覚探索にとって重要であり、学習した検索埋め込みの距離は画像の類似性を反映します。スタイルの定義とラベル付けが難しいため、スタイルのきめ細かいバリエーションを区別する埋め込みを学ぶのは困難です。 ALADINは、ウェブから収集されたユーザー生成コンテンツグループの新しい大規模データセットであるBAM-FGを活用して、デジタルアートワークのきめ細かいスタイルの類似性の表現を学習するための弱く監視されたアプローチを採用しています。 ALADINは、粗いラベルの付いたスタイルデータ（BAM）とBAM-FGの両方に対して、スタイルベースの視覚的検索に新しい最先端の精度を設定します。この作業によって、31万のきめ細かいスタイルグループの新しい262万の画像データセットも提供されました。

We present ALADIN (All Layer AdaIN); a novel architecture for searching images based on the similarity of their artistic style. Representation learning is critical to visual search, where distance in the learned search embedding reflects image similarity. Learning an embedding that discriminates fine-grained variations in style is hard, due to the difficulty of defining and labelling style. ALADIN takes a weakly supervised approach to learning a representation for fine-grained style similarity of digital artworks, leveraging BAM-FG, a novel large-scale dataset of user generated content groupings gathered from the web. ALADIN sets a new state of the art accuracy for style-based visual search over both coarse labelled style data (BAM) and BAM-FG; a new 2.62 million image dataset of 310,000 fine-grained style groupings also contributed by this work.

updated: Wed Mar 17 2021 17:03:27 GMT+0000 (UTC)

published: Wed Mar 17 2021 17:03:27 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト