Pixel VQ-VAEs for Improved Pixel Art Representation

Akash Saravanan; Matthew Guzdial

ピクセルアート表現を改善するためのピクセルVQ-VAE

機械学習は、画像処理で大きな成功を収めています。ただし、この作業の焦点は主にリアルな画像にあり、ピクセルアートなどのよりニッチなアートスタイルは無視されています。さらに、ピクセルのグループに焦点を当てた多くの従来の機械学習モデルは、個々のピクセルが重要であるピクセルアートではうまく機能しません。ピクセルアートの表現を学習する特殊なVQ-VAEモデルであるPixelVQ-VAEを提案します。埋め込みの品質とダウンストリームタスクのパフォーマンスの両方で、他のモデルよりも優れていることを示します。

Machine learning has had a great deal of success in image processing. However, the focus of this work has largely been on realistic images, ignoring more niche art styles such as pixel art. Additionally, many traditional machine learning models that focus on groups of pixels do not work well with pixel art, where individual pixels are important. We propose the Pixel VQ-VAE, a specialized VQ-VAE model that learns representations of pixel art. We show that it outperforms other models in both the quality of embeddings as well as performance on downstream tasks.

updated: Wed Sep 21 2022 20:42:00 GMT+0000 (UTC)

published: Wed Mar 23 2022 01:47:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト