HashEncoding: Autoencoding with Multiscale Coordinate Hashing

Lukas Zhornyak; Zhengjie Xu; Haoran Tang; Jianbo Shi

We present HashEncoding, a novel autoencoding architecture that uses a non-parametric multiscale coordinate hash function to facilitate a per-pixel decoder without convolutions. By exploiting the space-folding behaviour of hash functions, HashEncoding allows for an inherently multiscale embedding space that remains much smaller than the original image. As a result, the decoder requires very few parameters compared with decoders in traditional autoencoders, approaching a non-parametric reconstruction of the original image and allowing for greater generalizability. Finally, by allowing backpropagation directly to the coordinate space, we show that HashEncoding can be applied to geometric tasks such as optical flow.
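
To make the idea of multiscale coordinate hashing concrete, the following is a minimal NumPy sketch of a generic hashed multi-resolution lookup. It is not taken from the paper: the function names, table sizes, resolutions, and the XOR-with-prime hash are all assumptions chosen for illustration, and the random tables stand in for embeddings that an encoder would produce.

```python
import numpy as np

# Assumed hash constants (a commonly used spatial-hash prime); not from the paper.
PRIMES = np.array([1, 2654435761], dtype=np.uint64)

def hash_coords(ij, table_size):
    """Map non-negative integer 2-D grid coordinates (..., 2) to [0, table_size)."""
    ij = ij.astype(np.uint64)
    h = (ij[..., 0] * PRIMES[0]) ^ (ij[..., 1] * PRIMES[1])
    return (h % np.uint64(table_size)).astype(np.int64)

def multiscale_lookup(xy, tables, resolutions):
    """Concatenate one feature vector per scale for points xy in [0, 1)^2."""
    feats = []
    for table, res in zip(tables, resolutions):
        # Cell containing each point at this resolution.
        ij = np.floor(xy * res).astype(np.int64)
        # Many cells share one table row: this is the "space folding".
        idx = hash_coords(ij, table.shape[0])
        feats.append(table[idx])
    return np.concatenate(feats, axis=-1)

rng = np.random.default_rng(0)
resolutions = [4, 16, 64]                                   # hypothetical scales
tables = [rng.normal(size=(32, 2)) for _ in resolutions]    # stand-in embeddings
xy = rng.random((5, 2))                                     # 5 pixel coordinates
features = multiscale_lookup(xy, tables, resolutions)       # shape (5, 6)
```

In this sketch, the finest level has 64 x 64 = 4096 cells but only 32 table rows, so the embedding stays much smaller than the grid it indexes. A small per-pixel decoder would take `features` as input. Note that the floor lookup is piecewise constant in `xy`; interpolating between neighbouring grid vertices would be needed to obtain useful gradients with respect to coordinates.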

updated: Tue Nov 29 2022 03:22:19 GMT+0000 (UTC)

published: Tue Nov 29 2022 03:22:19 GMT+0000 (UTC)

arXiv
