Neural Implicit Dictionary via Mixture-of-Expert Training

Peihao Wang; Zhiwen Fan; Tianlong Chen; Zhangyang Wang

専門家の混合トレーニングによる神経暗黙辞書

座標ベースの深く完全に接続されたネットワークによる視覚信号の表現は、離散グリッドベースの表現よりも複雑な詳細をフィッティングし、逆問題を解決するのに有利であることが示されています。ただし、このような継続的な暗黙的ニューラル表現（INR）を取得するには、大量の信号測定に関する面倒なシーンごとのトレーニングが必要であり、その実用性が制限されます。この論文では、データ収集からニューラル暗黙辞書（NID）を学習し、辞書からサンプリングされた基礎の機能的な組み合わせとしてINRを表すことにより、データとトレーニング効率の両方を実現する一般的なINRフレームワークを紹介します。 NIDは、目的の関数空間にまたがるように調整された座標ベースのサブネットワークのグループを組み立てます。トレーニング後、コーディング係数を解くことにより、目に見えないシーン表現を即座に確実に取得できます。ネットワークの大規模なグループを並行して最適化するために、Mixture-of-Expert（MoE）からアイデアを借りて、スパースゲーティングメカニズムを使用してネットワークを設計およびトレーニングします。私たちの実験によると、NIDは、入力データを最大98％削減し、2D画像または3Dシーンの再構成を2桁高速化できます。さらに、画像の修復とオクルージョンの除去におけるNIDのさまざまなアプリケーションを示します。これらは、バニラINRでは困難であると考えられています。私たちのコードはhttps://github.com/VITA-Group/Neural-Implicit-Dictで入手できます。

Representing visual signals by coordinate-based deep fully-connected networks has been shown advantageous in fitting complex details and solving inverse problems than discrete grid-based representation. However, acquiring such a continuous Implicit Neural Representation (INR) requires tedious per-scene training on tons of signal measurements, which limits its practicality. In this paper, we present a generic INR framework that achieves both data and training efficiency by learning a Neural Implicit Dictionary (NID) from a data collection and representing INR as a functional combination of basis sampled from the dictionary. Our NID assembles a group of coordinate-based subnetworks which are tuned to span the desired function space. After training, one can instantly and robustly acquire an unseen scene representation by solving the coding coefficients. To parallelly optimize a large group of networks, we borrow the idea from Mixture-of-Expert (MoE) to design and train our network with a sparse gating mechanism. Our experiments show that, NID can improve reconstruction of 2D images or 3D scenes by 2 orders of magnitude faster with up to 98% less input data. We further demonstrate various applications of NID in image inpainting and occlusion removal, which are considered to be challenging with vanilla INR. Our codes are available in https://github.com/VITA-Group/Neural-Implicit-Dict.

updated: Fri Jul 08 2022 05:07:19 GMT+0000 (UTC)

published: Fri Jul 08 2022 05:07:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト