A Dive into SAM Prior in Image Restoration

Zeyu Xiao; Jiawang Bai; Zhihe Lu; Zhiwei Xiong

イメージ復元における SAM の詳細

コンピュータビジョンの基本的な問題である画像復元 (IR) の目標は、劣化した低品質 (LQ) 観察から高品質 (HQ) 画像を復元することです。この適切に設定されていない問題では、複数の HQ 解が 1 つの LQ 入力に対応する可能性があり、曖昧な解空間が作成されます。これにより、解空間を効果的に制限し、復元された画像の品質を向上させるために、事前の知識を調査して組み込むことが動機付けられます。 IR では手作りおよび学習された事前分布が広く使用されているにもかかわらず、大規模な基礎モデルからの知識の組み込みには限定的な注意が払われてきました。このペーパーでは、最先端のセグメント何でもモデル (SAM) の事前知識を初めて利用して、パラメータ効率の高い調整方法で既存の IR ネットワークのパフォーマンスを向上させます。特に、SAM の選択は、HQ セマンティックマスクを SAM から抽出できるように、画像劣化に対する堅牢性に基づいています。セマンティック事前調整を活用して復元品質を向上させるために、軽量の SAM 事前調整 (SPT) ユニットを提案します。このプラグアンドプレイコンポーネントにより、セマンティック事前情報を既存の IR ネットワークに効果的に統合できるようになり、復元品質が大幅に向上します。 SPT ユニットは、私たちのメソッドで唯一トレーニング可能なモジュールとして、効率とスケーラビリティの両方を向上させる可能性があります。画像の超解像度やカラー画像のノイズ除去など、複数のタスクにわたるさまざまな方法を強化する際の、提案された方法の有効性を実証します。

The goal of image restoration (IR), a fundamental issue in computer vision, is to restore a high-quality (HQ) image from its degraded low-quality (LQ) observation. Multiple HQ solutions may correspond to an LQ input in this poorly posed problem, creating an ambiguous solution space. This motivates the investigation and incorporation of prior knowledge in order to effectively constrain the solution space and enhance the quality of the restored images. In spite of the pervasive use of hand-crafted and learned priors in IR, limited attention has been paid to the incorporation of knowledge from large-scale foundation models. In this paper, we for the first time leverage the prior knowledge of the state-of-the-art segment anything model (SAM) to boost the performance of existing IR networks in an parameter-efficient tuning manner. In particular, the choice of SAM is based on its robustness to image degradations, such that HQ semantic masks can be extracted from it. In order to leverage semantic priors and enhance restoration quality, we propose a lightweight SAM prior tuning (SPT) unit. This plug-and-play component allows us to effectively integrate semantic priors into existing IR networks, resulting in significant improvements in restoration quality. As the only trainable module in our method, the SPT unit has the potential to improve both efficiency and scalability. We demonstrate the effectiveness of the proposed method in enhancing a variety of methods across multiple tasks, such as image super-resolution and color image denoising.

updated: Tue May 23 2023 02:31:06 GMT+0000 (UTC)

published: Tue May 23 2023 02:31:06 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト