SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and More

Tianrun Chen; Lanyun Zhu; Chaotao Ding; Runlong Cao; Yan Wang; Zejian Li; Lingyun Sun; Papa Mao; Ying Zang

SAM は何かをセグメント化できませんか? -- SAM-Adapter: パフォーマンスの低いシーンでの SAM の適応: カモフラージュ、影、医療画像セグメンテーションなど

基礎モデルとしても知られる大規模モデルの出現は、AI 研究に大きな進歩をもたらしました。そのようなモデルの 1 つが、画像セグメンテーションタスク用に設計された Segment Anything (SAM) です。ただし、他の基盤モデルと同様に、私たちの実験結果は、SAM が影の検出やカモフラージュされたオブジェクトの検出 (隠れたオブジェクトの検出) などの特定のセグメンテーションタスクで失敗するか、パフォーマンスが低下する可能性があることを示唆しています。この研究は、SAM のパフォーマンスが低い状況でも、大規模な事前トレーニング済みの画像セグメンテーションモデル SAM をこれらのダウンストリームタスクに適用する道を開きます。 SAM ネットワークを微調整するのではなく、シンプルで効果的なアダプターを使用して、ドメイン固有の情報または視覚的なプロンプトをセグメンテーションネットワークに組み込む SAM アダプターを提案します。大規模なモデルによって学習された一般的な知識とタスク固有の知識を統合することにより、SAM-Adapter は、広範な実験で示されているように、困難なタスクにおける SAM のパフォーマンスを大幅に向上させることができます。タスク固有のネットワークモデルよりも優れたパフォーマンスを発揮し、テストしたタスク (カモフラージュされたオブジェクトの検出、影の検出) で最先端のパフォーマンスを達成することさえできます。また、ポリープセグメンテーション (医療画像セグメンテーション) もテストし、より良い結果を達成しました。私たちの研究は、医療用画像処理、農業、リモートセンシングなど、さまざまな分野での潜在的なアプリケーションを使用して、ダウンストリームタスクで SAM を利用する機会を開くと信じています。

The emergence of large models, also known as foundation models, has brought significant advancements to AI research. One such model is Segment Anything (SAM), which is designed for image segmentation tasks. However, as with other foundation models, our experimental findings suggest that SAM may fail or perform poorly in certain segmentation tasks, such as shadow detection and camouflaged object detection (concealed object detection). This study first paves the way for applying the large pre-trained image segmentation model SAM to these downstream tasks, even in situations where SAM performs poorly. Rather than fine-tuning the SAM network, we propose SAM-Adapter, which incorporates domain-specific information or visual prompts into the segmentation network by using simple yet effective adapters. By integrating task-specific knowledge with general knowledge learnt by the large model, SAM-Adapter can significantly elevate the performance of SAM in challenging tasks as shown in extensive experiments. We can even outperform task-specific network models and achieve state-of-the-art performance in the task we tested: camouflaged object detection, shadow detection. We also tested polyp segmentation (medical image segmentation) and achieves better results. We believe our work opens up opportunities for utilizing SAM in downstream tasks, with potential applications in various fields, including medical image processing, agriculture, remote sensing, and more.

updated: Tue May 02 2023 17:06:51 GMT+0000 (UTC)

published: Tue Apr 18 2023 17:38:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト