FoodSAM: Any Food Segmentation

Xing Lan; Jiayi Lyu; Hanyu Jiang; Kun Dong; Zehai Niu; Yi Zhang; Jian Xue

FoodSAM: あらゆる食品のセグメンテーション

このペーパーでは、食品画像セグメンテーションのためのセグメントエニシングモデル (SAM) のゼロショット機能を調査します。 SAM で生成されたマスクにクラス固有の情報が欠けていることに対処するために、私たちは FoodSAM と呼ばれる新しいフレームワークを提案します。この革新的なアプローチは、粗いセマンティックマスクを SAM で生成されたマスクと統合し、セマンティックセグメンテーションの品質を向上させます。さらに、食品の成分は独立した個体として想定できることを認識しており、これが食品画像に対してインスタンスのセグメンテーションを実行する動機となっています。さらに、FoodSAM は、FoodSAM をレンダリングして非食品オブジェクト情報を効果的にキャプチャするオブジェクト検出器を組み込むことにより、パノプティックセグメンテーションを包含するようにゼロショット機能を拡張します。プロンプト可能なセグメンテーションの最近の成功からインスピレーションを得て、私たちは FoodSAM をプロンプト可能なセグメンテーションにも拡張し、さまざまなプロンプトバリアントをサポートします。その結果、FoodSAM は、複数の粒度レベルで食品をセグメント化できる包括的なソリューションとして登場します。注目すべきことに、この先駆的なフレームワークは、食品画像のインスタンス、パノプティック、プロンプトのセグメンテーションを達成するための最初の研究として機能します。広範な実験により、FoodSAM の実現可能性と印象的なパフォーマンスが実証され、食品画像セグメンテーションの領域内で顕著かつ影響力のあるツールとしての SAM の可能性が検証されました。コードは https://github.com/jamesjg/FoodSAM でリリースされています。

In this paper, we explore the zero-shot capability of the Segment Anything Model (SAM) for food image segmentation. To address the lack of class-specific information in SAM-generated masks, we propose a novel framework, called FoodSAM. This innovative approach integrates the coarse semantic mask with SAM-generated masks to enhance semantic segmentation quality. Besides, we recognize that the ingredients in food can be supposed as independent individuals, which motivated us to perform instance segmentation on food images. Furthermore, FoodSAM extends its zero-shot capability to encompass panoptic segmentation by incorporating an object detector, which renders FoodSAM to effectively capture non-food object information. Drawing inspiration from the recent success of promptable segmentation, we also extend FoodSAM to promptable segmentation, supporting various prompt variants. Consequently, FoodSAM emerges as an all-encompassing solution capable of segmenting food items at multiple levels of granularity. Remarkably, this pioneering framework stands as the first-ever work to achieve instance, panoptic, and promptable segmentation on food images. Extensive experiments demonstrate the feasibility and impressing performance of FoodSAM, validating SAM's potential as a prominent and influential tool within the domain of food image segmentation. We release our code at https://github.com/jamesjg/FoodSAM.

updated: Fri Aug 11 2023 04:42:10 GMT+0000 (UTC)

published: Fri Aug 11 2023 04:42:10 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト