Improving Data-Efficient Fossil Segmentation via Model Editing

Indu Panigrahi; Ryan Manzuk; Adam Maloof; Ruth Fong

モデル編集によるデータ効率の良い化石セグメンテーションの改善

ほとんどのコンピュータービジョン研究は、ありふれたオブジェクトの何千もの画像を含むデータセットに焦点を当てています。ただし、医学や地球科学のデータセットなど、影響の大きいデータセットの多くには、ドメインの専門家の知識を認識して収集し、注釈を付けるのに時間がかかる細粒度のオブジェクトが含まれています。その結果、これらのデータセットにはラベル付きの画像がほとんど含まれておらず、現在のマシンビジョンモデルはそれらを集中的にトレーニングすることはできません。元々は多言語モデルを修正するために導入された機械学習のモデル編集手法は、少量のデータと追加のトレーニングのみを使用してモデルのパフォーマンスを向上させることが示されています。マスク R-CNN を使用して岩石サンプル画像の古代サンゴ礁の化石をセグメント化することで、ラベル付き画像をほとんど使用せずに化石セグメンテーションを改善する 2 つの部分からなるパラダイムを提示します。最初に画像摂動を使用してモデルの弱点を特定し、次にモデル編集を使用してそれらの弱点を軽減します。具体的には、ドメイン情報に基づく画像摂動を適用して、Mask R-CNN が異なるクラスの化石を区別できないことと、異なるテクスチャを持つ化石をセグメント化する際の矛盾を明らかにします。これらの欠点に対処するために、画像分類の体系的な誤りを修正するための既存のモデル編集方法を、追加のラベル付きデータを必要としない画像セグメンテーションに拡張し、異なる種類の化石間の混乱を減らす効果を示します。また、この状況でのモデル編集の最適な設定も強調しています。1 つの画像内の関連するすべてのピクセルを使用して 1 つの編集を行います (複数の画像、複数の編集、またはより少ないピクセルを使用するのとは対照的です)。化石のセグメンテーションに焦点を当てていますが、私たちのアプローチは、データが限られている他の同様の細粒度のセグメンテーションの問題にも役立つ可能性があります。

Most computer vision research focuses on datasets containing thousands of images of commonplace objects. However, many high-impact datasets, such as those in medicine and the geosciences, contain fine-grain objects that require domain-expert knowledge to recognize and are time-consuming to collect and annotate. As a result, these datasets contain few labeled images, and current machine vision models cannot train intensively on them. Originally introduced to correct large-language models, model-editing techniques in machine learning have been shown to improve model performance using only small amounts of data and additional training. Using a Mask R-CNN to segment ancient reef fossils in rock sample images, we present a two-part paradigm to improve fossil segmentation with few labeled images: we first identify model weaknesses using image perturbations and then mitigate those weaknesses using model editing. Specifically, we apply domain-informed image perturbations to expose the Mask R-CNN's inability to distinguish between different classes of fossils and its inconsistency in segmenting fossils with different textures. To address these shortcomings, we extend an existing model-editing method for correcting systematic mistakes in image classification to image segmentation with no additional labeled data needed and show its effectiveness in decreasing confusion between different kinds of fossils. We also highlight the best settings for model editing in our situation: making a single edit using all relevant pixels in one image (vs. using multiple images, multiple edits, or fewer pixels). Though we focus on fossil segmentation, our approach may be useful in other similar fine-grain segmentation problems where data is limited.

updated: Mon Apr 10 2023 03:43:57 GMT+0000 (UTC)

published: Sat Oct 08 2022 02:12:38 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト