Transmission-Guided Bayesian Generative Model for Smoke Segmentation

Siyuan Yan; Jing Zhang; Nick Barnes

煙セグメンテーションのための透過誘導ベイジアン生成モデル

煙のセグメンテーションは、野火を正確に特定して初期段階で消火できるようにするために不可欠です。ディープニューラルネットワークは画像セグメンテーションタスクで有望な結果を達成していますが、その非剛体形状と透明な外観のために、煙のセグメンテーションを過信する傾向があります。これは、正確な煙のセグメンテーションのための限られたトレーニングデータによる知識レベルの不確実性と、グラウンドトゥルースのラベリングの難しさを表すラベリングレベルの不確実性の両方によって引き起こされます。 2 種類の不確実性を効果的にモデル化するために、ベイジアン生成モデルを導入して、モデルパラメーターの事後分布とその予測を同時に推定します。さらに、煙の画像は、物理ベースの画像のかすみ除去方法に触発された低コントラストとあいまいさに悩まされています。ピクセル距離と透過特性に基づいてペアワイズ関係を学習するようにネットワークを誘導する透過誘導ローカルコヒーレンス損失を設計します。この分野の開発を促進するために、ピクセル単位の注釈を付けた 1,400 の実画像と 4,000 の合成画像で構成される高品質の煙セグメンテーションデータセット SMOKE5K も提供しています。ベンチマークテストデータセットの実験結果は、モデルが正確な予測と、その予測に関するモデルの無知を表す信頼できる不確実性マップの両方を達成することを示しています。私たちのコードとデータセットは、https://github.com/redlessme/Transmission-BVM で公開されています。

Smoke segmentation is essential to precisely localize wildfire so that it can be extinguished in an early phase. Although deep neural networks have achieved promising results on image segmentation tasks, they are prone to be overconfident for smoke segmentation due to its non-rigid shape and transparent appearance. This is caused by both knowledge level uncertainty due to limited training data for accurate smoke segmentation and labeling level uncertainty representing the difficulty in labeling ground-truth. To effectively model the two types of uncertainty, we introduce a Bayesian generative model to simultaneously estimate the posterior distribution of model parameters and its predictions. Further, smoke images suffer from low contrast and ambiguity, inspired by physics-based image dehazing methods, we design a transmission-guided local coherence loss to guide the network to learn pair-wise relationships based on pixel distance and the transmission feature. To promote the development of this field, we also contribute a high-quality smoke segmentation dataset, SMOKE5K, consisting of 1,400 real and 4,000 synthetic images with pixel-wise annotation. Experimental results on benchmark testing datasets illustrate that our model achieves both accurate predictions and reliable uncertainty maps representing model ignorance about its prediction. Our code and dataset are publicly available at: https://github.com/redlessme/Transmission-BVM.

updated: Thu Mar 02 2023 01:48:05 GMT+0000 (UTC)

published: Thu Mar 02 2023 01:48:05 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト