Improving Data Efficiency for Plant Cover Prediction with Label Interpolation and Monte-Carlo Cropping

Matthias Körschens; Solveig Franziska Bucher; Christine Römermann; Joachim Denzler

ラベル補間とモンテカルロトリミングによる植物被覆予測のデータ効率の向上

植物群落の構成は環境変化の重要な指標であり、このため、生態学的フィールド研究では通常、いわゆる植物被覆の観点から分析されます。この種のデータを手動で取得するには、時間がかかり、労力がかかり、人的ミスが発生しやすくなります。自動カメラシステムは、調査された植生プロットの高解像度画像を高頻度で収集できます。その後のアルゴリズム分析と組み合わせることで、人的労力をほとんどかけることなく、植物群落の構成に関する情報を客観的に抽出することが可能になります。自動カメラシステムは、自動分析のための深層学習システムのトレーニングに必要な大量の画像データを簡単に収集できます。ただし、植生画像に植物被覆データの注釈を付けるには多大な作業が必要となるため、利用できるラベル付きサンプルはごくわずかです。自動カメラシステムはラベルなしで多くの画像を収集できるため、収集した植生プロット時系列のまばらなラベルを中間の密なラベルなし画像まで補間して、トレーニングデータセットを元のサイズの 7 倍に人為的に増やすアプローチを導入します。さらに、モンテカルロクロッピングと呼ばれる新しい方法を導入します。このアプローチでは、トレーニング画像の切り取られた部分のコレクションをトレーニングして、高解像度画像を効率的に処理し、トレーニング画像を暗黙的に拡張してトレーニングを高速化します。私たちは草本植物群落の画像を含む植物被覆データセットで両方のアプローチを評価し、私たちの方法が調査された種、群落、およびセグメンテーションメトリクスの改善につながることを発見しました。

The plant community composition is an essential indicator of environmental changes and is, for this reason, usually analyzed in ecological field studies in terms of the so-called plant cover. The manual acquisition of this kind of data is time-consuming, laborious, and prone to human error. Automated camera systems can collect high-resolution images of the surveyed vegetation plots at a high frequency. In combination with subsequent algorithmic analysis, it is possible to objectively extract information on plant community composition quickly and with little human effort. An automated camera system can easily collect the large amounts of image data necessary to train a Deep Learning system for automatic analysis. However, due to the amount of work required to annotate vegetation images with plant cover data, only few labeled samples are available. As automated camera systems can collect many pictures without labels, we introduce an approach to interpolate the sparse labels in the collected vegetation plot time series down to the intermediate dense and unlabeled images to artificially increase our training dataset to seven times its original size. Moreover, we introduce a new method we call Monte-Carlo Cropping. This approach trains on a collection of cropped parts of the training images to deal with high-resolution images efficiently, implicitly augment the training images, and speed up training. We evaluate both approaches on a plant cover dataset containing images of herbaceous plant communities and find that our methods lead to improvements in the species, community, and segmentation metrics investigated.

updated: Mon Jul 17 2023 15:17:39 GMT+0000 (UTC)

published: Mon Jul 17 2023 15:17:39 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト