Combining Supervised and Un-supervised Learning for Automatic Citrus Segmentation

Heqing Huang; Tongbin Huang; Zhen Li; Zhiwei Wei; Shilei Lv

自動柑橘類セグメンテーションのための教師あり学習と教師なし学習の組み合わせ

柑橘類のセグメンテーションは、自動柑橘類摘み取りの重要なステップです。現在のほとんどの画像セグメンテーションアプローチは、ピクセル単位のセグメンテーションによって良好なセグメンテーション結果を達成しますが、これらの教師あり学習ベースの方法は、大量の注釈付きデータを必要とし、実際のアプリケーションでの柑橘類の位置の継続的な時間的変化を考慮しません。この論文では、最初に、教師ありの方法で少数のラベル付き柑橘類画像を使用して単純なCNNをトレーニングします。これにより、各フレームから柑橘類の位置を大まかに予測できます。次に、最先端の教師なし学習アプローチを拡張して、ラベルのない柑橘類のビデオからフレーム間の柑橘類の潜在的な動きを事前に学習します。両方のネットワークを活用するために、教師あり学習静的情報と教師なし学習運動情報を組み合わせるマルチモーダルトランスフォーマーを採用しています。実験結果は、両方のネットワークを組み合わせると、予測精度が88.3％IOUおよび93.6％の精度に達し、元の監視対象ベースラインの1.2％および2.4％を上回っていることを示しています。既存の柑橘類のセグメンテーション方法のほとんどと比較して、私たちの方法は、セグメンテーション効果を高めるために、ピクセルレベルの位置情報と柑橘類の変化の時間情報を学習しながら、少量の教師ありデータと多数の教師なしデータを使用します。

Citrus segmentation is a key step of automatic citrus picking. While most current image segmentation approaches achieve good segmentation results by pixel-wise segmentation, these supervised learning-based methods require a large amount of annotated data, and do not consider the continuous temporal changes of citrus position in real-world applications. In this paper, we first train a simple CNN with a small number of labelled citrus images in a supervised manner, which can roughly predict the citrus location from each frame. Then, we extend a state-of-the-art unsupervised learning approach to pre-learn the citrus's potential movements between frames from unlabelled citrus's videos. To take advantages of both networks, we employ the multimodal transformer to combine supervised learned static information and unsupervised learned movement information. The experimental results show that combing both network allows the prediction accuracy reached at 88.3% IOU and 93.6% precision, outperforming the original supervised baseline 1.2% and 2.4%. Compared with most of the existing citrus segmentation methods, our method uses a small amount of supervised data and a large number of unsupervised data, while learning the pixel level location information and the temporal information of citrus changes to enhance the segmentation effect.

updated: Tue May 04 2021 15:09:01 GMT+0000 (UTC)

published: Tue May 04 2021 15:09:01 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト