One Class One Click: Quasi Scene-level Weakly Supervised Point Cloud Semantic Segmentation with Active Learning

Puzuo Wang; Wei Yao; Jie Shao

ワンクラスワンクリック: アクティブラーニングによる準シーンレベルの弱教師付き点群セマンティックセグメンテーション

主要なパフォーマンスを達成するために膨大な注釈に依存すると、大規模な点群セマンティックセグメンテーションの実用性が大幅に制限されます。データ注釈コストを削減する目的で、効果的なラベル付けスキームが開発され、弱い監督戦略の下で競争力のある結果を達成するのに貢献します。現在の弱いラベルフォームを再検討し、ポイントレベルおよびシーンレベルの注釈をカプセル化する、低コストでありながら有益な準シーンレベルラベルである One Class One Click (OCOC) を導入します。グローバルおよびローカルの観点からの弱い監督を含むことにより、希少なラベルを活用するために、アクティブな弱い監督フレームワークが提案されています。コンテキスト制約は、補助的なシーン分類タスクによって、それぞれグローバルな特徴の埋め込みとポイント単位の予測集約に基づいて課され、モデル予測を OCOC ラベルのみに制限します。さらに、ポイントレベルの監視信号を効果的に補完する、コンテキストを意識した疑似ラベリング戦略を設計します。最後に、不確実性尺度を備えたアクティブな学習スキーム - 一時的な出力の不一致を統合して有益なサンプルを調べ、サブクラウドクエリに関するガイダンスを提供します。空中、モバイル、および地上のプラットフォームから収集された 3 つの LiDAR ベンチマークを使用した広範な実験的分析は、提案された方法が、ラベルが少ないにもかかわらず、非常に有望な結果を達成することを示しています。平均 F1 スコアの点で、本物のシーンレベルの弱い教師ありメソッドよりも最大 25% 優れたパフォーマンスを発揮し、完全な教師ありスキームに対して競争力のある結果を達成します。地上の LiDAR データセット - Semantics3D では、約 2 つのラベルを使用して、平均 F1 スコア 85.2% を達成し、ベースラインモデルと比較して 11.58% 増加しています。

Reliance on vast annotations to achieve leading performance severely restricts the practicality of large-scale point cloud semantic segmentation. For the purpose of reducing data annotation costs, effective labeling schemes are developed and contribute to attaining competitive results under weak supervision strategy. Revisiting current weak label forms, we introduce One Class One Click (OCOC), a low cost yet informative quasi scene-level label, which encapsulates point-level and scene-level annotations. An active weakly supervised framework is proposed to leverage scarce labels by involving weak supervision from global and local perspectives. Contextual constraints are imposed by an auxiliary scene classification task, respectively based on global feature embedding and point-wise prediction aggregation, which restricts the model prediction merely to OCOC labels. Furthermore, we design a context-aware pseudo labeling strategy, which effectively supplement point-level supervisory signals. Finally, an active learning scheme with a uncertainty measure - temporal output discrepancy is integrated to examine informative samples and provides guidance on sub-clouds query, which is conducive to quickly attaining desirable OCOC annotations and reduces the labeling cost to an extremely low extent. Extensive experimental analysis using three LiDAR benchmarks collected from airborne, mobile and ground platforms demonstrates that our proposed method achieves very promising results though subject to scarce labels. It considerably outperforms genuine scene-level weakly supervised methods by up to 25% in terms of average F1 score and achieves competitive results against full supervision schemes. On terrestrial LiDAR dataset - Semantics3D, using approximately 2 of labels, our method achieves an average F1 score of 85.2%, which increases by 11.58% compared to the baseline model.

updated: Wed Nov 23 2022 01:23:26 GMT+0000 (UTC)

published: Wed Nov 23 2022 01:23:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト