Open-Vocabulary Affordance Detection in 3D Point Clouds

Toan Ngyen; Minh Nhat Vu; An Vuong; Dzung Nguyen; Thieu Vo; Ngan Le; Anh Nguyen

3D点群におけるオープン語彙アフォーダンス検出

アフォーダンスの検出は、さまざまなロボットアプリケーションでの困難な問題です。従来のアフォーダンス検出方法は、定義済みの一連のアフォーダンスラベルに限定されているため、複雑で動的な環境におけるインテリジェントロボットの適応性が制限される可能性があります。この論文では、Open-Vocabulary Affordance Detection (OpenAD) メソッドを紹介します。これは、3D 点群で無制限の数のアフォーダンスを検出することができます。アフォーダンステキストとポイント機能を同時に学習することで、OpenAD はアフォーダンス間の意味関係をうまく活用します。したがって、提案された方法により、ゼロショット検出が可能になり、単一の注釈の例がなくても、以前には見られなかったアフォーダンスを検出できます。集中的な実験結果は、OpenAD が幅広いアフォーダンス検出セットアップで効果的に機能し、他のベースラインよりも大幅に優れていることを示しています。さらに、提案された OpenAD が実世界のロボットアプリケーションで高速な推論速度 (約 100 ミリ秒) で実用的であることを示します。

Affordance detection is a challenging problem with a wide variety of robotic applications. Traditional affordance detection methods are limited to a predefined set of affordance labels, hence potentially restricting the adaptability of intelligent robots in complex and dynamic environments. In this paper, we present the Open-Vocabulary Affordance Detection (OpenAD) method, which is capable of detecting an unbounded number of affordances in 3D point clouds. By simultaneously learning the affordance text and the point feature, OpenAD successfully exploits the semantic relationships between affordances. Therefore, our proposed method enables zero-shot detection and can detect previously unseen affordances without a single annotation example. Intensive experimental results show that OpenAD works effectively on a wide range of affordance detection setups and outperforms other baselines by a large margin. Additionally, we demonstrate the practicality of the proposed OpenAD in real-world robotic applications with a fast inference speed (~100 ms).

updated: Sat Mar 04 2023 12:26:47 GMT+0000 (UTC)

published: Sat Mar 04 2023 12:26:47 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト