Your Room is not Private: Gradient Inversion Attack for Deep Q-Learning

Miao Li; Wenhao Ding; Ding Zhao

あなたの部屋はプライベートではありません: 深層 Q 学習のための勾配反転攻撃

ロボットが仮想環境内で移動し、認識し、関与できるようにする身体型人工知能 (AI) の卓越性は、コンピュータービジョンと大規模言語モデルの目覚ましい進歩により、大きな注目を集めています。ロボットが大量の個人情報にアクセスするため、身体化された AI の領域内ではプライバシーが極めて重要な懸念事項として浮上します。しかし、具体化された AI タスクにおけるプライバシー漏洩の問題、特に意思決定アルゴリズムに関連した問題は、研究において十分な考慮が払われていません。この論文は、状態、アクション、および Q 値を再構築するために勾配反転を利用して、ディープ Q ラーニングアルゴリズムに対する攻撃を提案することで、このギャップに対処することを目的としています。攻撃に勾配を使用するという選択は、一般的に使用されているフェデレーテッドラーニング技術が、データを保存したりパブリックサーバーに送信したりすることなく、プライベートユーザーデータに基づいて計算された勾配のみを利用してモデルを最適化するという事実によって動機付けられています。それにもかかわらず、これらの勾配には、個人データを公開する可能性がある十分な情報が含まれています。私たちのアプローチを検証するために、AI2THOR シミュレーターで実験を実施し、身体化 AI で一般的なタスクである能動的な知覚に関するアルゴリズムを評価します。実験結果は、120 部屋すべてのレイアウトにわたるデータからすべての情報を正常に復元するという私たちの方法の有効性を説得力を持って示しています。

The prominence of embodied Artificial Intelligence (AI), which empowers robots to navigate, perceive, and engage within virtual environments, has attracted significant attention, owing to the remarkable advancements in computer vision and large language models. Privacy emerges as a pivotal concern within the realm of embodied AI, as the robot access substantial personal information. However, the issue of privacy leakage in embodied AI tasks, particularly in relation to decision-making algorithms, has not received adequate consideration in research. This paper aims to address this gap by proposing an attack on the Deep Q-Learning algorithm, utilizing gradient inversion to reconstruct states, actions, and Q-values. The choice of using gradients for the attack is motivated by the fact that commonly employed federated learning techniques solely utilize gradients computed based on private user data to optimize models, without storing or transmitting the data to public servers. Nevertheless, these gradients contain sufficient information to potentially expose private data. To validate our approach, we conduct experiments on the AI2THOR simulator and evaluate our algorithm on active perception, a prevalent task in embodied AI. The experimental results convincingly demonstrate the effectiveness of our method in successfully recovering all information from the data across all 120 room layouts.

updated: Thu Jun 15 2023 16:53:26 GMT+0000 (UTC)

published: Thu Jun 15 2023 16:53:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト