Black-box Dataset Ownership Verification via Backdoor Watermarking

Yiming Li; Mingyan Zhu; Xue Yang; Yong Jiang; Tao Wei; Shu-Tao Xia

バックドア透かしによるブラックボックスデータセット所有権の検証

ディープラーニング、特にディープニューラルネットワーク (DNN) は、その高い有効性と効率性から、多くの重要なアプリケーションに広く採用され、成功を収めています。 DNN の急速な開発は、いくつかの高品質のデータセット (ImageNet など) の存在から恩恵を受けてきました。これにより、研究者や開発者はメソッドのパフォーマンスを簡単に検証できます。現在、公開されているほとんどすべての既存のデータセットは、許可なく商用目的ではなく、学術または教育目的でのみ採用できることを要求しています。ただし、それを保証する良い方法はまだありません。このホワイトペーパーでは、リリースされたデータセットの保護を、(疑わしい) サードパーティモデルのトレーニングに採用されているかどうかを検証することとして定式化します。この場合、防御者はモデルのクエリのみを実行でき、そのパラメーターとトレーニングの詳細に関する情報はありません。この定式化に基づいて、バックドア透かしを介して外部パターンを埋め込み、所有権を検証してそれらを保護することを提案します。私たちの方法には、データセットの透かしとデータセットの検証を含む 2 つの主要な部分が含まれています。具体的には、データセットの透かしにポイズンのみのバックドア攻撃 (BadNets など) を利用し、データセット検証のための仮説テストに基づく方法を設計します。また、私たちの方法のいくつかの理論的分析も提供します。さまざまなタスクの複数のベンチマークデータセットに対する実験が行われ、この方法の有効性が検証されます。主な実験を再現するためのコードは、https://github.com/THUYimingLi/DVBW で入手できます。

Deep learning, especially deep neural networks (DNNs), has been widely and successfully adopted in many critical applications for its high effectiveness and efficiency. The rapid development of DNNs has benefited from the existence of some high-quality datasets (e.g., ImageNet), which allow researchers and developers to easily verify the performance of their methods. Currently, almost all existing released datasets require that they can only be adopted for academic or educational purposes rather than commercial purposes without permission. However, there is still no good way to ensure that. In this paper, we formulate the protection of released datasets as verifying whether they are adopted for training a (suspicious) third-party model, where defenders can only query the model while having no information about its parameters and training details. Based on this formulation, we propose to embed external patterns via backdoor watermarking for the ownership verification to protect them. Our method contains two main parts, including dataset watermarking and dataset verification. Specifically, we exploit poison-only backdoor attacks (e.g., BadNets) for dataset watermarking and design a hypothesis-test-guided method for dataset verification. We also provide some theoretical analyses of our methods. Experiments on multiple benchmark datasets of different tasks are conducted, which verify the effectiveness of our method. The code for reproducing main experiments is available at https://github.com/THUYimingLi/DVBW.

updated: Fri Mar 31 2023 01:11:50 GMT+0000 (UTC)

published: Thu Aug 04 2022 05:32:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト