Differentially Private Video Activity Recognition

Zelun Luo; Yuliang Zou; Yijin Yang; Zane Durante; De-An Huang; Zhiding Yu; Chaowei Xiao; Li Fei-Fei; Animashree Anandkumar

差分プライベートビデオアクティビティ認識

近年、差分プライバシーは画像分類において大幅な進歩を遂げています。ただし、ビデオアクティビティ認識への応用はまだ研究されていません。この文書では、差分プライバシーをビデオアクティビティ認識に適用する際の課題について説明します。この課題は主に次のようなことに起因します。(1) ビデオ全体に求められるプライバシーレベルと、一般に短くセグメント化された現代のビデオアーキテクチャによって処理される入力データの性質との間の矛盾。クリップ; (2) 画像分類におけるビデオデータセットと比べて、ビデオデータセットの複雑さとサイズが膨大であるため、従来の差分プライバシー手法は不十分です。これらの問題に取り組むために、私たちは、クリップベースの分類モデルを通じてビデオレベルの差分プライバシーを強制するための新しいフレームワークであるマルチクリップ DP-SGD を提案します。この方法では、追加のプライバシー損失を引き起こすことなく、各ビデオから複数のクリップをサンプリングし、それらの勾配を平均し、DP-SGD で勾配クリッピングを適用します。さらに、パラメータ効率の高い転移学習戦略を組み込んで、大規模なビデオデータセットに対してモデルをスケーラブルにします。 UCF-101 および HMDB-51 データセットに対する広範な評価を通じて、私たちのアプローチは印象的なパフォーマンスを示し、UCF-101 でイプシロン = 5 のプライバシーバジェットで 81% の精度を達成し、DP を直接適用した場合と比較して 76% の改善を記録しました。シンガポールドル。さらに、転移学習戦略が多用途であり、CheXpert、ImageNet、CIFAR-10、CIFAR-100 を含む一連のデータセットにわたって差分プライベート画像分類を強化できることを実証します。

In recent years, differential privacy has seen significant advancements in image classification; however, its application to video activity recognition remains under-explored. This paper addresses the challenges of applying differential privacy to video activity recognition, which primarily stem from: (1) a discrepancy between the desired privacy level for entire videos and the nature of input data processed by contemporary video architectures, which are typically short, segmented clips; and (2) the complexity and sheer size of video datasets relative to those in image classification, which render traditional differential privacy methods inadequate. To tackle these issues, we propose Multi-Clip DP-SGD, a novel framework for enforcing video-level differential privacy through clip-based classification models. This method samples multiple clips from each video, averages their gradients, and applies gradient clipping in DP-SGD without incurring additional privacy loss. Moreover, we incorporate a parameter-efficient transfer learning strategy to make the model scalable for large-scale video datasets. Through extensive evaluations on the UCF-101 and HMDB-51 datasets, our approach exhibits impressive performance, achieving 81% accuracy with a privacy budget of epsilon=5 on UCF-101, marking a 76% improvement compared to a direct application of DP-SGD. Furthermore, we demonstrate that our transfer learning strategy is versatile and can enhance differentially private image classification across an array of datasets including CheXpert, ImageNet, CIFAR-10, and CIFAR-100.

updated: Tue Jun 27 2023 18:47:09 GMT+0000 (UTC)

published: Tue Jun 27 2023 18:47:09 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト