DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization

Xiaojun Tang; Junsong Fan; Chuanchen Luo; Zhaoxiang Zhang; Man Zhang; Zongyuan Yang

DDG-Net: 弱く監視された時間的アクションの位置特定のための識別可能性駆動型グラフネットワーク

弱教師付き時間的動作位置特定 (WTAL) は、実用的ではありますが、困難なタスクです。データセットが大規模であるため、既存の手法のほとんどは、他のデータセットで事前トレーニングされたネットワークを使用して特徴を抽出しますが、これは WTAL には十分に適していません。この問題に対処するために、研究者は機能強化のためのいくつかのモジュールを設計し、ローカリゼーションモジュールのパフォーマンスを向上させ、特にスニペット間の時間的関係をモデル化しています。しかし、それらはいずれも、曖昧な情報が他者の識別性を低下させる悪影響を無視している。この現象を考慮して、我々は、曖昧なスニペットと適切に設計された接続を備えた識別スニペットを明示的にモデル化し、曖昧な情報の伝達を防ぎ、スニペットレベルの表現の識別性を高めるDiscriminability-Driven Graph Network (DDG-Net)を提案します。さらに、特徴の同化を防ぎ、より識別的な表現を生成するようにグラフ畳み込みネットワークを駆動するために、特徴の一貫性の損失を提案します。 THUMOS14 および ActivityNet1.2 ベンチマークに関する広範な実験により、DDG-Net の有効性が実証され、両方のデータセットで新しい最先端の結果が確立されました。ソースコードは https://github.com/XiaojunTang22/ICCV2023-DDGNet で入手できます。

Weakly-supervised temporal action localization (WTAL) is a practical yet challenging task. Due to large-scale datasets, most existing methods use a network pretrained in other datasets to extract features, which are not suitable enough for WTAL. To address this problem, researchers design several modules for feature enhancement, which improve the performance of the localization module, especially modeling the temporal relationship between snippets. However, all of them neglect the adverse effects of ambiguous information, which would reduce the discriminability of others. Considering this phenomenon, we propose Discriminability-Driven Graph Network (DDG-Net), which explicitly models ambiguous snippets and discriminative snippets with well-designed connections, preventing the transmission of ambiguous information and enhancing the discriminability of snippet-level representations. Additionally, we propose feature consistency loss to prevent the assimilation of features and drive the graph convolution network to generate more discriminative representations. Extensive experiments on THUMOS14 and ActivityNet1.2 benchmarks demonstrate the effectiveness of DDG-Net, establishing new state-of-the-art results on both datasets. Source code is available at https://github.com/XiaojunTang22/ICCV2023-DDGNet.

updated: Mon Jul 31 2023 05:48:39 GMT+0000 (UTC)

published: Mon Jul 31 2023 05:48:39 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト