PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery

Sheng Zhang; Salman Khan; Zhiqiang Shen; Muzammal Naseer; Guangyi Chen; Fahad Khan

PromptCAL: 一般化された新規カテゴリ発見のための補助プロンプトによる対照的親和性学習

既存の半教師あり学習モデルは、注釈なしの分布データを使用した学習で目覚ましい成功を収めていますが、閉じたセットの仮定のために、新しいセマンティッククラスからサンプリングされたラベルのないデータで学習することはほとんどありません。この作業では、実用的だが十分に調査されていない一般化小説カテゴリ発見 (GNCD) 設定を対象としています。 GNCD 設定は、部分的にラベル付けされた既知のクラスの情報を活用して、既知のクラスと新しいクラスからのラベル付けされていないトレーニングデータを分類することを目的としています。この困難な問題に対処するために、PromptCAL と呼ばれる、補助的な視覚的プロンプトを使用した 2 段階の対照的アフィニティ学習法を提案します。私たちのアプローチは、信頼できるペアワイズサンプルアフィニティを発見して、クラストークンと視覚的プロンプトの既知のクラスと新しいクラスの両方のより良いセマンティッククラスタリングを学習します。まず、洗練されたアフィニティ関係のために、プロンプトに適応した事前トレーニング済みのビジョントランスフォーマーのセマンティックな識別性を強化するために、識別的なプロンプト正則化損失を提案します。さらに、意味論的に強化された迅速な監督のための反復的な半教師付きアフィニティグラフ生成方法に基づいて、セマンティック表現を調整するための対照的なアフィニティ学習ステージを提案します。広範な実験的評価により、PromptCAL メソッドは、限られた注釈でも新しいクラスを発見するのにより効果的であり、一般的できめ細かいベンチマークで現在の最先端技術を凌駕することが実証されています (CUB-200 で約 11% の向上、および 9 % ImageNet-100 で) 全体的な精度で。

Although existing semi-supervised learning models achieve remarkable success in learning with unannotated in-distribution data, they mostly fail to learn on unlabeled data sampled from novel semantic classes due to their closed-set assumption. In this work, we target a pragmatic but under-explored Generalized Novel Category Discovery (GNCD) setting. The GNCD setting aims to categorize unlabeled training data coming from known and novel classes by leveraging the information of partially labeled known classes. We propose a two-stage Contrastive Affinity Learning method with auxiliary visual Prompts, dubbed PromptCAL, to address this challenging problem. Our approach discovers reliable pairwise sample affinities to learn better semantic clustering of both known and novel classes for the class token and visual prompts. First, we propose a discriminative prompt regularization loss to reinforce semantic discriminativeness of prompt-adapted pre-trained vision transformer for refined affinity relationships. Besides, we propose a contrastive affinity learning stage to calibrate semantic representations based on our iterative semi-supervised affinity graph generation method for semantically-enhanced prompt supervision. Extensive experimental evaluation demonstrates that our PromptCAL method is more effective in discovering novel classes even with limited annotations and surpasses the current state-of-the-art on generic and fine-grained benchmarks (with nearly 11% gain on CUB-200, and 9% on ImageNet-100) on overall accuracy.

updated: Sun Dec 11 2022 20:06:14 GMT+0000 (UTC)

published: Sun Dec 11 2022 20:06:14 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト