PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery

Sheng Zhang; Salman Khan; Zhiqiang Shen; Muzammal Naseer; Guangyi Chen; Fahad Khan

Although existing semi-supervised learning models achieve remarkable success in learning with unannotated in-distribution data, they mostly fail to learn from unlabeled data sampled from novel semantic classes because of their closed-set assumption. In this work, we target a pragmatic but under-explored Generalized Novel Category Discovery (GNCD) setting. The GNCD setting aims to categorize unlabeled training data coming from both known and novel classes by leveraging the information of partially labeled known classes. To address this challenging problem, we propose a two-stage Contrastive Affinity Learning method with auxiliary visual Prompts, dubbed PromptCAL. Our approach discovers reliable pairwise sample affinities to learn better semantic clustering of both known and novel classes for the class token and visual prompts. First, we propose a discriminative prompt regularization loss to reinforce the semantic discriminativeness of the prompt-adapted pre-trained vision transformer for refined affinity relationships. Second, we propose contrastive affinity learning to calibrate semantic representations based on our iterative semi-supervised affinity graph generation method for semantically enhanced supervision. Extensive experimental evaluation demonstrates that our PromptCAL method is more effective in discovering novel classes even with limited annotations and surpasses the current state-of-the-art in overall accuracy on generic and fine-grained benchmarks (e.g., with nearly 11% gain on CUB-200, and 9% on ImageNet-100). Our code is available at https://github.com/sheng-eatamath/PromptCAL.
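
The abstract mentions a semi-supervised affinity graph built from pairwise sample affinities. As a rough illustration of that general idea only, the sketch below links each sample to its nearest neighbours by cosine similarity and then overrides those links wherever two samples both carry labels. This is not the authors' algorithm: the function name `semi_supervised_affinity`, the parameter `k`, and the use of a plain binary k-nearest-neighbour graph are all assumptions made for the example, and the iterative generation step described in the paper is not reproduced.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def semi_supervised_affinity(embeddings, labels, k=1):
    """Build a symmetric 0/1 affinity matrix (hypothetical helper).

    embeddings: list of feature vectors, one per sample.
    labels: list where labels[i] is a class id, or None if sample i is unlabeled.
    k: number of nearest neighbours linked per sample (assumed hyperparameter).
    """
    n = len(embeddings)
    affinity = [[0] * n for _ in range(n)]

    # Step 1: link every sample to its k most similar neighbours.
    for i in range(n):
        sims = sorted(
            ((cosine(embeddings[i], embeddings[j]), j) for j in range(n) if j != i),
            reverse=True,
        )
        for _, j in sims[:k]:
            affinity[i][j] = affinity[j][i] = 1

    # Step 2: where both samples are labeled, the labels decide the link.
    for i in range(n):
        affinity[i][i] = 1
        for j in range(i + 1, n):
            if labels[i] is not None and labels[j] is not None:
                same = int(labels[i] == labels[j])
                affinity[i][j] = affinity[j][i] = same
    return affinity


# Toy data: three labeled samples (two classes) and two unlabeled ones.
embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [0.8, 0.2]]
labels = [0, 0, 1, None, None]
graph = semi_supervised_affinity(embeddings, labels, k=1)
for row in graph:
    print(row)
```

In this toy setup the unlabeled samples attach to the labeled cluster they sit closest to, while pairs with conflicting labels stay disconnected regardless of how similar their embeddings are. A graph of this kind could then supply positive pairs for a contrastive loss, which is the role the abstract assigns to the affinity graph.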

updated: Sun Mar 26 2023 10:30:22 GMT+0000 (UTC)

published: Sun Dec 11 2022 20:06:14 GMT+0000 (UTC)

arXiv
