Confidence-Aware Active Feedback for Interactive Instance Search

Yue Zhang; Chao Liang; Longxiang Jiang

Online relevance feedback (RF) is widely used in instance search (INS) tasks to refine imperfect ranking results, but its interaction efficiency is often low. Active learning (AL) addresses this problem by selecting valuable feedback candidates. However, mainstream AL methods require an initial labeled set for a cold start and are often computationally expensive, so they cannot fully satisfy the requirements of online RF in interactive INS tasks. To address this issue, we propose a confidence-aware active feedback method (CAAF) designed specifically for online RF in interactive INS tasks. Inspired by the explicit difficulty modeling scheme in self-paced learning, CAAF uses a pairwise manifold ranking loss to evaluate the ranking confidence of each unlabeled sample. The ranking confidence improves interaction efficiency by indicating valuable feedback candidates, and it improves ranking quality by modulating the diffusion weights in manifold ranking. In addition, we design two acceleration strategies, an approximate optimization scheme and a top-K search scheme, to reduce the computational complexity of CAAF. Extensive experiments on both image and video INS tasks, searching for buildings, landscapes, persons, and human behaviors, demonstrate the effectiveness of the proposed method. Notably, in the real-world, large-scale video INS task of NIST TRECVID 2021, CAAF achieves performance nearly equivalent to that of the champion solution while using 25% fewer feedback samples. Moreover, with the same number of feedback samples, CAAF reaches an mAP of 51.9%, surpassing the champion solution by 5.9%.
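To make the ingredients named in the abstract more concrete, the following is a minimal sketch of classical closed-form manifold ranking together with a per-sample loss used as a stand-in for ranking confidence. It is not CAAF itself: the abstract does not give the pairwise loss, the confidence definition, or the acceleration schemes, so the function names, the smoothness-based loss, and the "highest loss among unlabeled samples" selection rule are all assumptions made for illustration.

```python
import numpy as np


def manifold_ranking(W, y, alpha=0.9):
    """Classical closed-form manifold ranking: f = (I - alpha * S)^-1 y,
    where S = D^-1/2 W D^-1/2 is the symmetrically normalized affinity."""
    d = np.maximum(W.sum(axis=1), 1e-12)  # node degrees, guarded against zero
    inv_sqrt_d = 1.0 / np.sqrt(d)
    S = W * inv_sqrt_d[:, None] * inv_sqrt_d[None, :]
    return np.linalg.solve(np.eye(W.shape[0]) - alpha * S, y)


def per_sample_smoothness_loss(W, f):
    """Assumed confidence proxy (not the paper's loss): each sample's share of
    the smoothness term, sum_j W_ij * (f_i/sqrt(d_i) - f_j/sqrt(d_j))^2.
    A high value means the sample's score disagrees with its neighbors."""
    d = np.maximum(W.sum(axis=1), 1e-12)
    g = f / np.sqrt(d)
    diff = g[:, None] - g[None, :]
    return (W * diff ** 2).sum(axis=1)


def pick_feedback_candidate(W, f, labeled):
    """Hypothetical selection rule: query the unlabeled sample with the
    highest loss, i.e. the lowest assumed ranking confidence."""
    loss = per_sample_smoothness_loss(W, f)
    loss[np.array(list(labeled), dtype=int)] = -np.inf  # skip labeled samples
    return int(np.argmax(loss))
```

Here `W` is a symmetric, non-negative affinity matrix with a zero diagonal and `y` is the initial query indicator vector. In an interactive loop one would call `pick_feedback_candidate`, obtain the user's label for the returned index, update `y`, and re-rank. How CAAF feeds confidence back into the diffusion weights is described in the paper, not in this sketch.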

updated: Fri Sep 02 2022 10:39:23 GMT+0000 (UTC)

published: Sat Oct 23 2021 16:14:03 GMT+0000 (UTC)

arXiv
