PRISM: A Unified Framework of Parameterized Submodular Information Measures for Targeted Data Subset Selection and Summarization

Vishal Kaushal; Suraj Kothawade; Ganesh Ramakrishnan; Jeff Bilmes; Rishabh Iyer

As datasets grow, techniques for finding smaller yet effective subsets with specific characteristics become important. Motivated by this, we present PRISM, a rich class of Parameterized Submodular Information Measures that can be used in applications where such targeted subsets are desired. We demonstrate the utility of PRISM in two such applications. First, we apply PRISM to improve a supervised model's performance at a given additional labeling cost through targeted subset selection (PRISM-TSS), where a subset of unlabeled points matching a target set is added to the training set. We show that PRISM-TSS generalizes and is connected to several existing approaches to targeted data subset selection. Second, we apply PRISM to a more nuanced task, targeted summarization (PRISM-TSUM), where data (e.g., image collections, text, or videos) is summarized for quicker human consumption while taking additional user intent into account. PRISM-TSUM handles multiple flavors of targeted summarization, such as query-focused, topic-irrelevant, privacy-preserving, and update summarization, in a unified way. We show that PRISM-TSUM also generalizes and unifies several existing works on targeted summarization. Through extensive experiments on image classification and image-collection summarization, we empirically verify the superiority of PRISM-TSS and PRISM-TSUM over the state of the art.

updated: Sat Feb 27 2021 04:53:47 GMT+0000 (UTC)

published: Sat Feb 27 2021 04:53:47 GMT+0000 (UTC)

arXiv
