ICECAP: Information Concentrated Entity-aware Image Captioning

Anwen Hu; Shizhe Chen; Qin Jin

ICECAP：情報が集中したエンティティ対応の画像キャプション

現在のほとんどの画像キャプションシステムは、一般的な画像コンテンツの記述に重点を置いており、正確な名前付きエンティティや具体的なイベントなど、画像を深く理解するための背景知識が不足しています。この作業では、関連するニュース記事を活用してターゲット画像に関する背景知識を提供することにより、有益なキャプションを生成することを目的とした、エンティティ認識のニュース画像キャプションタスクに焦点を当てます。ただし、ニュース記事の長さのため、以前の作品では、粗い記事または文レベルのニュース記事しか使用されていません。これらは、関連するイベントを絞り込み、名前付きエンティティを正確に選択するのに十分な粒度ではありません。これらの制限を克服するために、情報集中型エンティティ認識ニュース画像キャプション（ICECAP）モデルを提案します。このモデルは、対応するニュース記事内の関連するテキスト情報に文レベルから単語レベルまで徐々に集中します。私たちのモデルは、最初にクロスモダリティ検索モデルを使用して関連する文に大まかな集中を作成し、次に文内の関連する単語にさらに集中することによってキャプションを生成します。 BreakingNewsとGoodNewsの両方のデータセットでの広範な実験は、他の最先端技術をしのぐ、提案された方法の有効性を示しています。 ICECAPのコードは、https：//github.com/HAWLYQ/ICECAPで公開されています。

Most current image captioning systems focus on describing general image content, and lack background knowledge to deeply understand the image, such as exact named entities or concrete events. In this work, we focus on the entity-aware news image captioning task which aims to generate informative captions by leveraging the associated news articles to provide background knowledge about the target image. However, due to the length of news articles, previous works only employ news articles at the coarse article or sentence level, which are not fine-grained enough to refine relevant events and choose named entities accurately. To overcome these limitations, we propose an Information Concentrated Entity-aware news image CAPtioning (ICECAP) model, which progressively concentrates on relevant textual information within the corresponding news article from the sentence level to the word level. Our model first creates coarse concentration on relevant sentences using a cross-modality retrieval model and then generates captions by further concentrating on relevant words within the sentences. Extensive experiments on both BreakingNews and GoodNews datasets demonstrate the effectiveness of our proposed method, which outperforms other state-of-the-arts. The code of ICECAP is publicly available at https://github.com/HAWLYQ/ICECAP.

updated: Wed Aug 04 2021 13:27:51 GMT+0000 (UTC)

published: Wed Aug 04 2021 13:27:51 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト