Journalistic Guidelines Aware News Image Captioning

Xuewen Yang; Svebor Karaman; Joel Tetreault; Alex Jaimes

ジャーナリズムのガイドラインを意識したニュース画像のキャプション

ニュース記事の画像のキャプションのタスクは、ニュース記事の画像の説明的で有益なキャプションを生成することを目的としています。一般的な用語で画像のコンテンツを単に説明する従来の画像キャプションとは異なり、ニュース画像キャプションはジャーナリズムのガイドラインに従い、画像コンテンツを説明するために名前付きエンティティに大きく依存し、多くの場合、関連する記事全体からコンテキストを引き出します。この作品では、ジャーナリストが従うキャプションガイドラインに動機付けられた、このタスクへの新しいアプローチを提案します。私たちのアプローチであるジャーナリズムガイドライン対応ニュース画像キャプション（JoGANIC）は、キャプションの構造を活用して、生成品質を向上させ、表現デザインをガイドします。 2つの大規模な公開データセットでの詳細なアブレーション研究を含む実験結果は、JoGANICがキャプション生成と名前付きエンティティ関連のメトリックの両方で最先端の方法を大幅に上回っていることを示しています。

The task of news article image captioning aims to generate descriptive and informative captions for news article images. Unlike conventional image captions that simply describe the content of the image in general terms, news image captions follow journalistic guidelines and rely heavily on named entities to describe the image content, often drawing context from the whole article they are associated with. In this work, we propose a new approach to this task, motivated by caption guidelines that journalists follow. Our approach, Journalistic Guidelines Aware News Image Captioning (JoGANIC), leverages the structure of captions to improve the generation quality and guide our representation design. Experimental results, including detailed ablation studies, on two large-scale publicly available datasets show that JoGANIC substantially outperforms state-of-the-art methods both on caption generation and named entity related metrics.

updated: Fri Sep 10 2021 18:16:21 GMT+0000 (UTC)

published: Tue Sep 07 2021 04:49:50 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト