COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts

Jeonghun Baek; Yusuke Matsui; Kiyoharu Aizawa

COO：任意のテキストまたは切り捨てられたテキストを認識するためのコミックオノマトペデータセット

不規則なテキストを認識することは、テキスト認識において挑戦的なトピックでした。このトピックに関する研究を奨励するために、日本の漫画のオノマトペテキストで構成される新しい漫画オノマトペデータセット（COO）を提供します。 COOには、極端に湾曲した、部分的に縮小されたテキスト、または任意に配置されたテキストなど、多くの任意のテキストがあります。さらに、一部のテキストはいくつかの部分に分かれています。各部分は切り捨てられたテキストであり、それ自体では意味がありません。これらの部分は、意図された意味を表すためにリンクする必要があります。したがって、切り捨てられたテキスト間のリンクを予測する新しいタスクを提案します。オノマトペ領域を検出し、その意図された意味をキャプチャするために、テキスト検出、テキスト認識、およびリンク予測の3つのタスクを実行します。広範な実験を通じて、COOの特性を分析します。私たちのデータとコードはhttps://github.com/ku21fan/COO-Comic-Onomatopoeiaで入手できます。

Recognizing irregular texts has been a challenging topic in text recognition. To encourage research on this topic, we provide a novel comic onomatopoeia dataset (COO), which consists of onomatopoeia texts in Japanese comics. COO has many arbitrary texts, such as extremely curved, partially shrunk texts, or arbitrarily placed texts. Furthermore, some texts are separated into several parts. Each part is a truncated text and is not meaningful by itself. These parts should be linked to represent the intended meaning. Thus, we propose a novel task that predicts the link between truncated texts. We conduct three tasks to detect the onomatopoeia region and capture its intended meaning: text detection, text recognition, and link prediction. Through extensive experiments, we analyze the characteristics of the COO. Our data and code are available at https://github.com/ku21fan/COO-Comic-Onomatopoeia.

updated: Mon Jul 11 2022 07:39:35 GMT+0000 (UTC)

published: Mon Jul 11 2022 07:39:35 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト