Automatic Text Area Segmentation in Natural Images

Syed Ali Raza Jafri; Mireille Boutin; Edward J. Delp

We present a hierarchical method for segmenting text areas in natural images. The method assumes that the text is written in a contrasting color on a more or less uniform background. However, no assumption is made regarding the language or character set used to write the text; in particular, the text can contain simple graphics or symbols. The key feature of our approach is that we first concentrate on finding the background of the text, before testing whether there is actually text on that background. Uniform areas are easy to find in natural images, and a text background defines an area that contains "holes" (where the text is written). We therefore look for uniform areas containing "holes" and label them as text background candidates. Each candidate area is then further tested for the presence of text within its convex hull. We tested our method on a database of 65 images including English and Urdu text. The method correctly segmented all the text areas in 63 of these images, and in only 4 of these images did it also segment areas that do not contain text.
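The background-first idea can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the image has already been quantized into a small grid of color labels, it uses 4-connected flood fill for both regions and their complements, and the function names (`flood`, `uniform_regions`, `hole_pixels`, `text_background_candidates`) and the `min_hole_pixels` parameter are hypothetical. It covers only the candidate-labeling step and leaves out the later text test inside the convex hull.

```python
from collections import deque


def flood(grid, start, passable, seen):
    """Collect the 4-connected cells reachable from start where passable(r, c) holds."""
    rows, cols = len(grid), len(grid[0])
    cells, queue = {start}, deque([start])
    seen.add(start)
    while queue:
        r, c = queue.popleft()
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and (nr, nc) not in seen and passable(nr, nc):
                seen.add((nr, nc))
                cells.add((nr, nc))
                queue.append((nr, nc))
    return cells


def uniform_regions(grid):
    """Split the grid into connected regions of identical label."""
    seen, regions = set(), []
    for r in range(len(grid)):
        for c in range(len(grid[0])):
            if (r, c) not in seen:
                label = grid[r][c]
                same = lambda y, x, label=label: grid[y][x] == label
                regions.append((label, flood(grid, (r, c), same, seen)))
    return regions


def hole_pixels(grid, region):
    """Cells outside the region that cannot reach the image border without crossing it."""
    rows, cols = len(grid), len(grid[0])
    outside = set()
    not_region = lambda y, x: (y, x) not in region
    # Flood the complement of the region, starting from every border cell.
    for r in range(rows):
        for c in range(cols):
            on_border = r in (0, rows - 1) or c in (0, cols - 1)
            if on_border and (r, c) not in region and (r, c) not in outside:
                flood(grid, (r, c), not_region, outside)
    everything = {(r, c) for r in range(rows) for c in range(cols)}
    # Whatever is neither region nor reachable from the border is enclosed.
    return everything - region - outside


def text_background_candidates(grid, min_hole_pixels=1):
    """Return (label, region_cells, hole_cells) for uniform regions that contain holes."""
    candidates = []
    for label, cells in uniform_regions(grid):
        holes = hole_pixels(grid, cells)
        if len(holes) >= min_hole_pixels:
            candidates.append((label, cells, holes))
    return candidates


# Toy label image: 'b' = sign background, 't' = text strokes, 's' = surroundings.
GRID = [
    "bbbbbss",
    "btbtbss",
    "bbbbbss",
]
candidates = text_background_candidates(GRID)
for label, cells, holes in candidates:
    print(label, len(cells), sorted(holes))
```

In this toy grid only the `b` region encloses pixels of another label, so it is the only region kept as a candidate. A real pipeline would then examine the pixels inside each candidate's convex hull to decide whether the holes are actually text.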

updated: Thu Jan 31 2008 01:46:32 GMT+0000 (UTC)

published: Thu Jan 31 2008 01:46:32 GMT+0000 (UTC)

arXiv
