Synthesis in Style: Semantic Segmentation of Historical Documents using Synthetic Data

Christian Bartz; Hendrik Rätz; Jona Otholt; Christoph Meinel; Haojin Yang

スタイルでの合成：合成データを使用した歴史的文書のセマンティックセグメンテーション

歴史的文書の自動分析における最も差し迫った問題の1つは、注釈付きのトレーニングデータの可用性です。問題は、サンプルのラベル付けは人間の専門知識を必要とし、したがってうまく自動化できないため、時間のかかる作業であるということです。この作業では、注釈が利用できない歴史的文書の合成ラベル付きデータセットを構築するための新しい方法を提案します。 StyleGANモデルをトレーニングして、元のドキュメントのコア機能をキャプチャするドキュメント画像を合成します。もともと、StyleGANアーキテクチャはラベルを生成することを目的としていませんでしたが、現実的な画像を生成するための基礎となるセマンティクスを間接的に学習します。このアプローチを使用すると、中間特徴マップからセマンティック情報を抽出し、それを使用してグラウンドトゥルースラベルを生成できます。合成データセットを使用して歴史的文書のテキストをセグメント化できるかどうかを調査するために、それを使用して複数の教師ありセグメンテーションモデルをトレーニングし、それらのパフォーマンスを評価します。また、最先端の合成アプローチによって作成された別のデータセットでこれらのモデルをトレーニングし、データセットでトレーニングされたモデルが、人間による注釈の労力をさらに少なくして、より良い結果を達成することを示します。

One of the most pressing problems in the automated analysis of historical documents is the availability of annotated training data. The problem is that labeling samples is a time-consuming task because it requires human expertise and thus, cannot be automated well. In this work, we propose a novel method to construct synthetic labeled datasets for historical documents where no annotations are available. We train a StyleGAN model to synthesize document images that capture the core features of the original documents. While originally, the StyleGAN architecture was not intended to produce labels, it indirectly learns the underlying semantics to generate realistic images. Using our approach, we can extract the semantic information from the intermediate feature maps and use it to generate ground truth labels. To investigate if our synthetic dataset can be used to segment the text in historical documents, we use it to train multiple supervised segmentation models and evaluate their performance. We also train these models on another dataset created by a state-of-the-art synthesis approach to show that the models trained on our dataset achieve better results while requiring even less human annotation effort.

updated: Tue Jan 25 2022 10:38:29 GMT+0000 (UTC)

published: Wed Jul 14 2021 15:36:47 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト