Augraphy: A Data Augmentation Library for Document Images

Alexander Groleau; Kok Wei Chee; Stefan Larson; Samay Maini; Jonathan Boarman

Augraphy: ドキュメント画像用のデータ拡張ライブラリ

このホワイトペーパーでは、現実世界のドキュメント画像データセットで一般的に見られる歪みを生成するデータ拡張パイプラインを構築するための Python ライブラリである Augraphy を紹介します。 Augraphy は、古いマシンや汚れたマシンを使用した印刷、スキャン、ファックスなどの標準的なオフィス操作によって変更されたかのように見えるクリーンなドキュメントイメージの拡張バージョンを作成するためのさまざまな戦略を提供することで、他のデータ拡張ツールとは一線を画しています。時間の経過とともにインク、および手書きのマーキング。このホワイトペーパーでは、Augraphy ツールについて説明し、ドキュメントのノイズ除去などのタスク用の多様なトレーニングデータを生成するためのデータ拡張ツールとして、またドキュメントイメージモデリングタスクでモデルのロバスト性を評価するための困難なテストデータを生成するために、Augraphy ツールをどのように使用できるかを示します。

This paper introduces Augraphy, a Python library for constructing data augmentation pipelines which produce distortions commonly seen in real-world document image datasets. Augraphy stands apart from other data augmentation tools by providing many different strategies to produce augmented versions of clean document images that appear as if they have been altered by standard office operations, such as printing, scanning, and faxing through old or dirty machines, degradation of ink over time, and handwritten markings. This paper discusses the Augraphy tool, and shows how it can be used both as a data augmentation tool for producing diverse training data for tasks such as document denoising, and also for generating challenging test data to evaluate model robustness on document image modeling tasks.

updated: Fri Mar 24 2023 21:49:21 GMT+0000 (UTC)

published: Tue Aug 30 2022 22:36:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト