Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks

Le Xue; Mingfei Gao; Zeyuan Chen; Caiming Xiong; Ran Xu


We propose a novel framework to evaluate the robustness of transformer-based form field extraction methods via form attacks. We introduce 14 novel form transformations to evaluate the vulnerability of state-of-the-art field extractors to form attacks at both the OCR level and the form level, including OCR location/order rearrangement, form background manipulation, and form field-value augmentation. We conduct the robustness evaluation on real invoices and receipts and perform a comprehensive analysis. Experimental results suggest that the evaluated models are very susceptible to form perturbations such as variation of field-values (~15% drop in F1 score), disarrangement of the input text order (~15% drop in F1 score), and disruption of the neighboring words of field-values (~10% drop in F1 score). Guided by the analysis, we make recommendations to improve the design of field extractors and the process of data collection.
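
To make the idea of an OCR-order attack concrete, the following is a minimal sketch of one possible perturbation: shuffling the order in which OCR words are fed to an extractor while leaving their text and bounding boxes untouched. The `OcrWord` type, the `shuffle_reading_order` function, and the sample words are hypothetical names chosen for illustration; this is not the paper's implementation, and the abstract does not specify how its 14 transformations are defined.

```python
import random
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class OcrWord:
    """One OCR token: its text and bounding box (x0, y0, x1, y1). Hypothetical type."""
    text: str
    box: Tuple[int, int, int, int]


def shuffle_reading_order(words: List[OcrWord], seed: int = 0) -> List[OcrWord]:
    """Return the same words in a randomly permuted order.

    Only the sequence order changes; text and boxes are preserved, so a model
    that relies purely on 2D layout should in principle be unaffected.
    """
    rng = random.Random(seed)   # local RNG keeps the attack reproducible
    shuffled = list(words)      # copy so the caller's list is not mutated
    rng.shuffle(shuffled)
    return shuffled


# Illustrative input: four tokens from an imaginary invoice line.
sample = [
    OcrWord("Invoice", (10, 10, 80, 30)),
    OcrWord("Total:", (10, 40, 60, 60)),
    OcrWord("$42.00", (70, 40, 130, 60)),
    OcrWord("Due", (10, 70, 40, 90)),
]
attacked = shuffle_reading_order(sample, seed=1)
```

Comparing an extractor's F1 score on `sample`-style inputs against the `attacked` versions would give one estimate of how much the model depends on input token order, under the assumptions above.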

updated: Fri Oct 08 2021 23:58:24 GMT+0000 (UTC)

published: Fri Oct 08 2021 23:58:24 GMT+0000 (UTC)

arXiv
