Tiny CNN for feature point description for document analysis: approach and dataset

A. Sheshkus; A. Chirvonaya; V. L. Arlazarov

ドキュメント分析の特徴点の説明のための小さなCNN：アプローチとデータセット

この論文では、ドキュメント分析とテンプレートマッチングのコンテキストで特徴点の記述の問題を研究します。私たちの研究は、特に限られた計算リソースを持つデバイスで使用できる軽量のニューラルネットワークをトレーニングする場合、タスクに特定のトレーニングデータが必要であることを示しています。この論文では、パッチ検索をトレーニングする方法を備えたデータセットを構築して提供します。軽量ニューラルネットワークをトレーニングすることでこのデータの有効性を証明し、ドキュメントと一般的なパッチマッチングの両方でどのように機能するかを示します。トレーニングは、HPatchesトレーニングデータセットと比較して提供されたデータセットで行われました。テストには、HPatchesテストフレームワークと、複雑な背景に描かれたさまざまなドキュメントを含む2つの公開されているデータセットMIDV-500とMIDV-2019を使用します。

In this paper, we study the problem of feature points description in the context of document analysis and template matching. Our study shows that the specific training data is required for the task especially if we are to train a lightweight neural network that will be usable on devices with limited computational resources. In this paper, we construct and provide a dataset with a method of training patches retrieval. We prove the effectiveness of this data by training a lightweight neural network and show how it performs in both documents and general patches matching. The training was done on the provided dataset in comparison with HPatches training dataset and for the testing we use HPatches testing framework and two publicly available datasets with various documents pictured on complex backgrounds: MIDV-500 and MIDV-2019.

updated: Thu Sep 09 2021 09:45:47 GMT+0000 (UTC)

published: Thu Sep 09 2021 09:45:47 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト