Text Detection & Recognition in the Wild for Robot Localization

Zobeir Raisi; John Zelek

ロボットのローカリゼーションのための野生のテキスト検出と認識

標識はいたるところにあり、ロボットは標識を利用して、ローカライズ（視覚的場所認識（VPR）を含む）と地図作成を支援できる必要があります。ポーズ、不規則なテキスト、照明、オクルージョンなどの要因により、野生での堅牢なテキストの検出と認識は困難です。テキスト文字列とバウンディングボックスを同時に出力するエンドツーエンドのシーンテキストスポッティングモデルを提案します。このモデルはVPRにより適しています。私たちの中心的な貢献は、エンドツーエンドのシーンテキストスポッティングフレームワークを利用して、さまざまな困難な場所にある不規則で閉塞されたテキスト領域を適切にキャプチャすることです。提案されたアーキテクチャのVPRのパフォーマンスを評価するために、挑戦的なSelf-Collected Text Place（SCTP）ベンチマークデータセットでいくつかの実験を行いました。初期の実験結果は、このベンチマークでテストした場合、提案された方法が適合率と再現率の点でSOTA方法よりも優れていることを示しています。

Signage is everywhere and a robot should be able to take advantage of signs to help it localize (including Visual Place Recognition (VPR)) and map. Robust text detection & recognition in the wild is challenging due to such factors as pose, irregular text, illumination, and occlusion. We propose an end-to-end scene text spotting model that simultaneously outputs the text string and bounding boxes. This model is more suitable for VPR. Our central contribution is introducing utilizing an end-to-end scene text spotting framework to adequately capture the irregular and occluded text regions in different challenging places. To evaluate our proposed architecture's performance for VPR, we conducted several experiments on the challenging Self-Collected Text Place (SCTP) benchmark dataset. The initial experimental results show that the proposed method outperforms the SOTA methods in terms of precision and recall when tested on this benchmark.

updated: Thu May 19 2022 14:04:10 GMT+0000 (UTC)

published: Tue May 17 2022 18:16:34 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト