Kernel Inversed Pyramidal Resizing Network for Efficient Pavement Distress Recognition

Rong Qin; Luwen Huangfu; Devon Hood; James Ma; Sheng Huang

Pavement Distress Recognition (PDR) is an important step in pavement inspection, and image-based automation can expedite the process and reduce labor costs. Pavement images are often high-resolution, with a low ratio of distressed to non-distressed area. Advanced approaches leverage these properties by dividing images into patches and exploring discriminative features in scale space. However, these approaches usually suffer from information loss during image resizing and from low efficiency due to complex learning frameworks. In this paper, we propose a novel and efficient method for PDR. A lightweight network named the Kernel Inversed Pyramidal Resizing Network (KIPRN) is introduced for image resizing; it can be flexibly plugged into an image classification network as a pre-network to exploit resolution and scale information. In KIPRN, pyramidal convolution and kernel inversed convolution are specifically designed to mine discriminative information across different feature granularities and scales. The mined information is passed along to the resized images to yield an informative image pyramid that assists the image classification network in PDR. We applied our method to three well-known Convolutional Neural Networks (CNNs) and evaluated it on a large-scale pavement image dataset named CQU-BPDD. Extensive results demonstrate that KIPRN generally improves the pavement distress recognition of these CNN models, and that the simple combination of KIPRN and EfficientNet-B3 significantly outperforms the state-of-the-art patch-based method in both performance and efficiency.

updated: Sun Dec 04 2022 10:40:40 GMT+0000 (UTC)

published: Sun Dec 04 2022 10:40:40 GMT+0000 (UTC)

arXiv
