CNN Model & Tuning for Global Road Damage Detection

Rahul Vishwakarma; Ravigopal Vennelakanti

グローバルな道路損傷検出のためのCNNモデルとチューニング

このホワイトペーパーでは、モデルの選択、チューニング戦略、Global Road Damage DetectionChallengeで得られた結果などのソリューションに関するレポートを提供します。このビッグデータカップチャレンジは、ビッグデータ2020に関するIEEE国際会議の一環として開催されました。オブジェクト検出のためのシングルステージおよびマルチステージネットワークアーキテクチャを評価し、次のような人気のある最先端のオープンソースPyTorchフレームワークを使用してベンチマークを提供します。 Detectron2とYolov5。チェコ、インド、日本のスマートフォンカメラを使用してキャプチャされた、提供された道路損傷トレーニングデータセットのデータ準備について説明します。単一の一般化可能なモデルに関して、国ごとにトレーニングの効果を調査しました。 Deep Residual Network（Resnet）とFeature Pyramid Network（FPN）バックボーンを備えた2ステージのFasterR-CNNで実行された実験のチューニング戦略について簡単に説明します。さらに、これをCross Stage Partial Network（CSPNet）バックボーンを備えた1ステージのYolov5モデルと比較します。 Resnet-50およびResnet-101バックボーンを使用したマルチステージFasterR-CNNモデルを使用して、Test2で0.542、Test1データセットで0.536の平均F1スコアを示します。これは、より複雑なモデルと比較した場合のResnet-50モデルの一般化可能性を示しています。実験は、K80を搭載したGoogle Colabと、1080Ti、NVIDIAコンシューマーグレードGPUを搭載したLinuxPCを使用して実施されました。平均F1スコアを前処理、トレーニング、テスト、送信するためのPyTorchベースのDetectron2コードは、https：//github.com/vishwakarmarhl/rdd2020で入手できます。

This paper provides a report on our solution including model selection, tuning strategy and results obtained for Global Road Damage Detection Challenge. This Big Data Cup Challenge was held as a part of IEEE International Conference on Big Data 2020. We assess single and multi-stage network architectures for object detection and provide a benchmark using popular state-of-the-art open-source PyTorch frameworks like Detectron2 and Yolov5. Data preparation for provided Road Damage training dataset, captured using smartphone camera from Czech, India and Japan is discussed. We studied the effect of training on a per country basis with respect to a single generalizable model. We briefly describe the tuning strategy for the experiments conducted on two-stage Faster R-CNN with Deep Residual Network (Resnet) and Feature Pyramid Network (FPN) backbone. Additionally, we compare this to a one-stage Yolov5 model with Cross Stage Partial Network (CSPNet) backbone. We show a mean F1 score of 0.542 on Test2 and 0.536 on Test1 datasets using a multi-stage Faster R-CNN model, with Resnet-50 and Resnet-101 backbones respectively. This shows the generalizability of the Resnet-50 model when compared to its more complex counterparts. Experiments were conducted using Google Colab having K80 and a Linux PC with 1080Ti, NVIDIA consumer grade GPU. A PyTorch based Detectron2 code to preprocess, train, test and submit the Avg F1 score to is made available at https://github.com/vishwakarmarhl/rdd2020

updated: Wed Mar 17 2021 09:01:23 GMT+0000 (UTC)

published: Wed Mar 17 2021 09:01:23 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト