Doubly Contrastive End-to-End Semantic Segmentation for Autonomous Driving under Adverse Weather

Jongoh Jeong; Jong-Hwan Kim

悪天候下での自動運転のための二重に対照的なエンドツーエンドのセマンティックセグメンテーション

最近の自動運転車にとって、道路状況を理解するタスクは非常に重要になっています。特に、リアルタイムのセマンティックセグメンテーションは、インテリジェントな自動運転エージェントが運転エリア内の路傍の物体を認識するために不可欠です。以前の研究では、主に計算負荷の高い操作でセグメンテーションのパフォーマンスを改善しようとしてきたため、トレーニングと展開の両方に非常に多くのハードウェアリソースが必要となり、リアルタイムアプリケーションには適していません。そのため、特に霧、夜間、雨、雪などの悪天候下で、自動運転のためのより実用的な軽量モデルのパフォーマンスを向上させる二重の対照的なアプローチを提案します。提案されたアプローチは、グローバルな一貫性のためのメモリバンクや従来の対比法で使用される事前トレーニングステップを必要とせずに、エンドツーエンドの教師あり学習スキームで画像レベルとピクセルレベルの両方のコントラストを活用します。 ACDC データセットで SwiftNet を使用してこの方法の有効性を検証し、推論時に単一の RTX 3080 Mobile GPU で 66.7 FPS (2048x1024 解像度) で mIoU (ResNet-18 バックボーン) で最大 1.34%p の改善を達成しました。さらに、画像レベルの監視を自己監視に置き換えると、晴天の画像で事前トレーニングした場合に同等のパフォーマンスが得られることを示しています。

Road scene understanding tasks have recently become crucial for self-driving vehicles. In particular, real-time semantic segmentation is indispensable for intelligent self-driving agents to recognize roadside objects in the driving area. As prior research works have primarily sought to improve the segmentation performance with computationally heavy operations, they require far significant hardware resources for both training and deployment, and thus are not suitable for real-time applications. As such, we propose a doubly contrastive approach to improve the performance of a more practical lightweight model for self-driving, specifically under adverse weather conditions such as fog, nighttime, rain and snow. Our proposed approach exploits both image- and pixel-level contrasts in an end-to-end supervised learning scheme without requiring a memory bank for global consistency or the pretraining step used in conventional contrastive methods. We validate the effectiveness of our method using SwiftNet on the ACDC dataset, where it achieves up to 1.34%p improvement in mIoU (ResNet-18 backbone) at 66.7 FPS (2048x1024 resolution) on a single RTX 3080 Mobile GPU at inference. Furthermore, we demonstrate that replacing image-level supervision with self-supervision achieves comparable performance when pre-trained with clear weather images.

updated: Mon Nov 21 2022 00:26:41 GMT+0000 (UTC)

published: Mon Nov 21 2022 00:26:41 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト