Deep Multi-Task Networks For Occluded Pedestrian Pose Estimation

Arindam Das; Sudip Das; Ganesh Sistu; Jonathan Horgan; Ujjwal Bhattacharya; Edward Jones; Martin Glavin; Ciarán Eising

閉塞した歩行者の姿勢推定のための深層マルチタスクネットワーク

歩行者の姿勢推定に関する既存の作業のほとんどは、閉塞された部分の注釈が関連する自動車データセットで利用できないため、閉塞された歩行者の姿勢の推定を考慮していません。たとえば、自動車シーンでの歩行者検出用のよく知られたデータセットである CityPersons は姿勢の注釈を提供しませんが、自動車以外のデータセットである MS-COCO には人間の姿勢推定が含まれています。この作業では、これら 2 つの分布で別々に実行される検出タスクとインスタンスセグメンテーションタスクを通じて歩行者の特徴を抽出するためのマルチタスクフレームワークを提案します。その後、エンコーダーは、両方の分布からの歩行者インスタンスに対して教師なしインスタンスレベルドメイン適応法を使用してポーズ固有の特徴を学習します。提案されたフレームワークは、ポーズ推定、歩行者検出、およびインスタンスセグメンテーションの最先端のパフォーマンスを向上させました。

Most of the existing works on pedestrian pose estimation do not consider estimating the pose of an occluded pedestrian, as the annotations of the occluded parts are not available in relevant automotive datasets. For example, CityPersons, a well-known dataset for pedestrian detection in automotive scenes does not provide pose annotations, whereas MS-COCO, a non-automotive dataset, contains human pose estimation. In this work, we propose a multi-task framework to extract pedestrian features through detection and instance segmentation tasks performed separately on these two distributions. Thereafter, an encoder learns pose specific features using an unsupervised instance-level domain adaptation method for the pedestrian instances from both distributions. The proposed framework has improved state-of-the-art performances of pose estimation, pedestrian detection, and instance segmentation.

updated: Mon Aug 08 2022 14:03:51 GMT+0000 (UTC)

published: Wed Jun 15 2022 13:09:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト