Multi-view Gait Recognition based on Siamese Vision Transformer

Yanchen Yang; Lijun Yun; Ruoyu Li; Feiyan Cheng

Siamese Vision Transformer に基づく多視点歩行認識

ビジョントランスフォーマーは歩行認識に使用されていますが、多視点歩行認識への応用はまだ限られています。異なるビューは、歩行輪郭の特徴の抽出と識別の精度に大きく影響します。これに対処するために、このホワイトペーパーでは Siamese Mobile Vision Transformer (SMViT) を提案します。このモデルは、人間の歩行空間の局所的な特性に焦点を当てるだけでなく、多次元のステップ状態特性を抽出できる遠距離注意関連の特性も考慮しています。さらに、さまざまな視点が歩行特性にどのように影響し、信頼できる視点特徴関係因子を生成するかについて説明します。 CASIA B データセットでの SMViT の平均認識率は 96.4% に達しました。実験結果は、SMViT が、GaitGAN、Multi_view GAN、Posegait、およびその他の歩行認識モデルなどの高度なステップ認識モデルと比較して、最先端のパフォーマンスを達成できることを示しています。

While the Vision Transformer has been used in gait recognition, its application in multi-view gait recognition is still limited. Different views significantly affect the extraction and identification accuracy of the characteristics of gait contour. To address this, this paper proposes a Siamese Mobile Vision Transformer (SMViT). This model not only focuses on the local characteristics of the human gait space but also considers the characteristics of long-distance attention associations, which can extract multi-dimensional step status characteristics. In addition, it describes how different perspectives affect gait characteristics and generate reliable perspective feature relationship factors. The average recognition rate of SMViT on the CASIA B data set reached 96.4%. The experimental results show that SMViT can attain state-of-the-art performance compared to advanced step recognition models such as GaitGAN, Multi_view GAN, Posegait and other gait recognition models.

updated: Wed Oct 19 2022 09:38:54 GMT+0000 (UTC)

published: Wed Oct 19 2022 09:38:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト