To Perceive or Not to Perceive: Lightweight Stacked Hourglass Network

Jameel Hassan Abdul Samadh; Salwa K. Al Khatib

Human pose estimation (HPE) is a classical task in computer vision that focuses on representing the orientation of a person by identifying the positions of their joints. We design a lighter version of the stacked hourglass network with minimal loss in model performance. The lightweight 2-stacked hourglass uses a reduced number of channels with depthwise separable convolutions, residual connections with concatenation, and residual connections between the necks of the hourglasses. The final model shows a marginal drop in performance with a 79% reduction in the number of parameters and a similar reduction in multiply-adds (MAdds).
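To illustrate why depthwise separable convolutions reduce the parameter count, the following minimal sketch compares the weights in a standard convolution with those in its depthwise separable counterpart. The function names and the 256-channel, 3x3 layer are assumptions chosen for illustration, not values taken from the paper, and the 79% figure above refers to the whole model rather than to any single layer.

```python
def conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a depthwise k x k convolution followed by a 1 x 1 pointwise one."""
    depthwise = c_in * k * k   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


def param_reduction(c_in: int, c_out: int, k: int) -> float:
    """Fraction of weights saved by the separable form for one layer."""
    standard = conv_params(c_in, c_out, k)
    separable = depthwise_separable_params(c_in, c_out, k)
    return 1.0 - separable / standard


if __name__ == "__main__":
    # Hypothetical layer shape, used only for illustration.
    c_in, c_out, k = 256, 256, 3
    print("standard :", conv_params(c_in, c_out, k))
    print("separable:", depthwise_separable_params(c_in, c_out, k))
    print("reduction: {:.1%}".format(param_reduction(c_in, c_out, k)))
```

For a fixed kernel size, the per-layer saving approaches 1 - 1/k² as the number of output channels grows, which is why the substitution is attractive for 3x3 layers in wide networks.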

updated: Thu Feb 09 2023 18:04:43 GMT+0000 (UTC)

published: Thu Feb 09 2023 18:04:43 GMT+0000 (UTC)

arXiv
