Live American Sign Language Letter Classification with Convolutional Neural Networks

Kyle Boone; Ben Wurster; Seth Thao; Yu Hen Hu

This project centers on building a neural network that recognizes American Sign Language (ASL) letters in images, particularly in frames from a live video feed. Initial results fell short of expectations: both the convolutional network and the VGG16 transfer learning approach failed to generalize to settings with different backgrounds. A pre-trained hand joint detection model was then adopted, and the joint locations it produced were fed into a fully connected neural network. This approach outperformed the earlier methods and generalized well to a live video feed application.
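
The joint-based pipeline described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the joint detector, the number of joints, the network size, or any preprocessing. The 21 two-dimensional joints, the 26 output classes, the hidden width, and the translation/scale normalization are all hypothetical choices, and the weights are random stand-ins for trained parameters.

```python
import math
import random

NUM_JOINTS = 21   # assumption: the detector returns 21 (x, y) joints per hand
NUM_CLASSES = 26  # assumption: one output class per letter A-Z
HIDDEN = 64       # hypothetical hidden-layer width


def normalize(joints):
    """Shift joints so the first one is the origin, scale to unit size, flatten."""
    x0, y0 = joints[0]
    shifted = [(x - x0, y - y0) for x, y in joints]
    scale = max(math.hypot(x, y) for x, y in shifted) or 1.0
    return [c / scale for point in shifted for c in point]


def init_layer(n_in, n_out, rng):
    """Random weights and zero biases; a real model would load trained values."""
    limit = math.sqrt(6.0 / (n_in + n_out))
    weights = [[rng.uniform(-limit, limit) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def dense(x, layer):
    """Affine map: one weighted sum plus bias per output unit."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def classify(joints, layers):
    """Fully connected network: ReLU hidden layers, softmax output."""
    h = normalize(joints)
    for layer in layers[:-1]:
        h = [max(0.0, v) for v in dense(h, layer)]
    return softmax(dense(h, layers[-1]))


rng = random.Random(0)
layers = [init_layer(2 * NUM_JOINTS, HIDDEN, rng), init_layer(HIDDEN, NUM_CLASSES, rng)]

# Stand-in for detector output on one video frame (random points, not real data).
fake_joints = [(rng.random(), rng.random()) for _ in range(NUM_JOINTS)]
probs = classify(fake_joints, layers)
predicted = chr(ord("A") + probs.index(max(probs)))
```

Because the classifier only sees joint coordinates, the image background never reaches it, which is one plausible reading of why this approach generalized better than the pixel-based networks. The normalization step here additionally makes the input independent of where the hand sits in the frame and how large it appears.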

updated: Fri May 26 2023 18:29:33 GMT+0000 (UTC)

published: Fri May 26 2023 18:29:33 GMT+0000 (UTC)

arXiv
