Comparative Analysis of Deep Learning Models for Brand Logo Classification in Real-World Scenarios

Qimao Yang; Huili Chen; Qiwei Dong

This report presents a comprehensive study of deep learning models for brand logo classification in real-world scenarios. The dataset contains 3,717 labeled logo images from ten prominent brands. Two families of models were evaluated: convolutional neural networks (CNNs) and Vision Transformers (ViTs). The ViT model DaViT small achieved the highest accuracy, 99.60%, while DenseNet29 achieved the fastest inference speed, 366.62 FPS. The findings suggest that DaViT is a suitable choice for offline applications because of its superior accuracy. The study demonstrates a practical application of deep learning to brand logo classification.
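
The two metrics quoted in the abstract, top-1 accuracy (in percent) and inference speed (in frames per second), can be illustrated with a minimal sketch. The abstract does not describe the authors' measurement protocol, so everything here is an assumption: the function names `top1_accuracy` and `measure_fps` are hypothetical, timing is done one image at a time, and the `predict` callable stands in for whatever model is under test.

```python
import time
from typing import Callable, Sequence


def top1_accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Return the percentage of predictions that match the true labels."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(1 for p, y in zip(predictions, labels) if p == y)
    return 100.0 * correct / len(labels)


def measure_fps(
    predict: Callable[[object], int],
    images: Sequence[object],
    warmup: int = 5,
) -> float:
    """Return images processed per second, calling `predict` once per image.

    Assumption: single-image inference; batching, hardware, and data
    loading are not modeled here and would change the measured figure.
    """
    if len(images) == 0:
        raise ValueError("need at least one image to time")
    # Warm-up calls are excluded from timing so one-off setup costs
    # (caches, lazy initialization) do not distort the result.
    for image in images[:warmup]:
        predict(image)
    start = time.perf_counter()
    for image in images:
        predict(image)
    elapsed = time.perf_counter() - start
    # Guard against a zero reading on very coarse clocks.
    return float("inf") if elapsed <= 0 else len(images) / elapsed
```

With a real model, `predict` would wrap the forward pass and return the predicted class index. On a GPU the device would also need to be synchronized before reading the clock, which this sketch omits.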

updated: Sat May 20 2023 17:24:06 GMT+0000 (UTC)

published: Sat May 20 2023 17:24:06 GMT+0000 (UTC)

arXiv
