Solution for Large-scale Long-tailed Recognition with Noisy Labels

Yuqiao Xian; Jia-Xin Zhuang; Fufu Yu

ノイズの多いラベルを使用した大規模なロングテール認識のソリューション

これは、CVPR 2021 AliProductsChallengeのテクニカルレポートです。 AliProducts Challengeは、世界をリードするeコマース企業が直面する大規模できめ細かい商品画像認識の問題を研究するために提案されたコンテストです。大規模な製品認識は、ノイズの多い注釈、不均衡な（ロングテール）データ分散、およびきめ細かい分類の課題に同時に対応します。私たちのソリューションでは、ResNeSt、EfficientNetV2、DeiTなど、CNNとTransformerの両方の最先端のモデルアーキテクチャを採用しています。反復データクリーニング、分類器の重みの正規化、高解像度の微調整、およびテスト時間の拡張が、ノイズの多い不均衡なデータセットを使用したトレーニングのパフォーマンスを向上させるための重要なコンポーネントであることがわかりました。最後に、アンサンブルモデルを使用して、リーダーボードで6.4365％の平均クラスエラー率を取得します。

This is a technical report for CVPR 2021 AliProducts Challenge. AliProducts Challenge is a competition proposed for studying the large-scale and fine-grained commodity image recognition problem encountered by worldleading ecommerce companies. The large-scale product recognition simultaneously meets the challenge of noisy annotations, imbalanced (long-tailed) data distribution and fine-grained classification. In our solution, we adopt stateof-the-art model architectures of both CNNs and Transformer, including ResNeSt, EfficientNetV2, and DeiT. We found that iterative data cleaning, classifier weight normalization, high-resolution finetuning, and test time augmentation are key components to improve the performance of training with the noisy and imbalanced dataset. Finally, we obtain 6.4365% mean class error rate in the leaderboard with our ensemble model.

updated: Sun Jun 20 2021 12:09:38 GMT+0000 (UTC)

published: Sun Jun 20 2021 12:09:38 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト