Class Normalization for (Continual)? Generalized Zero-Shot Learning

Ivan Skorokhodov; Mohamed Elhoseiny


Normalization techniques have proved to be a crucial ingredient of successful training in the traditional supervised learning regime. However, in the zero-shot learning (ZSL) world, these ideas have received only marginal attention. This work studies normalization in the ZSL scenario from both theoretical and practical perspectives. First, we give a theoretical explanation for two popular tricks used in zero-shot learning, normalize+scale and attributes normalization, and show that they help training by preserving variance during a forward pass. Next, we demonstrate that they are insufficient to normalize a deep ZSL model and propose Class Normalization (CN), a normalization scheme that alleviates this issue both provably and in practice. Third, we show that ZSL models typically have a more irregular loss surface than traditional classifiers and that the proposed method partially remedies this problem. Then, we test our approach on 4 standard ZSL datasets and outperform sophisticated modern SotA with a simple MLP that is optimized without any bells and whistles and trains ~50 times faster. Finally, we generalize ZSL to a broader problem, continual ZSL, and introduce some principled metrics and rigorous baselines for this new setup. The project page is located at https://universome.github.io/class-norm.
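The abstract names three normalization steps but gives no formulas. The sketch below is one plausible reading, not the authors' implementation: attributes normalization is taken to mean scaling each class attribute vector to unit L2 norm, normalize+scale is taken to mean cosine similarity multiplied by a constant, and the class-wise step is taken to mean standardizing each dimension across the class axis. The function names, the scale `gamma=5.0`, and the toy data are all assumptions made for illustration.

```python
import math

def l2_normalize(v, eps=1e-12):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / (norm + eps) for x in v]

def normalize_scale_logits(feature, prototypes, gamma=5.0):
    """Assumed 'normalize+scale': cosine similarity to each prototype, times gamma."""
    z = l2_normalize(feature)
    return [gamma * sum(a * b for a, b in zip(z, l2_normalize(p)))
            for p in prototypes]

def standardize_across_classes(prototypes, eps=1e-5):
    """Assumed class-wise step: zero mean, unit variance per dimension over classes."""
    n, dims = len(prototypes), len(prototypes[0])
    means = [sum(p[d] for p in prototypes) / n for d in range(dims)]
    variances = [sum((p[d] - means[d]) ** 2 for p in prototypes) / n
                 for d in range(dims)]
    return [[(p[d] - means[d]) / math.sqrt(variances[d] + eps)
             for d in range(dims)] for p in prototypes]

# Toy data (made up): 3 classes, each described by 2 attributes.
attributes = [[1.0, 3.0], [2.0, 1.0], [4.0, 2.0]]
attrs_unit = [l2_normalize(a) for a in attributes]        # attributes normalization
protos = standardize_across_classes(attributes)            # class-wise standardization
logits = normalize_scale_logits([0.5, 2.0], attributes)   # normalize+scale
predicted_class = logits.index(max(logits))
```

In a real ZSL model the class-wise standardization would sit inside the network that maps attributes to class prototypes rather than being applied to raw attributes; it is applied directly here only to keep the example short.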

updated: Wed Apr 14 2021 16:12:34 GMT+0000 (UTC)

published: Fri Jun 19 2020 19:05:24 GMT+0000 (UTC)

arXiv
