Online Deep Learning based on Auto-Encoder

Si-si Zhang; Jian-wei Liu; Xin Zuo; Run-kun Lu; Si-ming Lian

Online learning is an important technique for sketching massive, real-time, high-speed data. Although this direction has attracted intensive attention, most of the literature in this area ignores the following three issues: (1) it pays little attention to the underlying abstract hierarchical latent information in examples, even though extracting these abstract hierarchical latent representations helps to better predict the class labels of examples; (2) the idea of a preassigned model for unseen data points is not suitable for modeling streaming data with an evolving probability distribution. This challenge is referred to as model flexibility. With this in mind, the online deep learning model we design should have a variable underlying structure; (3) moreover, it is of utmost importance to fuse these abstract hierarchical latent representations to achieve better classification performance, and different levels of implicit representation information should be given different weights when dealing with data streams whose distribution changes. To address these issues, we propose a two-phase Online Deep Learning based on Auto-Encoder (ODLAE). Based on an auto-encoder and its reconstruction loss, we extract abstract hierarchical latent representations of instances. Based on predictive loss, we devise two fusion strategies: an output-level fusion strategy, which fuses the classification results obtained from each hidden layer of the encoder; and a feature-level fusion strategy, which leverages a self-attention mechanism to fuse the outputs of every hidden layer. Finally, to improve the robustness of the algorithm, we also try to use a denoising auto-encoder to yield hierarchical latent representations. Experimental results on different datasets are presented to verify that our proposed algorithm (ODLAE) outperforms several baselines.
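As a rough illustration of the output-level fusion idea described above, the following minimal sketch combines the class probabilities predicted from each hidden layer using per-layer weights. The abstract does not specify how the weights are learned, so the multiplicative update used here, the discount factor `beta`, and the function names `fuse_outputs` and `update_weights` are all assumptions made for illustration, not the authors' method.

```python
def fuse_outputs(layer_probs, weights):
    """Weighted sum of the class-probability vectors predicted from each hidden layer."""
    n_classes = len(layer_probs[0])
    fused = [0.0] * n_classes
    for w, probs in zip(weights, layer_probs):
        for c in range(n_classes):
            fused[c] += w * probs[c]
    return fused


def update_weights(weights, layer_probs, true_class, beta=0.9):
    """Hypothetical update rule (an assumption, not taken from the paper):
    discount each layer's weight by beta ** loss, then renormalise.
    The loss of a layer is 1 minus the probability it gave the true class."""
    discounted = []
    for w, probs in zip(weights, layer_probs):
        loss = 1.0 - probs[true_class]  # lies in [0, 1]
        discounted.append(w * beta ** loss)
    total = sum(discounted)
    return [w / total for w in discounted]


# One streaming step with three hidden layers and two classes (made-up numbers).
layer_probs = [[0.9, 0.1], [0.4, 0.6], [0.5, 0.5]]
weights = [1.0 / 3] * 3

fused = fuse_outputs(layer_probs, weights)  # fused prediction for this instance
predicted_class = max(range(len(fused)), key=lambda c: fused[c])

# Once the true label (class 0 here) is revealed, re-weight the layers.
new_weights = update_weights(weights, layer_probs, true_class=0)
```

In this sketch, a layer whose classifier assigned more probability to the revealed label keeps a larger share of the weight at the next step, which is one simple way to give different weights to different levels of representation as the data distribution drifts.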

updated: Wed Jan 19 2022 02:14:57 GMT+0000 (UTC)

published: Wed Jan 19 2022 02:14:57 GMT+0000 (UTC)

arXiv
