Unsupervised Out-of-Distribution Dialect Detection with Mahalanobis Distance

Sourya Dipta Das; Yash Vadi; Abhishek Unnam; Kuldeep Yadav

マハラノビス距離を使用した教師なし分布外方言検出

方言分類は、システム全体のパフォーマンスを向上させるために、機械翻訳や音声認識などのさまざまなアプリケーションで使用されます。現実のシナリオでは、展開された方言分類モデルは、トレーニングデータの分布とは異なる異常な入力 (分布外 (OOD) サンプルとも呼ばれます) に遭遇する可能性があります。これらの OOD サンプルは、モデルのトレーニング中にそのサンプルの方言が認識されないため、予期しない出力につながる可能性があります。分布外検出は、方言分類の文脈ではほとんど注目されていない新しい研究分野です。これに向けて、分布外のサンプルを検出するための、シンプルかつ効果的な教師なしマハラノビス距離特徴ベースの方法を提案しました。マルチタスク学習には、wav2vec 2.0 トランスフォーマーベースの方言分類子モデルのすべての中間層からの潜在的な埋め込みを利用します。私たちが提案したアプローチは、他の最先端の OOD 検出方法よりも大幅に優れています。

Dialect classification is used in a variety of applications, such as machine translation and speech recognition, to improve the overall performance of the system. In a real-world scenario, a deployed dialect classification model can encounter anomalous inputs that differ from the training data distribution, also called out-of-distribution (OOD) samples. Those OOD samples can lead to unexpected outputs, as dialects of those samples are unseen during model training. Out-of-distribution detection is a new research area that has received little attention in the context of dialect classification. Towards this, we proposed a simple yet effective unsupervised Mahalanobis distance feature-based method to detect out-of-distribution samples. We utilize the latent embeddings from all intermediate layers of a wav2vec 2.0 transformer-based dialect classifier model for multi-task learning. Our proposed approach outperforms other state-of-the-art OOD detection methods significantly.

updated: Wed Aug 09 2023 11:33:53 GMT+0000 (UTC)

published: Wed Aug 09 2023 11:33:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト