Robust Classification by Pre-conditioned LASSO and Transductive Diffusion Component Analysis

Yanwei Fu; De-An Huang; Leonid Sigal

Modern machine learning-based recognition approaches require large-scale datasets with a large number of labeled training images. However, such datasets are inherently difficult and costly to collect and annotate. Hence there is great and growing interest in automatic dataset collection methods that can leverage the web. Collecting datasets in this way, however, requires robust and efficient ways of detecting and excluding outliers, which are common and prevalent. So far, there has been limited effort in the machine learning community to directly detect outliers for robust classification. Inspired by recent work on Pre-conditioned LASSO, this paper formulates the outlier detection task using Pre-conditioned LASSO and employs unsupervised transductive diffusion component analysis both to integrate the topological structure of the data manifold, from labeled and unlabeled instances, and to reduce the feature dimensionality. Synthetic experiments as well as results on two real-world classification tasks show that our framework can robustly detect outliers and improve classification.

updated: Wed Dec 25 2019 02:06:46 GMT+0000 (UTC)

published: Thu Nov 19 2015 20:13:51 GMT+0000 (UTC)

arXiv
