TabNet: Attentive Interpretable Tabular Learning

Sercan O. Arik; Tomas Pfister

TabNet: 注意深い解釈可能な表学習

我々は、新しく高性能で解釈可能な正準的な深層表形式データ学習アーキテクチャであるTabNetを提案する。TabNetは、各決定ステップでどの特徴から推論すべきかを選択するために逐次注意を使用し、学習容量が最も顕著な特徴に使用されるため、解釈可能性とより効率的な学習を可能にする。我々は、性能が飽和していない広範囲の表形式データにおいて、TabNetが他のニューラルネットワークや決定木を上回る性能を発揮し、解釈可能な特徴の帰属と大域的なモデルの動作についての洞察が得られることを実証した。最後に、我々の知る限りで初めて、表形式データに対する自己教師付き学習を実証し、ラベル付けされていないデータが豊富な場合の教師なし表現学習と比較して性能を大幅に向上させている。

We propose a novel high-performance and interpretable canonical deep tabular data learning architecture, TabNet. TabNet uses sequential attention to choose which features to reason from at each decision step, enabling interpretability and more efficient learning as the learning capacity is used for the most salient features. We demonstrate that TabNet outperforms other neural network and decision tree variants on a wide range of non-performance-saturated tabular datasets and yields interpretable feature attributions plus insights into the global model behavior. Finally, for the first time to our knowledge, we demonstrate self-supervised learning for tabular data, significantly improving performance with unsupervised representation learning when unlabeled data is abundant.

updated: Fri Feb 14 2020 18:37:55 GMT+0000 (UTC)

published: Tue Aug 20 2019 15:46:53 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト