Do Neural Networks Trained with Topological Features Learn Different Internal Representations?

Sarah McGuire; Shane Jackson; Tegan Emerson; Henry Kvinge

トポロジー機能でトレーニングされたニューラルネットワークは、異なる内部表現を学習しますか?

トポロジカルデータ分析によって抽出された特徴を活用して、機械学習モデルをトレーニングする一連の作業が増えています。トポロジカル機械学習 (TML) としても知られるこの分野は、いくつかの顕著な成功を収めていますが、トポロジー機能から学習するプロセスが生データから学習するプロセスとどのように異なるかについての理解はまだ限られています。この作業では、トポロジー機能でトレーニングされたモデルが、元の生データでトレーニングされたモデルによって学習されたものとは根本的に異なるデータの内部表現を学習するかどうかを尋ねることにより、このより大きな問題の 1 つのコンポーネントに対処し始めます。「違い」を定量化するために、ニューラルネットワーク内のデータの隠れた表現の類似性を測定するために使用できる 2 つの一般的なメトリック、ニューラルスティッチングと中心化されたカーネルアラインメントを利用します。これらから、トポロジー機能を使用したトレーニングが、モデルが学習する表現をどのように変更し、変更しないかについて、さまざまな結論を導き出します。おそらく当然のことながら、構造的に、トポロジー機能でトレーニングおよび評価されたモデルの隠れた表現は、対応する生データでトレーニングおよび評価されたものと比較して大幅に異なることがわかりました。一方、私たちの実験では、場合によっては、単純なアフィン変換を使用して、これらの表現を (少なくとも対応するタスクを解決するために必要な程度まで) 調整できることが示されています。これは、生データでトレーニングされたニューラルネットワークが、予測を行う過程でいくつかの限られたトポロジー的特徴を抽出する可能性があることを意味すると推測します。

There is a growing body of work that leverages features extracted via topological data analysis to train machine learning models. While this field, sometimes known as topological machine learning (TML), has seen some notable successes, an understanding of how the process of learning from topological features differs from the process of learning from raw data is still limited. In this work, we begin to address one component of this larger issue by asking whether a model trained with topological features learns internal representations of data that are fundamentally different than those learned by a model trained with the original raw data. To quantify ``different'', we exploit two popular metrics that can be used to measure the similarity of the hidden representations of data within neural networks, neural stitching and centered kernel alignment. From these we draw a range of conclusions about how training with topological features does and does not change the representations that a model learns. Perhaps unsurprisingly, we find that structurally, the hidden representations of models trained and evaluated on topological features differ substantially compared to those trained and evaluated on the corresponding raw data. On the other hand, our experiments show that in some cases, these representations can be reconciled (at least to the degree required to solve the corresponding task) using a simple affine transformation. We conjecture that this means that neural networks trained on raw data may extract some limited topological features in the process of making predictions.

updated: Mon Nov 14 2022 19:19:04 GMT+0000 (UTC)

published: Mon Nov 14 2022 19:19:04 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト