Wasserstein Auto-Encoders of Merge Trees (and Persistence Diagrams)

Mathieu Pont; Julien Tierny

This paper presents a computational framework for the Wasserstein auto-encoding of merge trees (MT-WAE), a novel extension of the classical auto-encoder neural network architecture to the Wasserstein metric space of merge trees. In contrast to traditional auto-encoders, which operate on vectorized data, our formulation explicitly manipulates merge trees on their associated metric space at each layer of the network, resulting in superior accuracy and interpretability. Our novel neural network approach can be interpreted as a non-linear generalization of previous linear attempts [65] at merge tree encoding. It also trivially extends to persistence diagrams. Extensive experiments on public ensembles demonstrate the efficiency of our algorithms, with MT-WAE computations on the order of minutes on average. We show the utility of our contributions in two applications adapted from previous work on merge tree encoding [65]. First, we apply MT-WAE to data reduction and reliably compress merge trees by concisely representing them with their coordinates in the final layer of our auto-encoder. Second, we document an application to dimensionality reduction, by exploiting the latent space of our auto-encoder, for the visual analysis of ensemble data. We illustrate the versatility of our framework by introducing two penalty terms that help preserve, in the latent space, both the Wasserstein distances between merge trees and their clusters. In both applications, quantitative experiments assess the relevance of our framework. Finally, we provide a C++ implementation that can be used for reproducibility.
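
The abstract mentions a penalty term that helps preserve Wasserstein distances in the latent space, but it does not give its formula. The sketch below illustrates one common way such a metric-preserving penalty can be written: the mean squared gap between precomputed input-space distances and Euclidean distances between latent coordinates. This form, the function name `metric_penalty`, and the toy distance matrix are all assumptions made for illustration; they are not taken from the paper or its C++ implementation.

```python
import math
from itertools import combinations


def metric_penalty(input_dists, latent_points):
    """Mean squared gap between input distances and latent Euclidean distances.

    input_dists[i][j] is a precomputed distance between inputs i and j
    (for example, a Wasserstein distance between two merge trees).
    latent_points[i] is the latent coordinate vector of input i.
    Hypothetical helper: an assumed form, not the paper's definition.
    """
    pairs = list(combinations(range(len(latent_points)), 2))
    if not pairs:
        return 0.0
    total = 0.0
    for i, j in pairs:
        latent_dist = math.dist(latent_points[i], latent_points[j])
        total += (input_dists[i][j] - latent_dist) ** 2
    return total / len(pairs)


# Toy example: three inputs with made-up pairwise distances.
dists = [[0.0, 1.0, 2.0],
         [1.0, 0.0, 1.0],
         [2.0, 1.0, 0.0]]
faithful = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]   # distances preserved exactly
collapsed = [(0.0, 0.0), (0.0, 0.0), (0.0, 0.0)]  # all points merged together

print(metric_penalty(dists, faithful))   # zero penalty: layout matches the metric
print(metric_penalty(dists, collapsed))  # positive penalty: distances are lost
```

In a training loop, a term of this kind would typically be added to the reconstruction loss with a weight, so that the latent layout trades off reconstruction quality against distance preservation.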

updated: Wed Jul 05 2023 09:46:52 GMT+0000 (UTC)

published: Wed Jul 05 2023 09:46:52 GMT+0000 (UTC)

arXiv
