PetroGAN: A novel GAN-based approach to generate realistic, label-free petrographic datasets

I. Ferreira; L. Ochoa; A. Koeshidayatullah

Deep learning architectures have enriched data analytics in the geosciences, complementing traditional approaches to geological problems. Although deep learning applications in the geosciences show encouraging signs, their actual potential remains untapped. This is primarily because geological datasets, particularly petrographic ones, are limited, time-consuming, and expensive to obtain, and producing a high-quality labeled dataset requires in-depth expert knowledge. We address these issues by developing a novel deep learning framework based on generative adversarial networks (GANs) to create the first realistic synthetic petrographic dataset. The StyleGAN2 architecture was selected because it allows robust replication of the statistical and aesthetic characteristics of petrographic data and improves their internal variance. The training dataset consists of 10,070 images of rock thin sections in both plane- and cross-polarized light. The model was trained for 264 GPU hours and reached a state-of-the-art Fréchet Inception Distance (FID) score of 12.49 for petrographic images. We further observed that FID values vary with lithology type and image resolution. In our survey, subject matter experts found the generated images indistinguishable from real images. This study highlights that GANs are a powerful method for generating realistic synthetic data and experimenting with the latent space, and a prospective tool for self-labelling, reducing the effort of creating geological datasets.
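
For readers unfamiliar with the FID metric quoted in the abstract, the following is a minimal sketch of the Fréchet distance between two sets of feature vectors, each modeled as a Gaussian. It is not the authors' evaluation code. The function name `frechet_distance` and the input arrays are hypothetical, and the step that makes this an *Inception* distance (extracting features from real and generated images with a pretrained Inception network) is assumed to have happened already and is omitted here.

```python
import numpy as np


def frechet_distance(feats_real, feats_fake):
    """Frechet distance between Gaussians fitted to two feature sets.

    Each input is assumed to be an array of shape (n_samples, n_features)
    with n_features >= 2. In a real FID computation the rows would be
    Inception features of real and generated images (not computed here).
    """
    mu_r = feats_real.mean(axis=0)
    mu_f = feats_fake.mean(axis=0)
    cov_r = np.cov(feats_real, rowvar=False)
    cov_f = np.cov(feats_fake, rowvar=False)

    # Squared distance between the two means.
    diff = mu_r - mu_f
    mean_term = float(diff @ diff)

    # Tr((cov_r cov_f)^(1/2)) equals the sum of the square roots of the
    # eigenvalues of cov_r @ cov_f, which are real and non-negative for
    # positive semi-definite covariances (up to numerical noise).
    eigvals = np.linalg.eigvals(cov_r @ cov_f)
    tr_sqrt = float(np.sqrt(np.clip(eigvals.real, 0.0, None)).sum())

    return mean_term + float(np.trace(cov_r) + np.trace(cov_f)) - 2.0 * tr_sqrt


# Toy usage with random stand-in "features" (not real image features).
rng = np.random.default_rng(0)
real = rng.normal(size=(500, 4))
fake = real + 2.0  # same covariance, mean shifted by 2 in each of 4 dimensions
score_same = frechet_distance(real, real)
score_shifted = frechet_distance(real, fake)
```

Lower values mean the two feature distributions are closer. In the toy example the covariances are identical, so the distance reduces to the squared mean shift.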

updated: Thu Apr 07 2022 01:55:53 GMT+0000 (UTC)

published: Thu Apr 07 2022 01:55:53 GMT+0000 (UTC)

arXiv
