A comparative study of source-finding techniques in HI emission line cubes using SoFiA, MTObjects, and supervised deep learning

J. A. Barkai; M. A. W. Verheijen; E. T. Martínez; M. H. F. Wilkinson

SoFiA、MTObjects、および教師あり深層学習を使用した HI 輝線立方体のソース検出手法の比較研究

原子中性水素 (HI) の 21 cm スペクトル線放出は、電波天文学で観測される主要な波長の 1 つです。ただし、信号は本質的に微弱であり、銀河の HI コンテンツは宇宙環境に依存するため、HI 宇宙を調査するには大規模な調査ボリュームと調査深度が必要です。これらの調査から得られるデータの量は、技術の進歩とともに増加し続けているため、完全性と純度の間のトレードオフを考慮しながら、HI ソースを特定して特徴付けるための自動技術の必要性も高まっています。この調査の目的は、最高のマスク品質と 3D 中性水素キューブ内のアーティファクトが最も少ないソースを見つけてマスキングするための最適なパイプラインを見つけることでした。 3D 中性水素 21 cm スペクトル線データキューブ内のソースを最適に識別してマスクするためのパイプラインを作成するために、さまざまな既存の方法が調査されました。 SoFiA と MTObjects という 2 つの従来のソース検出方法と、V-Net として知られる 3D 畳み込みニューラルネットワークアーキテクチャを使用した新しい教師付きディープラーニングアプローチがテストされました。これら 3 つのソース検出方法は、後処理ステップとして従来の機械学習分類器を追加することでさらに改善され、誤検知が除去されました。パイプラインは、追加の模擬銀河が挿入された Westerbork Synthesis Radio Telescope の HI データキューブでテストされました。ランダムフォレスト分類器と組み合わせた SoFiA が最良の結果を提供し、V-Net とランダムフォレストの組み合わせが僅差で 2 位でした。これは、実際のソースよりも多くの模擬ソースがトレーニングセットに含まれていることが原因であると考えられます。したがって、SoFiA よりも優れたパフォーマンスを発揮できるように、より適切にラベル付けされたデータを使用して V-Net ネットワークの品質を改善する余地があります。

The 21 cm spectral line emission of atomic neutral hydrogen (HI) is one of the primary wavelengths observed in radio astronomy. However, the signal is intrinsically faint and the HI content of galaxies depends on the cosmic environment, requiring large survey volumes and survey depth to investigate the HI Universe. As the amount of data coming from these surveys continues to increase with technological improvements, so does the need for automatic techniques for identifying and characterising HI sources while considering the tradeoff between completeness and purity. This study aimed to find the optimal pipeline for finding and masking the most sources with the best mask quality and the fewest artefacts in 3D neutral hydrogen cubes. Various existing methods were explored in an attempt to create a pipeline to optimally identify and mask the sources in 3D neutral hydrogen 21 cm spectral line data cubes. Two traditional source-finding methods were tested, SoFiA and MTObjects, as well as a new supervised deep learning approach, in which a 3D convolutional neural network architecture, known as V-Net was used. These three source-finding methods were further improved by adding a classical machine learning classifier as a post-processing step to remove false positive detections. The pipelines were tested on HI data cubes from the Westerbork Synthesis Radio Telescope with additional inserted mock galaxies. SoFiA combined with a random forest classifier provided the best results, with the V-Net-random forest combination a close second. We suspect this is due to the fact that there are many more mock sources in the training set than real sources. There is, therefore, room to improve the quality of the V-Net network with better-labelled data such that it can potentially outperform SoFiA.

updated: Wed Nov 23 2022 09:45:07 GMT+0000 (UTC)

published: Wed Nov 23 2022 09:45:07 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト