Finding Strong Gravitational Lenses Through Self-Attention

Hareesh Thuruthipilly; Adam Zadrozny; Agnieszka Pollo; Marek Biesiada

自己注意による強力な重力レンズの発見

LSSTのような今後の大規模な調査では、現代の掃天観測よりも何桁も大きいデータを分析することにより、約10^5個の強力な重力レンズが見つかると予想されています。この場合、自動化されていない手法は、たとえ可能であっても、非常に困難で時間がかかります。強力な重力レンズを見つけるために、自己注意の原理に基づいた新しい自動化アーキテクチャを提案します。畳み込みニューラルネットワークに対する自己注意ベースのエンコーダモデルの利点が調査され、エンコーダモデルの結果を最適化する方法が分析されます。ボローニャレンズチャレンジから重力レンズを識別するために、21の自己注意ベースのエンコーダモデルと5つの畳み込みニューラルネットワークを構築してトレーニングしました。各モデルは、18,000のシミュレートされた画像を使用して個別にトレーニングされ、2,000の画像を使用して相互検証され、100,000の画像を含むテストセットに適用されました。評価には、分類精度、受信者動作特性曲線（AUROC）の下の面積、TPR_0スコア、およびTPR_10スコアの4つの異なるメトリックを使用しました。自己注意ベースのエンコーダモデルとチャレンジに参加しているCNNのパフォーマンスが比較されます。彼らは、TPR_0とTPR_10 $で、ボローニャレンズチャレンジに参加したCNNモデルを高いマージンで上回ることができました。自己注意ベースのモデルには、単純なCNNと比較して明らかな利点があります。それらは、現在使用されている残差ニューラルネットワークと比較して非常に競合するパフォーマンスを持っています。 CNNと比較して、自己注意ベースのモデルは、信頼性の高いレンズ候補を識別でき、実際のデータから潜在的な候補を除外することができます。さらに、エンコーダーレイヤーを導入すると、効果的なフィルターとして機能することで、CNNに存在する過剰適合の問題に対処することもできます。

The upcoming large scale surveys like LSST are expected to find approximately 10^5 strong gravitational lenses by analysing data of many orders of magnitude larger than those in contemporary astronomical surveys. In this case, non-automated techniques will be highly challenging and time-consuming, even if they are possible at all. We propose a new automated architecture based on the principle of self-attention to find strong gravitational lenses. The advantages of self-attention-based encoder models over convolution neural networks are investigated, and ways to optimise the outcome of encoder models are analysed. We constructed and trained 21 self-attention based encoder models and five convolution neural networks to identify gravitational lenses from the Bologna Lens Challenge. Each model was trained separately using 18,000 simulated images, cross-validated using 2,000 images, and then applied to a test set with 100,000 images. We used four different metrics for evaluation: classification accuracy, area under the receiver operating characteristic curve (AUROC), the TPR_0 score and the TPR_10 score. The performances of self-attention-based encoder models and CNNs participating in the challenge are compared. They were able to surpass the CNN models that participated in the Bologna Lens Challenge by a high margin for the TPR_0 and TPR_10$. Self-Attention based models have clear advantages compared to simpler CNNs. They have highly competing performance in comparison to the currently used residual neural networks. Compared to CNNs, self-attention based models can identify highly confident lensing candidates and will be able to filter out potential candidates from real data. Moreover, introducing the encoder layers can also tackle the over-fitting problem present in the CNNs by acting as effective filters.

updated: Tue May 17 2022 07:59:15 GMT+0000 (UTC)

published: Mon Oct 18 2021 11:40:48 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト