Classification of Astronomical Bodies by Efficient Layer Fine-Tuning of Deep Neural Networks

Sabeesh Ethiraj; Bharath Kumar Bolla

ディープニューラルネットワークの効率的な層の微調整による天体の分類

SDSS-IVデータセットには、天文台によってキャプチャされた銀河、星、クエーサーなどのさまざまな天体に関する情報が含まれています。 SDSS-IVデータセットを分類するために転移学習を利用したディープマルチモーダル学習の研究に触発され、分類シナリオでの効果を研究するために、これらのアーキテクチャの微調整に関する研究をさらに拡張しました。 Resnet-50、DenseNet-121 VGG-16、Xception、EfficientNetB2、MobileNetV2、NasnetMobileなどのアーキテクチャは、さまざまなレベルでレイヤーごとの微調整を使用して構築されています。私たちの調査結果は、Imagenetウェイトを使用してすべてのレイヤーをフリーズし、最終的なトレーニング可能なレイヤーを追加することは、最適なソリューションではない可能性があることを示唆しています。さらに、ベースラインモデルと、トレーニング可能なレイヤーの数が多いモデルは、特定のアーキテクチャで同様に実行されます。モデルはさまざまなレベルで微調整する必要があり、モデルを理想と呼ぶには特定のトレーニング比率が必要です。アーキテクチャが異なれば、精度の高いトレーニング可能なレイヤーの数の変化に対する反応も異なります。 DenseNet-121、Xception、EfficientNetB2などのモデルは、ほぼ完全なトレーニング曲線と比較的一致するピーク精度を達成しましたが、Resnet-50、VGG-16、MobileNetV2、NasnetMobileなどのモデルは、トレーニング曲線の適合性が低く、ピーク精度が低く遅延していました。また、モバイルニューラルネットワークのパラメータとモデルサイズは小さいものの、検証精度が一貫して低いため、低計算デバイスでの展開に必ずしも理想的ではない場合があることもわかりました。モデルの評価には、TuningParameterRatioやTuningLayerRatioなどのカスタマイズされた評価指標が使用されます。

The SDSS-IV dataset contains information about various astronomical bodies such as Galaxies, Stars, and Quasars captured by observatories. Inspired by our work on deep multimodal learning, which utilized transfer learning to classify the SDSS-IV dataset, we further extended our research in the fine tuning of these architectures to study the effect in the classification scenario. Architectures such as Resnet-50, DenseNet-121 VGG-16, Xception, EfficientNetB2, MobileNetV2 and NasnetMobile have been built using layer wise fine tuning at different levels. Our findings suggest that freezing all layers with Imagenet weights and adding a final trainable layer may not be the optimal solution. Further, baseline models and models that have higher number of trainable layers performed similarly in certain architectures. Model need to be fine tuned at different levels and a specific training ratio is required for a model to be termed ideal. Different architectures had different responses to the change in the number of trainable layers w.r.t accuracies. While models such as DenseNet-121, Xception, EfficientNetB2 achieved peak accuracies that were relatively consistent with near perfect training curves, models such as Resnet-50,VGG-16, MobileNetV2 and NasnetMobile had lower, delayed peak accuracies with poorly fitting training curves. It was also found that though mobile neural networks have lesser parameters and model size, they may not always be ideal for deployment on a low computational device as they had consistently lower validation accuracies. Customized evaluation metrics such as Tuning Parameter Ratio and Tuning Layer Ratio are used for model evaluation.

updated: Sat May 14 2022 20:08:19 GMT+0000 (UTC)

published: Sat May 14 2022 20:08:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト