A Mathematical Analysis of Learning Loss for Active Learning in Regression

Megh Shukla; Shuaib Ahmed

回帰における能動学習の学習損失の数学的分析

アクティブラーニングはデータ効率が高いため、業界では引き続き重要です。限られた予算で費用効果が高いだけでなく、モデルを継続的に改良することで、モデル開発段階での障害シナリオの早期発見と解決が可能になります。産業用アプリケーションでは、予測可能なすべてのユースケースで基盤となるモデルが正確に機能することが求められるため、モデルの障害を特定して修正することが重要です。障害の特定によってモデルを継続的に改良することに特化した、人気のある最先端の手法の1つに、LearningLossがあります。シンプルでエレガントですが、このアプローチは経験的に動機付けられています。私たちの論文は、LearningLoss ++と呼ばれる新しい修正を提案することを可能にするLearningLossの基盤を開発します。 LearningLossとLearningLoss ++の間の勾配の厳密な分析と比較により、LearningLossがどのように機能するかを解釈する上で勾配が重要であることを示します。また、損失を予測するためにさまざまなスケールの機能を組み合わせた畳み込みアーキテクチャを提案します。 Learning Lossで行われたように、（MPIIおよびLSPデータセットを使用して）人間の姿勢推定のタスクの回帰についてLearningLoss ++を検証します。 LearningLoss ++は、モデルのパフォーマンスが低下する可能性が高いシナリオの特定において優れていることを示しています。これは、モデルの改良により、オープンワールドでの信頼性の高いパフォーマンスにつながります。

Active learning continues to remain significant in the industry since it is data efficient. Not only is it cost effective on a constrained budget, continuous refinement of the model allows for early detection and resolution of failure scenarios during the model development stage. Identifying and fixing failures with the model is crucial as industrial applications demand that the underlying model performs accurately in all foreseeable use cases. One popular state-of-the-art technique that specializes in continuously refining the model via failure identification is Learning Loss. Although simple and elegant, this approach is empirically motivated. Our paper develops a foundation for Learning Loss which enables us to propose a novel modification we call LearningLoss++. We show that gradients are crucial in interpreting how Learning Loss works, with rigorous analysis and comparison of the gradients between Learning Loss and LearningLoss++. We also propose a convolutional architecture that combines features at different scales to predict the loss. We validate LearningLoss++ for regression on the task of human pose estimation (using MPII and LSP datasets), as done in Learning Loss. We show that LearningLoss++ outperforms in identifying scenarios where the model is likely to perform poorly, which on model refinement translates into reliable performance in the open world.

updated: Mon Apr 19 2021 13:54:20 GMT+0000 (UTC)

published: Mon Apr 19 2021 13:54:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト