Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety

Rajagopal A; Nirmala V; Arun Muthuraj Vedamanickam

Deep learning models for image captioning and low-light image enhancement have progressed rapidly. The authors present what they describe as the first deep learning model in the literature that translates night scenes into sentences, opening new possibilities for AI applications in the safety of visually impaired women. Inspired by image captioning and visual question answering, the paper develops a novel interactive image captioning approach in which a user can make the AI focus on any chosen person of interest by influencing the attention scoring. Attention context vectors are computed from CNN feature vectors and a user-provided start word. The Encoder-Attention-Decoder neural network learns to produce captions from low-brightness images. The paper demonstrates how this interactive vision-language capability for perceiving the environment at night can support women's safety.
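
The abstract states that attention context vectors are computed from CNN feature vectors and a user-provided start word, but it does not give the scoring function. The following is a minimal sketch of one common choice, additive (Bahdanau-style) attention, and is not the authors' implementation. The function name `attention_context`, the weight matrices `W_f`, `W_q`, `v`, the toy word embeddings, and all dimensions are assumptions made for illustration; random arrays stand in for real CNN features and trained weights.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    shifted = x - np.max(x)
    exp = np.exp(shifted)
    return exp / exp.sum()


def attention_context(features, query, W_f, W_q, v):
    """Additive attention over image regions (assumed scoring function).

    features: (num_regions, feat_dim) CNN feature vectors, one per region
    query:    (query_dim,) embedding of the user-provided start word
    W_f:      (feat_dim, attn_dim) projection of the features
    W_q:      (query_dim, attn_dim) projection of the query
    v:        (attn_dim,) scoring vector
    Returns the context vector (feat_dim,) and the weights (num_regions,).
    """
    # One scalar score per region, conditioned on the start word.
    scores = np.tanh(features @ W_f + query @ W_q) @ v
    weights = softmax(scores)
    # Context vector: attention-weighted sum of the region features.
    context = weights @ features
    return context, weights


# Toy setup: random values stand in for CNN features and learned parameters.
rng = np.random.default_rng(0)
num_regions, feat_dim, embed_dim, attn_dim = 64, 256, 32, 48
features = rng.normal(size=(num_regions, feat_dim))
W_f = rng.normal(size=(feat_dim, attn_dim)) * 0.05
W_q = rng.normal(size=(embed_dim, attn_dim)) * 0.05
v = rng.normal(size=attn_dim)

# Hypothetical start-word embeddings; a real model would learn these.
embeddings = {
    "man": rng.normal(size=embed_dim),
    "woman": rng.normal(size=embed_dim),
}

context_man, weights_man = attention_context(
    features, embeddings["man"], W_f, W_q, v
)
context_woman, weights_woman = attention_context(
    features, embeddings["woman"], W_f, W_q, v
)
print(context_man.shape, weights_man.shape)
```

In this sketch, changing the start word changes the query and therefore the distribution of attention weights over image regions, which is the sense in which a user could steer the captioner toward a person of interest. In a full Encoder-Attention-Decoder model the context vector would then be passed to the decoder at each step.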

updated: Tue Jan 04 2022 04:21:07 GMT+0000 (UTC)

published: Tue Jan 04 2022 04:21:07 GMT+0000 (UTC)

arXiv
