Spatial-context-aware deep neural network for multi-class image classification

Jialu Zhang; Qian Zhang; Jianfeng Ren; Yitian Zhao; Jiang Liu

Multi-label image classification is a fundamental but challenging task in computer vision. Over the past few decades, solutions that explore relationships between semantic labels have made great progress. However, the underlying spatial-contextual information of labels remains under-exploited. To tackle this problem, a spatial-context-aware deep neural network is proposed that predicts labels by taking both semantic and spatial information into account. The proposed framework is evaluated on Microsoft COCO and PASCAL VOC, two widely used benchmark datasets for image multi-labelling. The results show that the proposed approach outperforms state-of-the-art solutions on the multi-label image classification problem.

updated: Wed Nov 24 2021 06:36:10 GMT+0000 (UTC)

published: Wed Nov 24 2021 06:36:10 GMT+0000 (UTC)

arXiv
