Medical Visual Question Answering: A Survey

Zhihong Lin; Donghao Zhang; Qingyi Tao; Danli Shi; Gholamreza Haffari; Qi Wu; Mingguang He; Zongyuan Ge

医療視覚的質問回答: アンケート

Medical Visual Question Answering~(VQA) は、医療用人工知能と一般的な VQA チャレンジを組み合わせたものです。医療画像と自然言語による臨床関連の質問が与えられると、医療 VQA システムは、もっともらしく説得力のある答えを予測することが期待されます。一般領域の VQA は広範囲に研究されていますが、医療用 VQA はそのタスクの特徴により、依然として特別な調査と探索が必要です。この調査の最初の部分では、データソース、データ量、タスクの特徴について最新の公的に利用可能な医療 VQA データセットを収集し、議論します。 2 番目の部分では、医療 VQA タスクで使用されるアプローチを確認します。私たちは彼らの技術、革新性、そして改善の可能性を要約し、議論します。最後の部分では、この分野の医療特有の課題を分析し、今後の研究の方向性について議論します。私たちの目標は、医療視覚的質問応答分野に興味のある研究者に包括的で役立つ情報を提供し、この分野でのさらなる研究を奨励することです。

Medical Visual Question Answering~(VQA) is a combination of medical artificial intelligence and popular VQA challenges. Given a medical image and a clinically relevant question in natural language, the medical VQA system is expected to predict a plausible and convincing answer. Although the general-domain VQA has been extensively studied, the medical VQA still needs specific investigation and exploration due to its task features. In the first part of this survey, we collect and discuss the publicly available medical VQA datasets up-to-date about the data source, data quantity, and task feature. In the second part, we review the approaches used in medical VQA tasks. We summarize and discuss their techniques, innovations, and potential improvements. In the last part, we analyze some medical-specific challenges for the field and discuss future research directions. Our goal is to provide comprehensive and helpful information for researchers interested in the medical visual question answering field and encourage them to conduct further research in this field.

updated: Wed Jun 07 2023 00:37:15 GMT+0000 (UTC)

published: Fri Nov 19 2021 05:55:15 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト