Multiple Meta-model Quantifying for Medical Visual Question Answering

Tuong Do; Binh X. Nguyen; Erman Tjiputra; Minh Tran; Quang D. Tran; Anh Nguyen

医療視覚質問応答のための複数のメタモデルの定量化

転移学習は、意味のある特徴を抽出し、医療用視覚質問応答（VQA）タスクのデータ制限を克服するための重要なステップです。ただし、既存の医療VQA手法のほとんどは、転送学習のために外部データに依存していますが、データセット内のメタデータは十分に活用されていません。このホワイトペーパーでは、メタアノテーションを効果的に学習し、医療VQAタスクに意味のある機能を活用する新しい複数のメタモデルの定量化手法を紹介します。私たちが提案する方法は、自動注釈によってメタデータを増やし、ノイズの多いラベルを処理し、医療VQAタスクに堅牢な機能を提供するメタモデルを出力するように設計されています。 2つの公的医療VQAデータセットに関する広範な実験結果は、メタモデルをトレーニングするために外部データを必要とせずに、私たちのアプローチが他の最先端の方法と比較して優れた精度を達成することを示しています。

Transfer learning is an important step to extract meaningful features and overcome the data limitation in the medical Visual Question Answering (VQA) task. However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized. In this paper, we present a new multiple meta-model quantifying method that effectively learns meta-annotation and leverages meaningful features to the medical VQA task. Our proposed method is designed to increase meta-data by auto-annotation, deal with noisy labels, and output meta-models which provide robust features for medical VQA tasks. Extensively experimental results on two public medical VQA datasets show that our approach achieves superior accuracy in comparison with other state-of-the-art methods, while does not require external data to train meta-models.

updated: Wed May 19 2021 04:06:05 GMT+0000 (UTC)

published: Wed May 19 2021 04:06:05 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト