Probabilistic Integration of Object Level Annotations in Chest X-ray Classification

Tom van Sonsbeek; Xiantong Zhen; Dwarikanath Mahapatra; Marcel Worring

胸部 X 線分類におけるオブジェクトレベルの注釈の確率論的統合

医用画像データセットとその注釈は、一般的なドメインで同等のものほど速く成長していません。これにより、視覚分野に大きな影響を与えてきた最新のよりデータ集約的な方法からの変換がますます困難になり、効率が低下します。この論文では、胸部 X 線画像の疾患分類のための新しい確率的潜在変数モデルを提案します。具体的には、グローバルな疾患ラベルを含む胸部 X 線データセットを検討し、より小さなサブセットには、視線パターンと疾患境界ボックスの形でオブジェクトレベルの専門家の注釈を含めます。 2 段階の方法で単一のトレーニングパイプラインを介してこれらの異なるラベルの粒度を処理できる 2 段階の最適化アルゴリズムを提案します。私たちのパイプラインでは、グローバルデータセットの特徴はモデルの下位層で学習されます。きめ細かなエキスパートオブジェクトレベルの注釈の特定の詳細とニュアンスは、条件付き変分推論に触発された知識蒸留法を使用して、モデルの最終層で学習されます。その後、モデルの重みが凍結されて、この学習プロセスがガイドされ、豊富な注釈が付けられた小さなデータサブセットでのオーバーフィッティングが防止されます。提案された方法は、共通のベンチマークデータセットである胸部 X 線 14 および MIMIC-CXR で、さまざまなバックボーンにわたって一貫した分類の改善をもたらします。これは、特にオブジェクトレベルのアノテーションを使用して、粗いものから細かいものまでラベルを 2 段階で学習することが、より最適なアノテーションの使用に効果的な方法であることを示しています。

Medical image datasets and their annotations are not growing as fast as their equivalents in the general domain. This makes translation from the newest, more data-intensive methods that have made a large impact on the vision field increasingly more difficult and less efficient. In this paper, we propose a new probabilistic latent variable model for disease classification in chest X-ray images. Specifically we consider chest X-ray datasets that contain global disease labels, and for a smaller subset contain object level expert annotations in the form of eye gaze patterns and disease bounding boxes. We propose a two-stage optimization algorithm which is able to handle these different label granularities through a single training pipeline in a two-stage manner. In our pipeline global dataset features are learned in the lower level layers of the model. The specific details and nuances in the fine-grained expert object-level annotations are learned in the final layers of the model using a knowledge distillation method inspired by conditional variational inference. Subsequently, model weights are frozen to guide this learning process and prevent overfitting on the smaller richly annotated data subsets. The proposed method yields consistent classification improvement across different backbones on the common benchmark datasets Chest X-ray14 and MIMIC-CXR. This shows how two-stage learning of labels from coarse to fine-grained, in particular with object level annotations, is an effective method for more optimal annotation usage.

updated: Thu Oct 13 2022 12:53:42 GMT+0000 (UTC)

published: Thu Oct 13 2022 12:53:42 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト