OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System

Xiaoyuan Guo; Jiali Duan; Saptarshi Purkayastha; Hari Trivedi; Judy Wawira Gichoya; Imon Banerjee

Improving retrieval relevance on noisy datasets is an emerging need for curating large-scale clean datasets in the medical domain. While existing methods can be applied to class-wise (a.k.a. inter-class) retrieval, they cannot distinguish the granularity of likeness within the same class (a.k.a. intra-class). The problem is exacerbated on external medical datasets, where noisy samples of the same class are treated equally during training. Our goal is to identify both intra- and inter-class similarities for fine-grained retrieval. To achieve this, we propose an Outlier-Sensitive Content-based rAdiography Retrieval System (OSCARS), consisting of two steps. First, we train an outlier detector on a clean internal dataset in an unsupervised manner, then use the trained detector to generate anomaly scores on the external dataset, whose distribution is used to bin intra-class variations. Second, we propose a quadruplet (a, p, n_intra, n_inter) sampling strategy, where intra-class negatives n_intra are sampled from bins of the same class other than the bin the anchor a belongs to, while inter-class negatives n_inter are randomly sampled from other classes. We propose a weighted metric learning objective to balance intra- and inter-class feature learning. We experimented on two representative public radiography datasets, and the experiments show the effectiveness of our approach. The training and evaluation code can be found at https://github.com/XiaoyuanGuo/oscars.
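The binning and quadruplet sampling described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see the repository linked above for that) and rests on several assumptions that the abstract does not state: bins are equal-width over the global anomaly-score range, the positive p is drawn from the anchor's own class and bin, and the weighted objective is a convex combination of two triplet-style hinge terms. The names `bin_by_score`, `sample_quadruplet`, `weighted_quadruplet_loss`, `w`, `m_intra`, and `m_inter` are hypothetical.

```python
import random


def bin_by_score(scores, n_bins):
    """Assign each anomaly score to one of n_bins equal-width bins (assumed scheme)."""
    lo, hi = min(scores), max(scores)
    width = (hi - lo) / n_bins or 1.0  # fall back to 1.0 if all scores are equal
    return [min(int((s - lo) / width), n_bins - 1) for s in scores]


def sample_quadruplet(a, labels, bins, rng):
    """Return indices (a, p, n_intra, n_inter) for anchor a, or None if impossible."""
    idx = range(len(labels))
    # Assumption: positives share both the class and the bin of the anchor.
    pos = [i for i in idx if i != a and labels[i] == labels[a] and bins[i] == bins[a]]
    # Intra-class negatives: same class, different anomaly-score bin.
    intra = [i for i in idx if labels[i] == labels[a] and bins[i] != bins[a]]
    # Inter-class negatives: any sample of another class.
    inter = [i for i in idx if labels[i] != labels[a]]
    if not (pos and intra and inter):
        return None
    return a, rng.choice(pos), rng.choice(intra), rng.choice(inter)


def sq_dist(u, v):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(u, v))


def weighted_quadruplet_loss(fa, fp, fni, fne, w=0.5, m_intra=0.1, m_inter=1.0):
    """Assumed form: weighted sum of an intra-class and an inter-class hinge term."""
    intra = max(0.0, sq_dist(fa, fp) - sq_dist(fa, fni) + m_intra)
    inter = max(0.0, sq_dist(fa, fp) - sq_dist(fa, fne) + m_inter)
    return w * intra + (1.0 - w) * inter


if __name__ == "__main__":
    scores = [0.0, 0.1, 0.9, 1.0, 0.2, 0.8]  # toy anomaly scores
    labels = [0, 0, 0, 0, 1, 1]              # toy class labels
    bins = bin_by_score(scores, n_bins=2)
    print(sample_quadruplet(0, labels, bins, random.Random(0)))
```

In this sketch, a smaller intra-class margin than inter-class margin reflects the intuition that same-class samples from a different bin should sit closer to the anchor than samples of another class; the specific values are illustrative only.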

updated: Wed Apr 06 2022 20:18:35 GMT+0000 (UTC)

published: Wed Apr 06 2022 20:18:35 GMT+0000 (UTC)

arXiv
