Algorithmic encoding of protected characteristics in image-based models for disease detection

Ben Glocker; Charles Jones; Melanie Bernhardt; Stefan Winzeck

病気の検出のための画像ベースのモデルにおける保護された特性のアルゴリズムによる符号化

臨床的意思決定にAIを使用すると、健康格差が拡大する可能性があることが正しく強調されています。機械学習モデルは、たとえば、患者の人種的アイデンティティと臨床転帰の間の望ましくない相関関係を検出する場合があります。このような相関関係は、モデル開発に使用される（履歴）データによく見られます。画像ベースの疾患検出モデルのバイアスを報告する研究が増えています。十分なサービスを受けていない集団からのデータが不足していることに加えて、これらのバイアスがどのようにエンコードされ、異種のパフォーマンスをどのように削減または削除するかについてはほとんどわかっていません。アルゴリズムが生物学的性別や人種的アイデンティティなどの患者の特徴を認識し、予測を行う際にこの情報を直接的または間接的に使用する可能性があるという懸念があります。しかし、そのような情報が実際に使用されているかどうかをどのように確認できるかは不明です。この記事は、病気の検出モデルの内部動作を評価するためのさまざまな方法論を探求することにより、これらの問題に光を当てることを目的としています。マルチタスク学習とモデル検査を調査して、保護された特性と疾患の予測との関係を評価します。私たちの分析フレームワークは、医用画像AIの将来の研究において貴重な洞察を提供できると信じています。また、私たちの調査結果は、パフォーマンスの不一致の根本的な原因をよりよく理解するためのさらなる調査を必要としています。

It has been rightfully emphasized that the use of AI for clinical decision making could amplify health disparities. A machine learning model may pick up undesirable correlations, for example, between a patient's racial identity and clinical outcome. Such correlations are often present in (historical) data used for model development. There has been an increase in studies reporting biases in image-based disease detection models. Besides the scarcity of data from underserved populations, very little is known about how these biases are encoded and how one may reduce or even remove disparate performance. There are concerns that an algorithm may recognize patient characteristics such as biological sex or racial identity, and then directly or indirectly use this information when making predictions. But it remains unclear how we can establish whether such information is actually used. This article aims to shed some light on these issues by exploring different methodology for assessing the inner working of disease detection models. We explore multitask learning and model inspection to assess the relationship between protected characteristics and prediction of disease. We believe our analysis framework could provide valuable insights in future studies in medical imaging AI. Our findings also call for further research to better understand the underlying causes of performance disparities.

updated: Tue Jan 18 2022 16:55:50 GMT+0000 (UTC)

published: Wed Oct 27 2021 20:30:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト