DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning

Tongan Cai; Haomiao Ni; Mingli Yu; Xiaolei Huang; Kelvin Wong; John Volpi; James Z. Wang; Stephen T. C. Wong

In an emergency room (ER) setting, stroke triage or screening is a common challenge. A quick CT scan is usually performed instead of MRI because of MRI's slow throughput and high cost. Clinical tests are commonly consulted during the process, but the misdiagnosis rate remains high. We propose a novel multimodal deep learning framework, DeepStroke, for computer-aided stroke presence assessment that recognizes patterns of minor facial muscle incoordination and speech inability in patients with suspected stroke in an acute setting. DeepStroke takes one-minute facial video and audio data, readily available during stroke triage, for local facial paralysis detection and global speech disorder analysis. Transfer learning is adopted to reduce face-attribute biases and improve generalizability. We leverage multimodal lateral fusion to combine low- and high-level features and provide mutual regularization for joint training. Novel adversarial training is introduced to obtain identity-free and stroke-discriminative features. Experiments on our video-audio dataset with actual ER patients show that DeepStroke outperforms state-of-the-art models and performs better than both a triage team and ER doctors, attaining 10.94% higher sensitivity and 7.37% higher accuracy than traditional stroke triage when specificity is aligned. Meanwhile, each assessment can be completed in less than six minutes, demonstrating the framework's great potential for clinical translation.
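The comparison "when specificity is aligned" can be illustrated with a minimal sketch. It assumes a binary classifier that outputs one score per patient, and it picks the lowest decision threshold whose specificity reaches a target value before reading off sensitivity and accuracy. All function names and the toy labels and scores below are hypothetical and are not taken from the paper; the abstract does not describe how the authors performed the alignment.

```python
from typing import Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_score: Sequence[float],
                     threshold: float) -> Tuple[int, int, int, int]:
    """Return (tp, fp, tn, fn), predicting stroke when score >= threshold."""
    tp = fp = tn = fn = 0
    for label, score in zip(y_true, y_score):
        predicted_positive = score >= threshold
        if label == 1 and predicted_positive:
            tp += 1
        elif label == 1:
            fn += 1
        elif predicted_positive:
            fp += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def sensitivity(tp: int, fn: int) -> float:
    """Fraction of true stroke cases that are flagged."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def specificity(tn: int, fp: int) -> float:
    """Fraction of non-stroke cases that are correctly cleared."""
    return tn / (tn + fp) if (tn + fp) else 0.0


def accuracy(tp: int, fp: int, tn: int, fn: int) -> float:
    """Fraction of all cases classified correctly."""
    total = tp + fp + tn + fn
    return (tp + tn) / total if total else 0.0


def threshold_for_specificity(y_true: Sequence[int], y_score: Sequence[float],
                              target: float) -> float:
    """Lowest observed score threshold whose specificity is >= target.

    Specificity never decreases as the threshold rises, so the first
    qualifying threshold also gives the highest sensitivity at that level.
    """
    for t in sorted(set(y_score)):
        _, fp, tn, _ = confusion_counts(y_true, y_score, t)
        if specificity(tn, fp) >= target:
            return t
    return float("inf")  # no observed score qualifies: predict all negative


# Hypothetical toy data (1 = stroke, 0 = non-stroke); not from the paper.
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]

if __name__ == "__main__":
    t = threshold_for_specificity(toy_labels, toy_scores, target=0.75)
    tp, fp, tn, fn = confusion_counts(toy_labels, toy_scores, t)
    print(t, sensitivity(tp, fn), specificity(tn, fp), accuracy(tp, fp, tn, fn))
```

Fixing specificity first makes two screening methods comparable on the same false-alarm rate, so any difference in sensitivity reflects how many additional true stroke cases are caught rather than a looser threshold.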

updated: Mon Jun 27 2022 18:02:49 GMT+0000 (UTC)

published: Fri Sep 24 2021 16:46:13 GMT+0000 (UTC)

arXiv

