Face Identification Proficiency Test Designed Using Item Response Theory

Géraldine Jeckeln; Ying Hu; Jacqueline G. Cavazos; Amy N. Yates; Carina A. Hahn; Larry Tang; Jonathon Phillips; Alice J. O'Toole

項目反応理論を使用して設計された顔識別能力テスト

顔識別能力の測定は、専門の法医学的顔検査官や、適用されたシナリオで顔識別タスクを実行する他の人による正確で一貫したパフォーマンスを保証するために不可欠です。現在の習熟度テストは、刺激項目の静的なセットに依存しているため、同じ個人に複数回有効に実施することはできません。習熟度テストを作成するには、「既知の」難易度のアイテムを多数組み立てる必要があります。次に、アイテムのサブセットを使用して、同じ難易度の複数のテストを構築できます。ここでは、項目反応理論（IRT）に基づく刺激難易度測定に基づく習熟度テストであるトライアドアイデンティティマッチング（TIM）テストを紹介します。参加者は、顔画像の「トライアド」（N = 225）（1つのアイデンティティの2つの画像と異なるアイデンティティの1つの画像）を表示し、異なるアイデンティティを選択します。実験1では、大学生（N = 197）がTIMテストで幅広い精度を示しました。さらに、IRTモデリングは、TIMテストがさまざまな難易度のアイテムを生成することを示しました。実験2では、IRTベースのアイテムの難易度を使用して、TIMテストを3つの同じように「簡単」なサブセットと3つの同じように「難しい」サブセットに分割しました。シミュレーション結果は、TIMアイテムのフルセットとキュレートされたサブセットが、被験者の能力の信頼できる推定値をもたらしたことを示しました。要約すると、TIMテストは、さまざまな能力レベル（たとえば、顔の処理に欠陥のある専門家や集団）全体の習熟度を測定するための柔軟性、調整、および適応性のあるフレームワークを開発するための開始点を提供できます。

Measures of face identification proficiency are essential to ensure accurate and consistent performance by professional forensic face examiners and others who perform face identification tasks in applied scenarios. Current proficiency tests rely on static sets of stimulus items, and so, cannot be administered validly to the same individual multiple times. To create a proficiency test, a large number of items of "known" difficulty must be assembled. Multiple tests of equal difficulty can be constructed then using subsets of items. Here, we introduce a proficiency test, the Triad Identity Matching (TIM) test, based on stimulus difficulty measures based on Item Response Theory (IRT). Participants view face-image "triads" (N=225) (two images of one identity and one image of a different identity) and select the different identity. In Experiment 1, university students (N=197) showed wide-ranging accuracy on the TIM test. Furthermore, IRT modeling demonstrated that the TIM test produces items of various difficulty levels. In Experiment 2, IRT-based item difficulty measures were used to partition the TIM test into three equally "easy" and three equally "difficult" subsets. Simulation results indicated that the full set, as well as curated subsets, of the TIM items yielded reliable estimates of subject ability. In summary, the TIM test can provide a starting point for developing a framework that is flexible, calibrated, and adaptive to measure proficiency across various ability levels (e.g., professionals or populations with face processing deficits)

updated: Tue Jun 22 2021 22:37:32 GMT+0000 (UTC)

published: Tue Jun 22 2021 22:37:32 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト