Computer Vision and Conflicting Values: Describing People with Automated Alt Text

Margot Hanley; Solon Barocas; Karen Levy; Shiri Azenkot; Helen Nissenbaum

コンピュータビジョンと相反する価値観：自動化された代替テキストで人々を説明する

学者たちは最近、画像内の人々の説明を自動的に生成するためのコンピュータビジョンの使用によって提起されたさまざまな物議を醸す問題に注目を集めています。これらの懸念にもかかわらず、自動画像記述は、視覚障害者や弱視者の情報への公平なアクセスを確保するための重要なツールになっています。このホワイトペーパーでは、代替テキストの作成にコンピュータービジョンの使用を採用している企業が直面する倫理的ジレンマを調査します。視覚障害者向けの画像のテキストによる説明、Facebookの自動代替テキストツールを主要なケーススタディとして使用します。まず、Facebookが人種、性別、年齢などのIDカテゴリに関して採用したポリシーと、これらの用語を代替テキストで表示するかどうかに関する会社の決定を分析します。次に、博物館が文化的人工物の代替テキストの説明に何を含めるかを決定する方法に焦点を当てて、博物館コミュニティで実践されている代替の（そして手動の）アプローチについて説明します。これらのポリシーを比較し、注目すべき対照点を使用して、これらのポリシー選択の背後にある特定の懸念を特徴付ける分析フレームワークを開発します。結論として、これらの懸念のいくつかを回避していると思われる2つの戦略を検討し、代替テキストを自動化するためのコンピュータービジョンの使用によって引き起こされる規範的なジレンマを回避する簡単な方法はないことを発見しました。

Scholars have recently drawn attention to a range of controversial issues posed by the use of computer vision for automatically generating descriptions of people in images. Despite these concerns, automated image description has become an important tool to ensure equitable access to information for blind and low vision people. In this paper, we investigate the ethical dilemmas faced by companies that have adopted the use of computer vision for producing alt text: textual descriptions of images for blind and low vision people, We use Facebook's automatic alt text tool as our primary case study. First, we analyze the policies that Facebook has adopted with respect to identity categories, such as race, gender, age, etc., and the company's decisions about whether to present these terms in alt text. We then describe an alternative -- and manual -- approach practiced in the museum community, focusing on how museums determine what to include in alt text descriptions of cultural artifacts. We compare these policies, using notable points of contrast to develop an analytic framework that characterizes the particular apprehensions behind these policy choices. We conclude by considering two strategies that seem to sidestep some of these concerns, finding that there are no easy ways to avoid the normative dilemmas posed by the use of computer vision to automate alt text.

updated: Wed May 26 2021 18:01:16 GMT+0000 (UTC)

published: Wed May 26 2021 18:01:16 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト