The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes

Douwe Kiela; Hamed Firooz; Aravind Mohan; Vedanuj Goswami; Amanpreet Singh; Pratik Ringshia; Davide Testuggine

This work proposes a new challenge set for multimodal classification, focusing on detecting hate speech in multimodal memes. It is constructed such that unimodal models struggle and only multimodal models can succeed: difficult examples ("benign confounders") are added to the dataset to make it hard to rely on unimodal signals. The task requires subtle reasoning, yet is straightforward to evaluate as a binary classification problem. We provide baseline performance numbers for unimodal models, as well as for multimodal models with various degrees of sophistication. We find that state-of-the-art methods perform poorly compared to humans (64.73% vs. 84.7% accuracy), illustrating the difficulty of the task and highlighting the challenge that this important problem poses to the community.

Updated: 7 April 2021, 18:43:54 UTC

Published: 10 May 2020, 21:31:00 UTC

arXiv
