A Multimodal Framework for the Detection of Hateful Memes

Lippe, Phillip; Holla, Nithin; Chandra, Shantanu; Rajamanickam, Santhosh; Antoniou, Georgios; Shutova, Ekaterina; Yannakoudakis, Helen

Computer Science > Computation and Language

arXiv:2012.12871 (cs)

[Submitted on 23 Dec 2020 (v1), last revised 24 Dec 2020 (this version, v2)]

Title:A Multimodal Framework for the Detection of Hateful Memes

Authors:Phillip Lippe, Nithin Holla, Shantanu Chandra, Santhosh Rajamanickam, Georgios Antoniou, Ekaterina Shutova, Helen Yannakoudakis

View PDF

Abstract:An increasingly common expression of online hate speech is multimodal in nature and comes in the form of memes. Designing systems to automatically detect hateful content is of paramount importance if we are to mitigate its undesirable effects on the society at large. The detection of multimodal hate speech is an intrinsically difficult and open problem: memes convey a message using both images and text and, hence, require multimodal reasoning and joint visual and language understanding. In this work, we seek to advance this line of research and develop a multimodal framework for the detection of hateful memes. We improve the performance of existing multimodal approaches beyond simple fine-tuning and, among others, show the effectiveness of upsampling of contrastive examples to encourage multimodality and ensemble learning based on cross-validation to improve robustness. We furthermore analyze model misclassifications and discuss a number of hypothesis-driven augmentations and their effects on performance, presenting important implications for future research in the field. Our best approach comprises an ensemble of UNITER-based models and achieves an AUROC score of 80.53, placing us 4th on phase 2 of the 2020 Hateful Memes Challenge organized by Facebook.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2012.12871 [cs.CL]
	(or arXiv:2012.12871v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2012.12871
Journal reference:	PMLR 133:344-360, 2021

Submission history

From: Nithin Holla [view email]
[v1] Wed, 23 Dec 2020 18:37:11 UTC (4,822 KB)
[v2] Thu, 24 Dec 2020 14:28:17 UTC (4,822 KB)

Computer Science > Computation and Language

Title:A Multimodal Framework for the Detection of Hateful Memes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Multimodal Framework for the Detection of Hateful Memes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators