Societal Biases in Retrieved Contents: Measurement Framework and Adversarial Mitigation for BERT Rankers

Rekabsaz, Navid; Kopeinik, Simone; Schedl, Markus

doi:10.1145/3404835.3462949

Computer Science > Information Retrieval

arXiv:2104.13640 (cs)

[Submitted on 28 Apr 2021 (v1), last revised 11 May 2021 (this version, v2)]

Title:Societal Biases in Retrieved Contents: Measurement Framework and Adversarial Mitigation for BERT Rankers

Authors:Navid Rekabsaz, Simone Kopeinik, Markus Schedl

View PDF

Abstract:Societal biases resonate in the retrieved contents of information retrieval (IR) systems, resulting in reinforcing existing stereotypes. Approaching this issue requires established measures of fairness in respect to the representation of various social groups in retrieval results, as well as methods to mitigate such biases, particularly in the light of the advances in deep ranking models. In this work, we first provide a novel framework to measure the fairness in the retrieved text contents of ranking models. Introducing a ranker-agnostic measurement, the framework also enables the disentanglement of the effect on fairness of collection from that of rankers. To mitigate these biases, we propose AdvBert, a ranking model achieved by adapting adversarial bias mitigation for IR, which jointly learns to predict relevance and remove protected attributes. We conduct experiments on two passage retrieval collections (MSMARCO Passage Re-ranking and TREC Deep Learning 2019 Passage Re-ranking), which we extend by fairness annotations of a selected subset of queries regarding gender attributes. Our results on the MSMARCO benchmark show that, (1) all ranking models are less fair in comparison with ranker-agnostic baselines, and (2) the fairness of Bert rankers significantly improves when using the proposed AdvBert models. Lastly, we investigate the trade-off between fairness and utility, showing that we can maintain the significant improvements in fairness without any significant loss in utility.

Comments:	Accepted at SIGIR 2021
Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2104.13640 [cs.IR]
	(or arXiv:2104.13640v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2104.13640
Related DOI:	https://doi.org/10.1145/3404835.3462949

Submission history

From: Navid Rekabsaz [view email]
[v1] Wed, 28 Apr 2021 08:53:54 UTC (201 KB)
[v2] Tue, 11 May 2021 07:02:56 UTC (1,273 KB)

Computer Science > Information Retrieval

Title:Societal Biases in Retrieved Contents: Measurement Framework and Adversarial Mitigation for BERT Rankers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Societal Biases in Retrieved Contents: Measurement Framework and Adversarial Mitigation for BERT Rankers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators