"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

Ovalle, Anaelia; Goyal, Palash; Dhamala, Jwala; Jaggers, Zachary; Chang, Kai-Wei; Galstyan, Aram; Zemel, Richard; Gupta, Rahul

doi:10.1145/3593013.3594078

Computer Science > Computation and Language

arXiv:2305.09941 (cs)

[Submitted on 17 May 2023 (v1), last revised 1 Jun 2023 (this version, v4)]

Title:"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

Authors:Anaelia Ovalle, Palash Goyal, Jwala Dhamala, Zachary Jaggers, Kai-Wei Chang, Aram Galstyan, Richard Zemel, Rahul Gupta

View PDF

Abstract:Transgender and non-binary (TGNB) individuals disproportionately experience discrimination and exclusion from daily life. Given the recent popularity and adoption of language generation technologies, the potential to further marginalize this population only grows. Although a multitude of NLP fairness literature focuses on illuminating and addressing gender biases, assessing gender harms for TGNB identities requires understanding how such identities uniquely interact with societal gender norms and how they differ from gender binary-centric perspectives. Such measurement frameworks inherently require centering TGNB voices to help guide the alignment between gender-inclusive NLP and whom they are intended to serve. Towards this goal, we ground our work in the TGNB community and existing interdisciplinary literature to assess how the social reality surrounding experienced marginalization of TGNB persons contributes to and persists within Open Language Generation (OLG). This social knowledge serves as a guide for evaluating popular large language models (LLMs) on two key aspects: (1) misgendering and (2) harmful responses to gender disclosure. To do this, we introduce TANGO, a dataset of template-based real-world text curated from a TGNB-oriented community. We discover a dominance of binary gender norms reflected by the models; LLMs least misgendered subjects in generated text when triggered by prompts whose subjects used binary pronouns. Meanwhile, misgendering was most prevalent when triggering generation with singular they and neopronouns. When prompted with gender disclosures, TGNB disclosure generated the most stigmatizing language and scored most toxic, on average. Our findings warrant further research on how TGNB harms manifest in LLMs and serve as a broader case study toward concretely grounding the design of gender-inclusive AI in community voices and interdisciplinary literature.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
ACM classes:	I.2; I.7; K.4
Cite as:	arXiv:2305.09941 [cs.CL]
	(or arXiv:2305.09941v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.09941
Journal reference:	2023 ACM Conference on Fairness, Accountability, and Transparency
Related DOI:	https://doi.org/10.1145/3593013.3594078

Submission history

From: Anaelia Ovalle [view email]
[v1] Wed, 17 May 2023 04:21:45 UTC (2,147 KB)
[v2] Thu, 18 May 2023 00:50:23 UTC (2,193 KB)
[v3] Tue, 30 May 2023 05:21:34 UTC (2,277 KB)
[v4] Thu, 1 Jun 2023 20:42:13 UTC (2,159 KB)

Computer Science > Computation and Language

Title:"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators