DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models

Marjanović, Sara Vera; Yu, Haeun; Atanasova, Pepa; Maistro, Maria; Lioma, Christina; Augenstein, Isabelle

Computer Science > Computation and Language

arXiv:2407.17023 (cs)

[Submitted on 24 Jul 2024 (v1), last revised 7 Oct 2024 (this version, v2)]

Title:DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models

Authors:Sara Vera Marjanović, Haeun Yu, Pepa Atanasova, Maria Maistro, Christina Lioma, Isabelle Augenstein

View PDF HTML (experimental)

Abstract:Knowledge-intensive language understanding tasks require Language Models (LMs) to integrate relevant context, mitigating their inherent weaknesses, such as incomplete or outdated knowledge. However, conflicting knowledge can be present in the LM's parameters, termed intra-memory conflict, which can affect a model's propensity to accept contextual knowledge. To study the effect of intra-memory conflict on an LM's ability to accept relevant context, we utilize two knowledge conflict measures and a novel dataset containing inherently conflicting data, DynamicQA. This dataset includes facts with a temporal dynamic nature where facts can change over time and disputable dynamic facts, which can change depending on the viewpoint. DynamicQA is the first to include real-world knowledge conflicts and provide context to study the link between the different types of knowledge conflicts. We also evaluate several measures on their ability to reflect the presence of intra-memory conflict: semantic entropy and a novel coherent persuasion score. With our extensive experiments, we verify that LMs exhibit a greater degree of intra-memory conflict with dynamic facts compared to facts that have a single truth value. Furthermore, we reveal that facts with intra-memory conflict are harder to update with context, suggesting that retrieval-augmented generation will struggle with the most commonly adapted facts.

Comments:	15 pages, 6 figures, Accepted to Findings of EMNLP 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
MSC classes:	68T50
ACM classes:	I.2.7
Cite as:	arXiv:2407.17023 [cs.CL]
	(or arXiv:2407.17023v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.17023

Submission history

From: Haeun Yu [view email]
[v1] Wed, 24 Jul 2024 06:06:07 UTC (32,479 KB)
[v2] Mon, 7 Oct 2024 11:59:37 UTC (10,801 KB)

Computer Science > Computation and Language

Title:DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators