Claim Check-Worthiness Detection as Positive Unlabelled Learning

Wright, Dustin; Augenstein, Isabelle

Computer Science > Computation and Language

arXiv:2003.02736 (cs)

[Submitted on 5 Mar 2020 (v1), last revised 16 Sep 2020 (this version, v2)]

Title:Claim Check-Worthiness Detection as Positive Unlabelled Learning

Authors:Dustin Wright, Isabelle Augenstein

View PDF

Abstract:As the first step of automatic fact checking, claim check-worthiness detection is a critical component of fact checking systems. There are multiple lines of research which study this problem: check-worthiness ranking from political speeches and debates, rumour detection on Twitter, and citation needed detection from Wikipedia. To date, there has been no structured comparison of these various tasks to understand their relatedness, and no investigation into whether or not a unified approach to all of them is achievable. In this work, we illuminate a central challenge in claim check-worthiness detection underlying all of these tasks, being that they hinge upon detecting both how factual a sentence is, as well as how likely a sentence is to be believed without verification. As such, annotators only mark those instances they judge to be clear-cut check-worthy. Our best performing method is a unified approach which automatically corrects for this using a variant of positive unlabelled learning that finds instances which were incorrectly labelled as not check-worthy. In applying this, we out-perform the state of the art in two of the three tasks studied for claim check-worthiness detection in English.

Comments:	13 pages, 2 figures, 9 tables
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2003.02736 [cs.CL]
	(or arXiv:2003.02736v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2003.02736

Submission history

From: Dustin Wright [view email]
[v1] Thu, 5 Mar 2020 16:06:07 UTC (778 KB)
[v2] Wed, 16 Sep 2020 16:52:15 UTC (7,739 KB)

Computer Science > Computation and Language

Title:Claim Check-Worthiness Detection as Positive Unlabelled Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Claim Check-Worthiness Detection as Positive Unlabelled Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators