From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader

Xu, Weiwen; Li, Xin; Zhang, Wenxuan; Zhou, Meng; Lam, Wai; Si, Luo; Bing, Lidong

Computer Science > Computation and Language

arXiv:2212.04755v3 (cs)

[Submitted on 9 Dec 2022 (v1), last revised 16 Oct 2023 (this version, v3)]

Title:From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader

Authors:Weiwen Xu, Xin Li, Wenxuan Zhang, Meng Zhou, Wai Lam, Luo Si, Lidong Bing

View PDF

Abstract:We present Pre-trained Machine Reader (PMR), a novel method for retrofitting pre-trained masked language models (MLMs) to pre-trained machine reading comprehension (MRC) models without acquiring labeled data. PMR can resolve the discrepancy between model pre-training and downstream fine-tuning of existing MLMs. To build the proposed PMR, we constructed a large volume of general-purpose and high-quality MRC-style training data by using Wikipedia hyperlinks and designed a Wiki Anchor Extraction task to guide the MRC-style pre-training. Apart from its simplicity, PMR effectively solves extraction tasks, such as Extractive Question Answering and Named Entity Recognition. PMR shows tremendous improvements over existing approaches, especially in low-resource scenarios. When applied to the sequence classification task in the MRC formulation, PMR enables the extraction of high-quality rationales to explain the classification process, thereby providing greater prediction explainability. PMR also has the potential to serve as a unified model for tackling various extraction and classification tasks in the MRC formulation.

Comments:	Accepted to NeurIPS 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2212.04755 [cs.CL]
	(or arXiv:2212.04755v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2212.04755

Submission history

From: Weiwen Xu [view email]
[v1] Fri, 9 Dec 2022 10:21:56 UTC (2,982 KB)
[v2] Thu, 18 May 2023 07:36:32 UTC (450 KB)
[v3] Mon, 16 Oct 2023 05:45:30 UTC (429 KB)

Computer Science > Computation and Language

Title:From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators