Enhanced Language Representation with Label Knowledge for Span Extraction

Yang, Pan; Cong, Xin; Sun, Zhenyun; Liu, Xingwu

Computer Science > Computation and Language

arXiv:2111.00884 (cs)

[Submitted on 1 Nov 2021]

Title:Enhanced Language Representation with Label Knowledge for Span Extraction

Authors:Pan Yang, Xin Cong, Zhenyun Sun, Xingwu Liu

View PDF

Abstract:Span extraction, aiming to extract text spans (such as words or phrases) from plain texts, is a fundamental process in Information Extraction. Recent works introduce the label knowledge to enhance the text representation by formalizing the span extraction task into a question answering problem (QA Formalization), which achieves state-of-the-art performance. However, QA Formalization does not fully exploit the label knowledge and suffers from low efficiency in training/inference. To address those problems, we introduce a new paradigm to integrate label knowledge and further propose a novel model to explicitly and efficiently integrate label knowledge into text representations. Specifically, it encodes texts and label annotations independently and then integrates label knowledge into text representation with an elaborate-designed semantics fusion module. We conduct extensive experiments on three typical span extraction tasks: flat NER, nested NER, and event detection. The empirical results show that 1) our method achieves state-of-the-art performance on four benchmarks, and 2) reduces training time and inference time by 76% and 77% on average, respectively, compared with the QA Formalization paradigm. Our code and data are available at this https URL.

Comments:	Accepted to the main conference of EMNLP 2021 (long paper)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2111.00884 [cs.CL]
	(or arXiv:2111.00884v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2111.00884

Submission history

From: Xin Cong [view email]
[v1] Mon, 1 Nov 2021 12:21:05 UTC (1,508 KB)

Computer Science > Computation and Language

Title:Enhanced Language Representation with Label Knowledge for Span Extraction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Enhanced Language Representation with Label Knowledge for Span Extraction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators