Establishing Strong Baselines for the New Decade: Sequence Tagging, Syntactic and Semantic Parsing with BERT

He, Han; Choi, Jinho D.

[Submitted on 14 Aug 2019 (v1), last revised 23 May 2020 (this version, v4)]

Abstract: This paper presents new state-of-the-art models for three tasks, part-of-speech tagging, syntactic parsing, and semantic parsing, using the contextualized embedding framework known as BERT. For each task, we first replicate and simplify the current state-of-the-art approach to improve its efficiency. We then evaluate our simplified approaches on those three tasks using token embeddings generated by BERT. Twelve datasets in English and Chinese are used in our experiments. The BERT models outperform the previously best-performing models by 2.5% on average (7.5% for the most significant case). Moreover, an in-depth analysis of the impact of BERT embeddings is provided using self-attention, which helps in understanding this rich representation. All models and source code are publicly available so that researchers can use and improve upon them to establish strong baselines for the next decade.

Comments:	Accepted to the International Florida Artificial Intelligence Research Society Conference, FLAIRS 2020
Subjects:	Computation and Language (cs.CL)
ACM classes:	I.2.7
Cite as:	arXiv:1908.04943 [cs.CL]
	(or arXiv:1908.04943v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1908.04943

Submission history

From: Han He
[v1] Wed, 14 Aug 2019 03:45:15 UTC (1,898 KB)
[v2] Sun, 23 Feb 2020 05:03:05 UTC (1,922 KB)
[v3] Tue, 7 Apr 2020 17:10:30 UTC (1,922 KB)
[v4] Sat, 23 May 2020 04:25:58 UTC (1,922 KB)
