Retrofitting Structure-aware Transformer Language Model for End Tasks

Fei, Hao; Ren, Yafeng; Ji, Donghong

Computer Science > Computation and Language

arXiv:2009.07408 (cs)

[Submitted on 16 Sep 2020]

Authors: Hao Fei, Yafeng Ren, Donghong Ji

Abstract: We retrofit a structure-aware Transformer-based language model to facilitate end tasks, exploiting syntactic distance to encode both phrasal constituency and dependency connections into the language model. A middle-layer structural learning strategy is used for structure integration, carried out together with the main semantic task training under a multi-task learning scheme. Experimental results show that the retrofitted structure-aware Transformer language model achieves improved perplexity while also inducing accurate syntactic phrases. With structure-aware fine-tuning, our model achieves significant improvements on both semantic- and syntactic-dependent tasks.
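
The abstract does not spell out how syntactic distances relate to phrases, so the following is only a minimal sketch of the general idea from the syntactic-distance literature, not the authors' implementation. It assumes one distance value per gap between adjacent tokens and recovers a binary bracketing by recursively splitting at the largest distance. The function name `distances_to_tree` and the toy distance values are hypothetical.

```python
from typing import List, Tuple, Union

# A tree is either a single token or a pair of subtrees.
Tree = Union[str, Tuple["Tree", "Tree"]]


def distances_to_tree(words: List[str], distances: List[float]) -> Tree:
    """Greedy top-down bracketing from syntactic distances.

    Assumption: distances[i] is the distance between words[i] and
    words[i + 1], so there is exactly one value per gap.
    """
    if len(distances) != len(words) - 1:
        raise ValueError("expected exactly one distance per adjacent word pair")
    if len(words) == 1:
        return words[0]
    # The largest distance marks the highest-level constituent boundary
    # (ties are resolved in favour of the leftmost gap).
    split = max(range(len(distances)), key=distances.__getitem__)
    left = distances_to_tree(words[: split + 1], distances[:split])
    right = distances_to_tree(words[split + 1 :], distances[split + 1 :])
    return (left, right)


# Toy example with made-up distances (not taken from the paper).
tree = distances_to_tree(["the", "cat", "sat", "down"], [1.0, 3.0, 2.0])
print(tree)
```

In this toy input the largest distance sits between "cat" and "sat", so the sentence is first split there and each half is then bracketed in the same way.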

Comments:	Accepted as a long paper in the EMNLP 2020 main proceedings
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2009.07408 [cs.CL]
	(or arXiv:2009.07408v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2009.07408

Submission history

From: Hao Fei
[v1] Wed, 16 Sep 2020 01:07:07 UTC (456 KB)
