Cross-lingual Universal Dependency Parsing Only from One Monolingual Treebank

Sun, Kailai; Li, Zuchao; Zhao, Hai

Computer Science > Computation and Language

arXiv:2012.13163v2 (cs)

[Submitted on 24 Dec 2020 (v1), last revised 23 Apr 2021 (this version, v2)]

Title:Cross-lingual Universal Dependency Parsing Only from One Monolingual Treebank

Authors:Kailai Sun, Zuchao Li, Hai Zhao

View PDF

Abstract:Syntactic parsing is a highly linguistic processing task whose parser requires training on treebanks from the expensive human annotation. As it is unlikely to obtain a treebank for every human language, in this work, we propose an effective cross-lingual UD parsing framework for transferring parser from only one source monolingual treebank to any other target languages without treebank available. To reach satisfactory parsing accuracy among quite different languages, we introduce two language modeling tasks into dependency parsing as multi-tasking. Assuming only unlabeled data from target languages plus the source treebank can be exploited together, we adopt a self-training strategy for further performance improvement in terms of our multi-task framework. Our proposed cross-lingual parsers are implemented for English, Chinese, and 22 UD treebanks. The empirical study shows that our cross-lingual parsers yield promising results for all target languages, for the first time, approaching the parser performance which is trained in its own target treebank.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2012.13163 [cs.CL]
	(or arXiv:2012.13163v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2012.13163

Submission history

From: Zuchao Li [view email]
[v1] Thu, 24 Dec 2020 08:14:36 UTC (60 KB)
[v2] Fri, 23 Apr 2021 06:36:16 UTC (240 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2020-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zuchao Li
Hai Zhao

export BibTeX citation

Computer Science > Computation and Language

Title:Cross-lingual Universal Dependency Parsing Only from One Monolingual Treebank

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Cross-lingual Universal Dependency Parsing Only from One Monolingual Treebank

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators