Unsupervised Distillation of Syntactic Information from Contextualized Word Representations

Ravfogel, Shauli; Elazar, Yanai; Goldberger, Jacob; Goldberg, Yoav

arXiv:2010.05265 (cs)

[Submitted on 11 Oct 2020 (v1), last revised 11 Mar 2021 (this version, v2)]

Abstract: Contextualized word representations, such as ELMo and BERT, have been shown to perform well on various semantic and syntactic tasks. In this work, we tackle the task of unsupervised disentanglement between semantics and structure in neural language representations: we aim to learn a transformation of the contextualized vectors that discards the lexical semantics but keeps the structural information. To this end, we automatically generate groups of sentences that are structurally similar but semantically different, and use a metric-learning approach to learn a transformation that emphasizes the structural component encoded in the vectors. We demonstrate that our transformation clusters vectors in space by structural properties, rather than by lexical semantics. Finally, we demonstrate the utility of our distilled representations by showing that they outperform the original contextualized representations in a few-shot parsing setting.
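The abstract names only "a metric-learning approach" and does not state the objective. As a rough illustration of the idea, the sketch below uses a triplet hinge loss, one common metric-learning objective, chosen here as an assumption rather than as the paper's confirmed method. The helper names (`transform`, `distance`, `triplet_loss`), the margin value, and the toy 2-D vectors are all hypothetical: the first coordinate stands in for lexical content and the second for structure.

```python
import math


def transform(W, x):
    """Apply a linear map W (a list of rows) to the vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def distance(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(W, anchor, positive, negative, margin=1.0):
    """Hinge triplet loss computed on the transformed vectors.

    anchor, positive: vectors assumed to come from structurally similar
        but semantically different sentences.
    negative: a vector assumed to come from a structurally different context.
    The loss is zero once the positive is closer to the anchor than the
    negative by at least `margin`.
    """
    a, p, n = (transform(W, v) for v in (anchor, positive, negative))
    return max(0.0, distance(a, p) - distance(a, n) + margin)


# Toy data (made up): coordinate 0 ~ lexical content, coordinate 1 ~ structure.
anchor = [3.0, 1.0]
positive = [-2.0, 1.0]   # different "words", same "structure"
negative = [3.0, 4.0]    # same "words", different "structure"

identity = [[1.0, 0.0], [0.0, 1.0]]
keep_structure = [[0.0, 1.0]]  # projects away the lexical coordinate

loss_before = triplet_loss(identity, anchor, positive, negative)
loss_after = triplet_loss(keep_structure, anchor, positive, negative)
print(loss_before, loss_after)
```

In this toy setup the projection that drops the lexical coordinate drives the loss to zero, while the identity map does not. A learned transformation would be fit by minimizing such a loss over many automatically generated triplets.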

Comments:	Accepted in BlackboxNLP@EMNLP2020
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2010.05265 [cs.CL]
	(or arXiv:2010.05265v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2010.05265

Submission history

From: Shauli Ravfogel
[v1] Sun, 11 Oct 2020 15:13:18 UTC (4,135 KB)
[v2] Thu, 11 Mar 2021 20:41:09 UTC (4,136 KB)
