Using Context-to-Vector with Graph Retrofitting to Improve Word Embeddings

Zheng, Jiangbin; Wang, Yile; Wang, Ge; Xia, Jun; Huang, Yufei; Zhao, Guojiang; Zhang, Yue; Li, Stan Z.

Computer Science > Computation and Language

arXiv:2210.16848 (cs)

[Submitted on 30 Oct 2022 (v1), last revised 23 Mar 2023 (this version, v2)]

Title:Using Context-to-Vector with Graph Retrofitting to Improve Word Embeddings

Authors:Jiangbin Zheng, Yile Wang, Ge Wang, Jun Xia, Yufei Huang, Guojiang Zhao, Yue Zhang, Stan Z. Li

View PDF

Abstract:Although contextualized embeddings generated from large-scale pre-trained models perform well in many tasks, traditional static embeddings (e.g., Skip-gram, Word2Vec) still play an important role in low-resource and lightweight settings due to their low computational cost, ease of deployment, and stability. In this paper, we aim to improve word embeddings by 1) incorporating more contextual information from existing pre-trained models into the Skip-gram framework, which we call Context-to-Vec; 2) proposing a post-processing retrofitting method for static embeddings independent of training by employing priori synonym knowledge and weighted vector distribution. Through extrinsic and intrinsic tasks, our methods are well proven to outperform the baselines by a large margin.

Comments:	Accepted to ACL 2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.16848 [cs.CL]
	(or arXiv:2210.16848v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.16848

Submission history

From: Jiangbin Zheng [view email]
[v1] Sun, 30 Oct 2022 14:15:43 UTC (1,154 KB)
[v2] Thu, 23 Mar 2023 14:35:30 UTC (1,147 KB)

Computer Science > Computation and Language

Title:Using Context-to-Vector with Graph Retrofitting to Improve Word Embeddings

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Using Context-to-Vector with Graph Retrofitting to Improve Word Embeddings

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators