Learning Multiplex Representations on Text-Attributed Graphs with One Language Model Encoder

Jin, Bowen; Zhang, Wentao; Zhang, Yu; Meng, Yu; Zhao, Han; Han, Jiawei

Computer Science > Computation and Language

arXiv:2310.06684 (cs)

[Submitted on 10 Oct 2023 (v1), last revised 13 Jul 2024 (this version, v2)]

Title:Learning Multiplex Representations on Text-Attributed Graphs with One Language Model Encoder

Authors:Bowen Jin, Wentao Zhang, Yu Zhang, Yu Meng, Han Zhao, Jiawei Han

View PDF HTML (experimental)

Abstract:In real-world scenarios, texts in a graph are often linked by multiple semantic relations (e.g., papers in an academic graph are referenced by other publications, written by the same author, or published in the same venue), where text documents and their relations form a multiplex text-attributed graph. Mainstream text representation learning methods use pretrained language models (PLMs) to generate one embedding for each text unit, expecting that all types of relations between texts can be captured by these single-view embeddings. However, this presumption does not hold particularly in multiplex text-attributed graphs. Along another line of work, multiplex graph neural networks (GNNs) directly initialize node attributes as a feature vector for node representation learning, but they cannot fully capture the semantics of the nodes' associated texts. To bridge these gaps, we propose METAG, a new framework for learning Multiplex rEpresentations on Text-Attributed Graphs. In contrast to existing methods, METAG uses one text encoder to model the shared knowledge across relations and leverages a small number of parameters per relation to derive relation-specific representations. This allows the encoder to effectively capture the multiplex structures in the graph while also preserving parameter efficiency. We conduct experiments on nine downstream tasks in five graphs from both academic and e-commerce domains, where METAG outperforms baselines significantly and consistently. The code is available at this https URL.

Comments:	9 pages, 11 appendix pages
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2310.06684 [cs.CL]
	(or arXiv:2310.06684v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.06684

Submission history

From: Bowen Jin [view email]
[v1] Tue, 10 Oct 2023 14:59:22 UTC (29,350 KB)
[v2] Sat, 13 Jul 2024 17:43:09 UTC (755 KB)

Computer Science > Computation and Language

Title:Learning Multiplex Representations on Text-Attributed Graphs with One Language Model Encoder

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Learning Multiplex Representations on Text-Attributed Graphs with One Language Model Encoder

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators