DOI: 10.1145/3580305.3599376

Heterformer: Transformer-based Deep Node Representation Learning on Heterogeneous Text-Rich Networks

Published: 04 August 2023

Abstract

Representation learning on networks aims to derive a meaningful vector representation for each node, thereby facilitating downstream tasks such as link prediction, node classification, and node clustering. In heterogeneous text-rich networks, this task is more challenging due to (1) the presence or absence of text: some nodes are associated with rich textual information, while others are not; and (2) the diversity of types: nodes and edges of multiple types form a heterogeneous network structure. As pretrained language models (PLMs) have demonstrated their effectiveness in obtaining widely generalizable text representations, substantial effort has been made to incorporate PLMs into representation learning on text-rich networks. However, few of these approaches can effectively and jointly consider heterogeneous structure (network) information and the rich textual semantic information of each node. In this paper, we propose Heterformer, a Heterogeneous Network-Empowered Transformer that performs contextualized text encoding and heterogeneous structure encoding in a unified model. Specifically, we inject heterogeneous structure information into each Transformer layer when encoding node texts. Meanwhile, Heterformer can characterize node/edge type heterogeneity and encode nodes with or without texts. We conduct comprehensive experiments on three tasks (i.e., link prediction, node classification, and node clustering) on three large-scale datasets from different domains, where Heterformer outperforms competitive baselines significantly and consistently. The code can be found at https://github.com/PeterGriffinJin/Heterformer.
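To make the layer-wise structure injection concrete, below is a minimal, illustrative PyTorch sketch of the core idea: when a node's text is encoded, each Transformer layer lets the text tokens attend over both the other tokens and type-aware embeddings of the node's neighbors. The class name HeterformerLayerSketch, the learned type embeddings, and all shapes are simplifying assumptions for illustration, not the authors' implementation; the actual model is in the linked repository.

```python
# A minimal sketch (NOT the official implementation) of a "network-empowered"
# Transformer layer in the spirit of Heterformer: heterogeneous structure
# information is injected into the text encoder at every layer by letting the
# node's text tokens attend over neighbor-node embeddings as extra key/value
# entries. HeterformerLayerSketch and the type-embedding scheme below are
# hypothetical simplifications.
import torch
import torch.nn as nn

class HeterformerLayerSketch(nn.Module):
    def __init__(self, hidden: int = 256, heads: int = 4, num_node_types: int = 3):
        super().__init__()
        # A learned embedding per node type: a crude stand-in for modeling
        # node-type heterogeneity (the paper's treatment is more elaborate).
        self.type_emb = nn.Embedding(num_node_types, hidden)
        self.attn = nn.MultiheadAttention(hidden, heads, batch_first=True)
        self.norm1 = nn.LayerNorm(hidden)
        self.ffn = nn.Sequential(
            nn.Linear(hidden, 4 * hidden), nn.GELU(), nn.Linear(4 * hidden, hidden)
        )
        self.norm2 = nn.LayerNorm(hidden)

    def forward(self, tokens, neighbors, neighbor_types):
        # tokens:         (batch, seq_len, hidden) token states of the node's text
        # neighbors:      (batch, n_nbr, hidden)   embeddings of neighbor nodes
        #                                          (textless nodes would contribute
        #                                          learned embeddings here)
        # neighbor_types: (batch, n_nbr)           integer type id per neighbor
        typed_neighbors = neighbors + self.type_emb(neighbor_types)
        # Structure injection: queries are the text tokens, but keys/values are
        # the concatenation of text tokens and typed neighbor embeddings.
        kv = torch.cat([tokens, typed_neighbors], dim=1)
        attn_out, _ = self.attn(tokens, kv, kv)
        h = self.norm1(tokens + attn_out)
        return self.norm2(h + self.ffn(h))

# Toy forward pass: 2 nodes, 32 text tokens each, 5 neighbors of 3 possible types.
layer = HeterformerLayerSketch()
tokens = torch.randn(2, 32, 256)
neighbors = torch.randn(2, 5, 256)
neighbor_types = torch.randint(0, 3, (2, 5))
print(layer(tokens, neighbors, neighbor_types).shape)  # torch.Size([2, 32, 256])
```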

Supplementary Material

MOV File (990-2min-promo.mov)
Heterogeneous text-rich networks are everywhere in the real world, e.g., academic networks and social media networks. We propose Heterformer, a network-empowered Transformer architecture that simultaneously captures text semantics and heterogeneous structure information. Experiments are conducted on three real-world, large-scale datasets, where we demonstrate the effectiveness of Heterformer.
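For concreteness, node encoders like this are commonly trained and evaluated on link prediction with an in-batch negative sampling objective over connected node pairs. The sketch below assumes that standard setup (dot-product scoring, cross-entropy over in-batch negatives); the exact loss and evaluation protocol used in the paper may differ and are documented in the repository.

```python
# Hedged sketch of a standard link-prediction objective for node embeddings:
# each positive (source, destination) pair in the batch treats every other
# in-batch destination as a negative. Assumed for illustration only.
import torch
import torch.nn.functional as F

def in_batch_link_loss(src_emb: torch.Tensor, dst_emb: torch.Tensor) -> torch.Tensor:
    # src_emb, dst_emb: (batch, hidden) embeddings of the two endpoints of
    # each observed edge, e.g. produced by a Heterformer-style encoder.
    scores = src_emb @ dst_emb.t()                                # (batch, batch)
    labels = torch.arange(scores.size(0), device=scores.device)  # diagonal = positives
    return F.cross_entropy(scores, labels)

src, dst = torch.randn(8, 256), torch.randn(8, 256)
print(in_batch_link_loss(src, dst))  # scalar loss
```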




      Published In

KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2023, 5996 pages
ISBN: 9798400701030
DOI: 10.1145/3580305
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].


Publisher

Association for Computing Machinery, New York, NY, United States


      Author Tags

      1. pretrained language model
      2. text-rich network
      3. transformer

      Qualifiers

      • Research-article

      Funding Sources

      • National Science Foundation IIS-17-41317
      • INCAS Program
      • National Science Foundation IIS-19-56151
      • the Institute for Geospatial Understanding through an Integrative Discovery Environment
      • US DARPA KAIROS Program
      • National Science Foundation IIS 17-04532
      • Molecule Maker Lab Institute

      Conference

      KDD '23

      Acceptance Rates

Overall Acceptance Rate: 1,133 of 8,635 submissions (13%)

      Article Metrics

• Downloads (last 12 months): 545
• Downloads (last 6 weeks): 40

Reflects downloads up to 04 Oct 2024


      Cited By

• (2024) MIMA: Multi-Feature Interaction Meta-Path Aggregation Heterogeneous Graph Neural Network for Recommendations. Future Internet, 16(8), 270. https://doi.org/10.3390/fi16080270. Online publication date: 29-Jul-2024.
• (2024) Deep Pre-Training Transformers for Scientific Paper Representation. Electronics, 13(11), 2123. https://doi.org/10.3390/electronics13112123. Online publication date: 29-May-2024.
• (2024) Text-Attributed Graph Representation Learning: Methods, Applications, and Challenges. Companion Proceedings of the ACM on Web Conference 2024, 1298-1301. https://doi.org/10.1145/3589335.3641255. Online publication date: 13-May-2024.
• (2024) Text-Rich Graph Neural Networks With Subjective-Objective Semantic Modeling. IEEE Transactions on Knowledge and Data Engineering, 36(9), 4956-4967. https://doi.org/10.1109/TKDE.2024.3378914. Online publication date: Sep-2024.
• (2024) Predicting collaborative relationship among scholars by integrating scholars' content-based and structure-based features. Scientometrics, 129(6), 3225-3244. https://doi.org/10.1007/s11192-024-05012-4. Online publication date: 1-Jun-2024.
• (2024) Type-adaptive graph Transformer for heterogeneous information networks. Applied Intelligence, 54(22), 11496-11509. https://doi.org/10.1007/s10489-024-05793-4. Online publication date: 24-Aug-2024.
• (2023) E2EG: End-to-End Node Classification Using Graph Topology and Text-based Node Attributes. 2023 IEEE International Conference on Data Mining Workshops (ICDMW), 1084-1091. https://doi.org/10.1109/ICDMW60847.2023.00142. Online publication date: 4-Dec-2023.
