Prompt Tuning on Graph-augmented Low-resource Text Classification

Wen, Zhihao; Fang, Yuan

Computer Science > Information Retrieval

arXiv:2307.10230 (cs)

[Submitted on 15 Jul 2023 (v1), last revised 19 Aug 2024 (this version, v4)]

Title:Prompt Tuning on Graph-augmented Low-resource Text Classification

Authors:Zhihao Wen, Yuan Fang

View PDF HTML (experimental)

Abstract:Text classification is a fundamental problem in information retrieval with many real-world applications, such as predicting the topics of online articles and the categories of e-commerce product descriptions. However, low-resource text classification, with no or few labeled samples, presents a serious concern for supervised learning. Meanwhile, many text data are inherently grounded on a network structure, such as a hyperlink/citation network for online articles, and a user-item purchase network for e-commerce products. These graph structures capture rich semantic relationships, which can potentially augment low-resource text classification. In this paper, we propose a novel model called Graph-Grounded Pre-training and Prompting (G2P2) to address low-resource text classification in a two-pronged approach. During pre-training, we propose three graph interaction-based contrastive strategies to jointly pre-train a graph-text model; during downstream classification, we explore handcrafted discrete prompts and continuous prompt tuning for the jointly pre-trained model to achieve zero- and few-shot classification, respectively. Moreover, we explore the possibility of employing continuous prompt tuning for zero-shot inference. Specifically, we aim to generalize continuous prompts to unseen classes while leveraging a set of base classes. To this end, we extend G2P2 into G2P2$^*$, hinging on a new architecture of conditional prompt tuning. Extensive experiments on four real-world datasets demonstrate the strength of G2P2 in zero- and few-shot low-resource text classification tasks, and illustrate the advantage of G2P2$^*$ in dealing with unseen classes.

Comments:	15 pages, accepted by TKDE (IEEE Transactions on Knowledge and Data Engineering). arXiv admin note: substantial text overlap with arXiv:2305.03324
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2307.10230 [cs.IR]
	(or arXiv:2307.10230v4 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2307.10230

Submission history

From: Zhihao Wen [view email]
[v1] Sat, 15 Jul 2023 11:49:43 UTC (390 KB)
[v2] Fri, 17 Nov 2023 04:00:09 UTC (1,006 KB)
[v3] Mon, 27 Nov 2023 10:38:30 UTC (511 KB)
[v4] Mon, 19 Aug 2024 13:53:12 UTC (504 KB)

Computer Science > Information Retrieval

Title:Prompt Tuning on Graph-augmented Low-resource Text Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Prompt Tuning on Graph-augmented Low-resource Text Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators