Hierarchy-Aware T5 with Path-Adaptive Mask Mechanism for Hierarchical Text Classification

Huang, Wei; Liu, Chen; Zhao, Yihua; Yang, Xinyun; Pan, Zhaoming; Zhang, Zhimin; Liu, Guiquan

Computer Science > Computation and Language

arXiv:2109.08585 (cs)

[Submitted on 17 Sep 2021]

Abstract: Hierarchical Text Classification (HTC), which aims to predict text labels organized in a hierarchical space, is an important but under-investigated task in natural language processing. Existing methods usually encode the entire hierarchical structure and fail to construct a robust label-dependent model, which makes it hard to predict sparse lower-level labels accurately and results in low Macro-F1. In this paper, we propose a novel PAMM-HiA-T5 model for HTC: a hierarchy-aware T5 model with a path-adaptive mask mechanism that not only builds the knowledge of upper-level labels into lower-level ones but also introduces path dependency information into label prediction. Specifically, we generate a multi-level sequential label structure with Breadth-First Search (BFS) and the T5 model to exploit hierarchical dependency across different levels. To further improve label dependency prediction within each path, we then propose an original path-adaptive mask mechanism (PAMM) that identifies each label's path information and eliminates noise from other paths. Comprehensive experiments on three benchmark datasets show that our PAMM-HiA-T5 model greatly outperforms all state-of-the-art HTC approaches, especially in Macro-F1. The ablation studies show that the improvements mainly come from our approach rather than from T5 itself.
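The two ideas named in the abstract, a BFS-ordered label sequence and a mask that restricts each label to its own path, can be illustrated with a small sketch. This is only one plausible reading of the abstract and not the authors' implementation: the toy hierarchy, the function names `bfs_label_sequence` and `path_mask`, and the boolean-matrix form of the mask are all assumptions made for illustration, and the sketch assumes the gold label set includes every ancestor of each gold label.

```python
from collections import deque

# Hypothetical toy hierarchy (parent -> children); not taken from the paper.
HIERARCHY = {
    "root": ["news", "sports"],
    "news": ["politics", "economy"],
    "sports": ["football"],
    "politics": [],
    "economy": [],
    "football": [],
}


def bfs_label_sequence(hierarchy, gold_labels, root="root"):
    """Return the gold labels in breadth-first (level) order plus a parent map.

    Assumption: gold_labels is ancestor-closed, so every gold label is
    reachable from the root through gold labels only.
    """
    order, parent = [], {}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for child in hierarchy.get(node, []):
            if child in gold_labels:
                parent[child] = node
                order.append(child)
                queue.append(child)
    return order, parent


def path_mask(order, parent):
    """Build a boolean matrix where mask[i][j] is True only if label j is
    label i itself or one of its ancestors (i.e. lies on the same path)."""
    index = {label: i for i, label in enumerate(order)}
    mask = [[False] * len(order) for _ in order]
    for label, i in index.items():
        node = label
        while node in index:  # stops at the root, which is not in the sequence
            mask[i][index[node]] = True
            node = parent[node]
    return mask


gold = {"news", "sports", "politics", "football"}
sequence, parents = bfs_label_sequence(HIERARCHY, gold)
mask = path_mask(sequence, parents)
print(sequence)
for row in mask:
    print(row)
```

In this sketch, upper-level labels come first in the target sequence, so a sequence-to-sequence model such as T5 would generate them before the lower-level labels that depend on them, and the mask keeps `politics` from attending to `sports`, which lies on a different path.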

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2109.08585 [cs.CL]
	(or arXiv:2109.08585v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.08585

Submission history

From: Wei Huang [view email]
[v1] Fri, 17 Sep 2021 15:03:03 UTC (777 KB)
