AlignSum: Data Pyramid Hierarchical Fine-tuning for Aligning with Human Summarization Preference

Han, Yang; Wang, Yiming; Wang, Rui; Chen, Lu; Yu, Kai

arXiv:2410.00409 (cs)

[Submitted on 1 Oct 2024]

Abstract: Text summarization tasks commonly employ Pre-trained Language Models (PLMs) to fit diverse standard datasets. While these PLMs excel in automatic evaluations, they frequently underperform in human evaluations, indicating a gap between their generated summaries and human summarization preferences. This discrepancy is likely due to the low quality of fine-tuning datasets and the limited availability of high-quality human-annotated data that reflect true human preferences. To address this challenge, we introduce AlignSum, a novel framework for aligning with human summarization preferences. The framework consists of three parts. First, we construct a Data Pyramid from extractive, abstractive, and human-annotated summary data. Second, we apply Gaussian Resampling to remove summaries with extreme lengths. Finally, we perform two-stage hierarchical fine-tuning on the Data Pyramid after Gaussian Resampling. We apply AlignSum to PLMs on the human-annotated CNN/DailyMail and BBC XSum datasets. Experiments show that with AlignSum, PLMs like BART-Large surpass the 175B GPT-3 in both automatic and human evaluations. This demonstrates that AlignSum significantly enhances the alignment of language models with human summarization preferences.
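
The abstract does not spell out how Gaussian Resampling works, so the following is only a minimal sketch of one plausible reading: fit a normal distribution to the summary lengths and drop pairs whose length falls far from the mean. The function name `gaussian_length_filter`, the whitespace tokenization, and the `num_std` cutoff are all assumptions made for illustration and are not taken from the paper or its released code.

```python
import statistics
from typing import List, Tuple


def gaussian_length_filter(
    pairs: List[Tuple[str, str]],
    num_std: float = 2.0,  # hypothetical cutoff; the abstract gives no value
) -> List[Tuple[str, str]]:
    """Keep (document, summary) pairs whose summary length is not extreme.

    Length is measured in whitespace-separated tokens (an assumption).
    A pair is kept when its length lies within `num_std` standard
    deviations of the mean length of all summaries in `pairs`.
    """
    lengths = [len(summary.split()) for _, summary in pairs]
    if len(lengths) < 2:
        # Too few samples to estimate a spread; keep everything.
        return list(pairs)

    mean = statistics.fmean(lengths)
    std = statistics.stdev(lengths)
    if std == 0:
        # All summaries have the same length; nothing is extreme.
        return list(pairs)

    low = mean - num_std * std
    high = mean + num_std * std
    return [pair for pair, n in zip(pairs, lengths) if low <= n <= high]


if __name__ == "__main__":
    # Toy data: ten 20-token summaries and one 500-token outlier.
    data = [("doc", " ".join(["w"] * 20)) for _ in range(10)]
    data.append(("doc", " ".join(["w"] * 500)))
    kept = gaussian_length_filter(data)
    print(len(data), "pairs before filtering,", len(kept), "after")
```

Under this reading, the filtered extractive, abstractive, and human-annotated tiers would then feed the two fine-tuning stages. How the tiers are assigned to the stages is not stated in the abstract and is left out of the sketch.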

Comments:	EMNLP 2024 Findings, code at: this https URL
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2410.00409 [cs.CL]
	(or arXiv:2410.00409v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.00409

Submission history

From: Yang Han
[v1] Tue, 1 Oct 2024 05:14:48 UTC (1,976 KB)
