SATS: simplification aware text summarization of scientific documents

Farooq Zaman; Faisal Kamiran; Matthew Shardlow; Saeed-Ul Hassan; Asim Karim; Naif Radi Aljohani

doi:10.3389/frai.2024.1375419

SATS: simplification aware text summarization of scientific documents

Front Artif Intell. 2024 Jul 10:7:1375419. doi: 10.3389/frai.2024.1375419. eCollection 2024.

Authors

Farooq Zaman¹, Faisal Kamiran¹, Matthew Shardlow², Saeed-Ul Hassan², Asim Karim³, Naif Radi Aljohani⁴

Affiliations

¹ Scientometrics Lab, Information Technology University, Lahore, Pakistan.
² Department of Computing and Mathematics, Manchester Metropolitan University, Manchester, United Kingdom.
³ Department of Computer Science, Syed Babar Ali School of Science and Engineering (SBASSE), Lahore University of Management Sciences, Lahore, Pakistan.
⁴ Information Systems Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia.

Abstract

Simplifying summaries of scholarly publications has been a popular method for conveying scientific discoveries to a broader audience. While text summarization aims to shorten long documents, simplification seeks to reduce the complexity of a document. To accomplish these tasks collectively, there is a need to develop machine learning methods to shorten and simplify longer texts. This study presents a new Simplification Aware Text Summarization model (SATS) based on future n-gram prediction. The proposed SATS model extends ProphetNet, a text summarization model, by enhancing the objective function using a word frequency lexicon for simplification tasks. We have evaluated the performance of SATS on a recently published text summarization and simplification corpus consisting of 5,400 scientific article pairs. Our results in terms of automatic evaluation demonstrate that SATS outperforms state-of-the-art models for simplification, summarization, and joint simplification-summarization across two datasets on ROUGE, SARI, and CSS₁ . We also provide human evaluation of summaries generated by the SATS model. We evaluated 100 summaries from eight annotators for grammar, coherence, consistency, fluency, and simplicity. The average human judgment for all evaluated dimensions lies between 4.0 and 4.5 on a scale from 1 to 5 where 1 means low and 5 means high.

Keywords: deep learning; scientific documents; simplification; summarization; transformer model.

Grants and funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.