Abstract
This paper presents a method for extracting and generating a short summary of a news article within a limit of 60 Chinese characters. First, we preprocess the article by segmenting it into sentences and words, and then extract four kinds of central words from the parse tree to build a keyword dictionary. Next, four features (sentence weight, sentence similarity, sentence position, and sentence length) are used to measure the significance of each sentence. Finally, we select the two sentences with the highest significance scores and compress them to produce the summary for each article. The compression step analyzes the grammatical elements of the original sentences to derive compression rules and trims syntactic elements according to their parse trees. Evaluation results show that our system is efficient in Chinese news summarization.
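The sentence-ranking step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the feature formulas or how they are combined, so the weighted linear sum, the weights, the use of the title as the similarity reference, the character-overlap similarity, and the ideal length of 30 characters are all assumptions made for illustration. The compression step and the 60-character limit are not modeled.

```python
def keyword_weight(sentence, keywords):
    # Assumed "sentence weight": sum of dictionary weights of keywords present.
    return sum(w for word, w in keywords.items() if word in sentence)


def char_overlap_similarity(a, b):
    # Assumed "sentence similarity": Jaccard overlap of character sets.
    sa, sb = set(a), set(b)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def position_score(index, total):
    # Assumed "sentence position": earlier sentences score higher.
    return 1.0 - index / total


def length_score(sentence, ideal=30):
    # Assumed "sentence length": penalize distance from a hypothetical ideal.
    return max(0.0, 1.0 - abs(len(sentence) - ideal) / ideal)


def select_top(sentences, title, keywords, k=2, weights=(0.4, 0.3, 0.2, 0.1)):
    """Return the k highest-scoring sentences, best first."""
    total = len(sentences)
    scored = []
    for i, s in enumerate(sentences):
        features = (
            keyword_weight(s, keywords),
            char_overlap_similarity(s, title),
            position_score(i, total),
            length_score(s),
        )
        # Hypothetical combination: weighted linear sum of the four features.
        score = sum(w * f for w, f in zip(weights, features))
        scored.append((score, i, s))
    # Stable sort, so tied sentences keep their document order.
    scored.sort(key=lambda t: -t[0])
    return [s for _, _, s in scored[:k]]


if __name__ == "__main__":
    article = [
        "市政府今天宣布新的交通计划。",
        "天气很好。",
        "该计划将于明年开始实施。",
    ]
    keyword_dict = {"计划": 1.0, "交通": 0.5}  # hypothetical keyword dictionary
    for sentence in select_top(article, "市政府宣布交通计划", keyword_dict):
        print(sentence)
```

In a full pipeline, the two selected sentences would then be passed to a parse-tree-based compression step to fit the length limit.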
Acknowledgments
The work presented in this paper is partially supported by the Major Projects of National Social Science Foundation of China under No. 11&ZD189, Natural Science Foundation of China under No. 61402341, Planning Foundation of Wuhan Science and Technology Bureau under No. 2016060101010047, and Open Foundation of Hubei Province Key Laboratory under No. 2016znss05A.
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Liu, M., Yu, Y., Qi, Q., Hu, H., Ren, H. (2018). Extractive Single Document Summarization via Multi-feature Combination and Sentence Compression. In: Huang, X., Jiang, J., Zhao, D., Feng, Y., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2017. Lecture Notes in Computer Science, vol 10619. Springer, Cham. https://doi.org/10.1007/978-3-319-73618-1_70
DOI: https://doi.org/10.1007/978-3-319-73618-1_70
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73617-4
Online ISBN: 978-3-319-73618-1
eBook Packages: Computer Science, Computer Science (R0)