Abstract
Automatic text summarization is leading topic of information retrieval research due to increasing online transfer of information. The large volume of information is limited due to constraint of memory devices and access time. The existing summarization system uses the sentence extraction technique where the important sentences are extracted and presented as summary. Various summarization methods are used which do not take context into consideration. The proposed system focuses on multi-document summarization which is based on context score. Bernoulli model of randomness is used to provide an informative score of bi-gram terms based on lexical association. The resulting weight is then used in the graph-based iterative algorithm to generate a summary. Experiments have been conducted over the self-generated 100 document and benchmark DUC data sets. It has been shown that proposed system outperforms the existing methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Das, D., Martins, A.F.T.: A survey on automatic text summarization. In: Literature Survey for the Language and Statistics II course at CMU 4, pp. 192–195 (2007)
Zhang, J., Sun, L., Zhou, q.: A cue-based hub-authority approach for multi-document text summarization. In: International Conference on Natural Language Processing and Knowledge Engineering, pp. 642–645 (2005)
Weu, F., He, Y., Li, W., Lu, Q.: A query-sensitive graph-based sentence ranking algorithm for query-oriented multi-document summarization. In: International Symposiums on Information Processing, pp. 9–13 (2008)
Thakkar, K.S., Dharaskar, R.V., Chandak, M.: Graph-based algorithms for text summarization. In: 3rd IEEE International Conference in Emerging Trends in Engineering and Technology (lCETET), pp. 516– 519 (2010)
Chatterjee, N., Mittal, A., Goyal, S.: Single document extractive texts summarization using genetic algorithms. In: Third International Conference on Emerging Applications of Information Technology (EAIT), pp. 19–23 (2012)
Sornil, O.. Gree-ut, K.: An Automatic text summarization approach using content-based and graph-based characteristics. In: IEEE Conference on Cybernetics and Intelligent Systems, pp. 1–6 (2006)
Ge, S.S., Zhang, Z., He, H.: Weighted graph model based sentence clustering and ranking for document summarization. In: 4th IEEE International Conference on in Interaction Sciences (ICIS), pp. 90–95 (2011)
Liu, D.-X., Hi, D.-X., Ji, D.-H., Yang, H.: A novel Chinese multi-document summarization using clustering based sentence extraction. In: Proceedings of the Fifth International Conference on Machine Learning and Cybernetics, Dalian, pp. 2592–2597 (2006)
Sonawane, S.S: Graph based information retrieval. IJACKD J. Res. 3(1) (2014)
Sonawane, S.S., Kulkarni, P.A.: Graph based representation and analysis of text document: a survey of techniques. Int. J. Comput. Appl. 96(19) (2014)
Ramesh, A., Srinivasa, K.G,, Pramod, N.: SentenceRank—a graph based approach to summarize text. In: Fifth International Conference on Applications of Digital Information and Web Technologies (ICADIWT), pp. 177–182 (2014)
Wei, Y.: Document summarization method based on heterogeneous graph. In: 9th IEEE International Conference on Fuzzy Systems and Knowledge Discovery, pp. 1285–1289 (2012)
Lin, Y.-S., Jiang, J.-Y., Lee, S.J.: A similarity measure for text classification and clustering. IEEE Trans. Knowl. Data Eng. 26(7), 1575–1590 (2014)
Ren, Pengjie, Zhumin Chen, Zhaochun Ren, Furu Wei, Jun Ma, and Maarten de Rijke. Leveraging contextual sentence relations for extractive summarization using a neural attention model. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 95–104 (2017).
Bhakkad, A., Dharamadhikari, S.C., Kulkarni, P.: Efficient approach to find bigram frequency in text document using E-VSM. Int. J. Comput. Appl. 68 (19), 9–11 (2013)
Erkan, G., Ramdev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 457–479 (2004)
Amati, G., Van Rijsbergen, C.J.: Probabilistic Models of Information Retrieval Based on Measuring the Divergence from Randomness. ACM Trans. Inf. Syst. 20, 357–389 (2002)
Berberich, K., Bedathur, S., Weikum, G., Vazirgiannis, M.: Comparing Apples and oranges: normalized PageRank for evolving graphs. In: Proceedings of the 16th International Conference on World Wide Web, 1145–1146 (2007)
Dubey, H., Roy, B.N.: An improved page rank algorithm based on optimized normalization technique,. Int. J. Comput. Sci. Inf. Technol. 2(5), 2183-2188 (2011)
Over, P., Liggett, W.: Introduction to DUC: an intrinsic evaluation of generic news text summarization systems. In: Proceedings of DUC Workshop Text Summarization (2002)
Lin, C.Y.: ROUGH: a package for automatic evaluation of summaries. In: Proceedings of the Workshop on Text Summarization Branches Out, (2004)
Steinberger, J., Jezek, K.: Using latent semantic analysis in text summarization and summary evaluation. In: Proceedings of ISIM, 93–100 (2014)
Mihalcea, R., Tarau, P.: Textrank: bringing order into texts. In: Proceedings of EMNLP, pp. 404–411 (2004)
Goyal, P., Behera, L., & McGinnity, T. M. A context-based word indexing model for document summarization. IEEE Transactions on Knowledge and Data Engineering, 25(8), 1693–1705 (2013)
Sonawane, S.: Extractivd Summarization dataset. Mendeley Data 1 (2018). http://dx.doi.org/10.17632/z59vy3rb2r.1
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sonawane, S., Ghotkar, A., Hinge, S. (2019). Context-Based Multi-document Summarization. In: Mandal, J., Sinha, D., Bandopadhyay, J. (eds) Contemporary Advances in Innovative and Applicable Information Technology. Advances in Intelligent Systems and Computing, vol 812. Springer, Singapore. https://doi.org/10.1007/978-981-13-1540-4_16
Download citation
DOI: https://doi.org/10.1007/978-981-13-1540-4_16
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1539-8
Online ISBN: 978-981-13-1540-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)