Context-Based Multi-document Summarization

  • Conference paper
Contemporary Advances in Innovative and Applicable Information Technology

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 812)

Abstract

Automatic text summarization is a leading topic of information retrieval research because of the increasing online transfer of information. Handling this large volume of information is limited by the constraints of memory devices and access time. Existing summarization systems use the sentence extraction technique, in which the important sentences are extracted and presented as the summary. Many of these summarization methods do not take context into consideration. The proposed system focuses on multi-document summarization based on a context score. A Bernoulli model of randomness is used to provide an informative score for bi-gram terms based on lexical association. The resulting weight is then used in a graph-based iterative algorithm to generate a summary. Experiments were conducted on a self-generated data set of 100 documents and on the benchmark DUC data sets. The results show that the proposed system outperforms the existing methods.
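
The abstract gives no formulas, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a divergence-from-randomness style Bernoulli score (the information content of seeing a bi-gram `tf` times in one sentence when its occurrences are scattered uniformly over all sentences), and it assumes a PageRank-style iteration over a sentence graph whose edge weights sum the scores of shared bi-grams. The function names, the whitespace tokenization, the edge-weight definition, and the damping factor of 0.85 are all illustrative assumptions.

```python
import math
from collections import Counter
from itertools import combinations


def bigrams(tokens):
    """Return the adjacent word pairs of a token list."""
    return list(zip(tokens, tokens[1:]))


def bernoulli_information(tf, total, n_units):
    """Information content (in bits) of a term occurring tf times in one unit,
    assuming its `total` occurrences fall independently and uniformly
    over `n_units` units (binomial / Bernoulli model)."""
    p = 1.0 / n_units
    prob = math.comb(total, tf) * p ** tf * (1.0 - p) ** (total - tf)
    return -math.log2(prob)


def rank_sentences(sentences, damping=0.85, iterations=50):
    """Score sentences with a weighted, PageRank-style iteration.
    Assumed edge weight: sum of Bernoulli scores of the shared bi-grams."""
    n = len(sentences)
    if n == 0:
        return []
    counts = [Counter(bigrams(s.lower().split())) for s in sentences]
    totals = Counter()
    for c in counts:
        totals.update(c)
    # Bernoulli score of every bi-gram, per sentence.
    weights = [
        {b: bernoulli_information(tf, totals[b], n) for b, tf in c.items()}
        for c in counts
    ]
    # Symmetric edge weights between sentences that share bi-grams.
    edge = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        shared = weights[i].keys() & weights[j].keys()
        w = sum(weights[i][b] + weights[j][b] for b in shared)
        edge[i][j] = edge[j][i] = w
    out = [sum(row) for row in edge]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        # Isolated sentences (out == 0) pass on no score in this sketch.
        scores = [
            (1.0 - damping) / n
            + damping * sum(
                edge[j][i] / out[j] * scores[j] for j in range(n) if out[j] > 0
            )
            for i in range(n)
        ]
    return scores


def summarize(sentences, k):
    """Return the k highest-scoring sentences in their original order."""
    scores = rank_sentences(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

For a multi-document setting, one would pool the sentences of all input documents into a single list and call `summarize(sentences, k)`. A sentence that shares no bi-gram with any other receives only the baseline score and is therefore unlikely to be selected.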



Author information

Corresponding author

Correspondence to Sheetal Sonawane.

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Sonawane, S., Ghotkar, A., Hinge, S. (2019). Context-Based Multi-document Summarization. In: Mandal, J., Sinha, D., Bandopadhyay, J. (eds) Contemporary Advances in Innovative and Applicable Information Technology. Advances in Intelligent Systems and Computing, vol 812. Springer, Singapore. https://doi.org/10.1007/978-981-13-1540-4_16
