Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Abstractive Summarization: A Hybrid Approach for the Compression of Semantic Graphs

Published: 01 April 2016 Publication History

Abstract

Customization of information from web documents is an immense job that involves mainly the shortening of original texts. This task is carried out using summarization techniques. In general, an automatically generated summary is of two types-extractive and abstractive. Extractive methods use surface level and statistical features for the selection of important sentences, without considering the meaning conveyed by those sentences. In contrast, abstractive methods need a formal semantic representation, where the selection of important components and the rephrasing of the selected components are carried out using the semantic features associated with the words as well as the context. Furthermore, a deep linguistic analysis is needed for generating summaries. However, the bottleneck behind abstractive summarization is that it requires semantic representation, inference rules and natural language generation. In this paper, The authors propose a semi-supervised bootstrapping approach for the identification of important components for abstractive summarization. The input to the proposed approach is a fully connected semantic graph of a document, where the semantic graphs are constructed for sentences, which are then connected by synonym concepts and co-referring entities to form a complete semantic graph. The direction of the traversal of nodes is determined by a modified spreading activation algorithm, where the importance of the nodes and edges are decided, based on the node and its connected edges under consideration. Summary obtained using the proposed approach is compared with extractive and template based summaries, and also evaluated using ROUGE scores.

References

[1]
Anderson, J. R. 1983. A spreading activation theory of memory. Journal of Verbal Learning and Verbal Behavior, 223, 261-295.
[2]
Balaji, J., & Geetha, T. V. 2011. Morpho-Semantic Features for Rule-based Tamil Enconversion. International Journal of Computers and Applications, 266, 11-18.
[3]
Balaji, J., Geetha, T.V., & Ranjani Parthasarathi. 2012. Two-Stage Bootstrapping for Anaphora Resolution. Proceedings of the24th International Conference on Computational Linguistics COLING 2012 pp. 507-516.
[4]
Balaji, J., Geetha, T.V., & Ranjani Parthasarathi. 2012. Semantic Parsing of Tamil Sentences. Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages MTPIL at the24th International Conference on Computational Linguistics COLING 2012 pp. 15-22.
[5]
Balaji J., Geetha T.V., & Ranjani Parthasarathi. 2013. A Graph Based Query Focused Multi-Document Summarization. International Journal of Intelligent Information Technologies.
[6]
Balaji J., Geetha T.V., & Ranjani Parthasarathi. 2013. Graph based Bootstrapping for Coreference Resolution. Journal of Intelligent Systems.
[7]
Balaji, J., & Geetha, T.V., & Ranjani Parthasarathi. 2014. Semi-Supervised Learning of UNL Semantic Relations of a Morphologically Rich Language.
[8]
Baldwin, B., & Morton, T.S. 1998. Dynamic coreference-based summarization. Proceedings of the Third Conference on Empirical Methods in Natural Language Processing EMNLP-3.
[9]
Barzilay, R., McKeown, K.R., & Elhadad, M. 1999. Information fusion in the context of multi-document summarization. Proc. 37th ACL pp. 550-557.
[10]
Bergler, S., Witte, R., Khalife, M., Li, Z., & Rudzicz, F. 2003, May-June. Using knowledge-poor coreference resolution for text summarization. Proceedings of DUC, Workshop on Text Summarization pp. 85-92.
[11]
Canhasi, E., & Kononenko, I. 2011. Semantic Role Frames Graph-based Multi-document Summarization, Faculty of computer and information science. University of Ljubljana.
[12]
Chali, Y., & Joty, S.R. 2008. Unsupervised approach for selecting sentences in query based summarization. Proceedings of theFLAIRS Conference pp. 47-52.
[13]
Crestani, F. 1997. Application of spreading activation techniques in information retrieval. Artificial Intelligence Review, 116, 453-482.
[14]
DangH. T.OwczarzakK. 2009. Overview of the TAC 2009 Summarization Track. Proceedings of the Second Text Analysis Conference, Gaithersburg, Maryland, USA.
[15]
Erkan, G., & Radev, D. R. 2004. Lexrank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research, 221, 457-479.
[16]
FIRE. Forum for information retrieval evaluation. 2010. Retrieved from www.isical.ac.in/~fire/working-notes.html
[17]
Freitas, A., Oliveira, J. G., Curry, E., O'Riain, S., & Silva, J. P. 2011. Treo: Combining Entity-Search, Spreading Activation and Semantic Relatedness for Querying Linked Data. Proceedings of the 1st Workshop on Question Answering Over Linked Data QALD-1.
[18]
Gupta, V., & Gurpreet Singh Lehal. 2010. A Survey of text summarization of extractive techniques. University institute of engineering and Technology, Computer Science & Engineering, Punjab University, Chandigarh, India.
[19]
Haghighi, A., & Vanderwende, L. 2009. Exploring content models for multi-document summarization. Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics NAACL '09 pp. 362-370. 10.3115/1620754.1620807
[20]
Hahn, U., & Mani, I. 2000, The Challenges of Automatic Summarization. Computer, 3311, 29-36.
[21]
HendrickxI.BosmaW. 2008, Using coreference links and sentence compression in graph-based summarization. Proceedings of the Text Analysis Conference TAC.
[22]
Khan, A., & Naomie Salim. 2014. A Review on Abstractive Summarization Methods.
[23]
LinC. Y. 2004, ROUGE: A Package for Automatic Evaluation of Summaries. Proceedings of Workshop on Text Summarization Branches Out, Post-Conference Workshop of ACL '04, Barcelona, Spain.
[24]
Mani, I., & Maybury, T. M. 1999. Advances in Automatic Text Summarization. MA, USA: MIT Press Cambridge.
[25]
Mann, W., & Thompson, S. 1988. Rhetorical structure theory. Toward a functional theory of text organization. Text, 83, 243-281.
[26]
Martins, C. B., & Rino, L. H. M. 2002, Revisiting UNLSumm Improvement through a case study. Proceedings of theWorkshop on Multilingual Information Access and Natural Language Processing, IBERAMIA '02.
[27]
MihalceaR.TarauP. 2004. TextRank: Bringing Order into Texts. Proceedings of the Conference on Empirical Methods in Natural Language Processing EMNLP 2004, Barcelona, Spain.
[28]
MohamedA.SanguthevarR. 2006. Query-based summarization based on document graphs. Proceedings of the Document Understanding Conference DUC '06.
[29]
Nastase, V. 2008. Topic-driven multi-document summarization with encyclopedic knowledge and spreading activation. Proceedings of the Conference on Empirical Methods in Natural Language Processing EMNLP '08 pp. 763-772.
[30]
Nenkova, A. 2005. Automatic text summarization of newswire: Lessons learned from the document understanding conference. Proceedings of the 20th National Conference on Artificial IntelligenceAAAI '05 Vol. 3, pp. 1436-1441. AAAI Press.
[31]
Nenkova, A., & Vanderwende, L. 2005. The impact of frequency on summarization Tech. Rep. MSR-TR-2005-101. Microsoft Research, Redmond, Washington.
[32]
Quillian, M. R. 1967. Word Concepts: A Theory and Simulation of Some Basic Semantic Capabilities. Behavioral Science, 125, 410-430. 6059773.
[33]
Radev, D. R., Jing, H., Stys, M., & Tam, D. 2004. Centroid-based summarization of multiple documents. Information Processing & Management, 406, 919-938.
[34]
RosnerM.CamilleriC. 2008, Multisum: query-based multi-document summarization. Proceedings of the Workshop on Multi-source Multilingual Information Extraction and Summarization, MMIES '08, Stroudsburg, PA, USA pp. 25-32. 10.3115/1613172.1613180
[35]
Sornlertlamvanich, V., Potipiti, T., & Charoenporn, T. 2001. UNL Document Summarization. Proceedings of the First International Workshop on MultiMedia Annotation, Tokyo, Japan.
[36]
Steinberger, J., Poesio, M., Kabadjov, M. A., & Jeek, K. 2007. Two uses of anaphora resolution in summarization. Information Processing & Management, 436, 1663-1680.
[37]
Subalalitha, C. N., Umamaheswari, E., Geetha, T. V., Ranjani, P., & Karky, M., 2011, Template based multilingual summary generation.
[38]
Suchal, J. 2008. On Finding Power Method in Spreading Activation Search, SOFSEM 2 pp. 124-130. Kosice, Slovakia: Safarik University.
[39]
Thiel, K., & Berthold, M. R. 2012. Node Similarities from Spreading Activation pp. 246-262.
[40]
Troussov, A., Levner, E., Bogdan, C., Judge, J., & Botvich, D. 2009. Spreading Activation Methods. In Shawkat, A., & Xiang, Y. Eds., Dynamic and Advanced Data Mining for Progressing Technological Development. USA: IGI Global.
[41]
UNDL. 2010, Universal networking language unl knowledge base UNL KB. Retrieved from http://www.unlweb.net/wiki/UNL_Knowledge_Base
[42]
UNDL. 2011 Universal networking language unl. Retrieved from http://www.undl.org/unlsys/unl/unl2005

Cited By

View all
  1. Abstractive Summarization: A Hybrid Approach for the Compression of Semantic Graphs

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image International Journal on Semantic Web & Information Systems
        International Journal on Semantic Web & Information Systems  Volume 12, Issue 2
        April 2016
        122 pages
        ISSN:1552-6283
        EISSN:1552-6291
        Issue’s Table of Contents

        Publisher

        IGI Global

        United States

        Publication History

        Published: 01 April 2016

        Author Tags

        1. Semantic Graphs
        2. Semantic Relations
        3. Summarization
        4. Universal Networking Language UNL

        Qualifiers

        • Article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 08 Mar 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        View options

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media