DOI: 10.1145/3397271.3401232

Multi-Modal Summary Generation using Multi-Objective Optimization

Published: 25 July 2020

Abstract

Significant developments in communication technology over the past few years have motivated research on multi-modal summarization techniques. Most previous work on multi-modal summarization focuses on text and images. In this paper, we propose a novel extractive model, based on multi-objective optimization, that produces a multi-modal summary containing text, images, and videos. Important objectives such as intra-modality salience, cross-modal redundancy, and cross-modal similarity are optimized simultaneously in a multi-objective optimization framework to produce effective multi-modal output. The proposed model has been evaluated separately for each modality and has been found to perform better than state-of-the-art approaches.
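To make "optimized simultaneously" concrete, the following is a minimal sketch of Pareto dominance over candidate summaries, assuming each candidate has already been scored on three objectives. The score values, the sign convention (redundancy is negated so that every objective is maximized), and the function names `dominates` and `pareto_front` are illustrative assumptions; they are not taken from the paper.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if `a` is no worse than `b` on every objective and
    strictly better on at least one (all objectives are maximized)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(scores: List[Sequence[float]]) -> List[int]:
    """Return the indices of candidates that no other candidate dominates."""
    return [
        i
        for i, s in enumerate(scores)
        if not any(dominates(t, s) for j, t in enumerate(scores) if j != i)
    ]


# Hypothetical objective vectors, one per candidate summary:
# (intra-modality salience, -cross-modal redundancy, cross-modal similarity).
candidates = [
    (0.80, -0.30, 0.60),
    (0.70, -0.10, 0.65),
    (0.60, -0.40, 0.50),
    (0.80, -0.30, 0.55),
]

front = pareto_front(candidates)
print(front)  # indices of the non-dominated candidates
```

A multi-objective optimizer returns a set of such trade-off solutions rather than a single best one; how the paper selects a final summary from that set is not stated in the abstract.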

Supplementary Material

MP4 File (3397271.3401232.mp4)
In this video we describe a novel multi-modal, multi-objective-optimization-based summarization framework that uses differential evolution as its underlying strategy. We also discuss the benefits of multi-modal summarization, report some empirical observations, and present a generated summary.
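For readers unfamiliar with differential evolution, here is a minimal sketch of the classic single-objective DE/rand/1/bin scheme on a toy function. It is a generic illustration, not the paper's multi-objective variant: the `sphere` fitness function, the search bounds, and the parameter values (`f`, `cr`, `pop_size`, `generations`) are all assumptions chosen for the example.

```python
import random
from typing import Callable, List, Tuple


def differential_evolution(
    fitness: Callable[[List[float]], float],
    dim: int,
    pop_size: int = 20,
    f: float = 0.5,      # differential weight (assumed value)
    cr: float = 0.9,     # crossover probability (assumed value)
    generations: int = 100,
    seed: int = 0,
) -> Tuple[List[float], float]:
    """Minimize `fitness` with a basic DE/rand/1/bin loop."""
    rng = random.Random(seed)
    pop = [[rng.uniform(-5.0, 5.0) for _ in range(dim)] for _ in range(pop_size)]
    scores = [fitness(x) for x in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Pick three distinct individuals, all different from the target i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
            trial = [
                pop[a][k] + f * (pop[b][k] - pop[c][k])
                if (rng.random() < cr or k == j_rand)
                else pop[i][k]
                for k in range(dim)
            ]
            trial_score = fitness(trial)
            # Greedy selection: keep the trial only if it is no worse.
            if trial_score <= scores[i]:
                pop[i], scores[i] = trial, trial_score
    best = min(range(pop_size), key=scores.__getitem__)
    return pop[best], scores[best]


def sphere(x: List[float]) -> float:
    """Toy objective: sum of squares, minimized at the origin."""
    return sum(v * v for v in x)


best_x, best_score = differential_evolution(sphere, dim=3)
print(best_x, best_score)
```

In a multi-objective setting, the greedy comparison on a single score would be replaced by a dominance-based selection over several objective values.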



Published In

SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2020
2548 pages
ISBN:9781450380164
DOI:10.1145/3397271
© 2020 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. differential evolution
  2. multi-modal summarization
  3. multi-objective optimization

Qualifiers

  • Short-paper

Funding Sources

  • Visvesvaraya PhD scheme for Electronics and IT, Ministry of Electronics and Information Technology (MeitY), Government of India

Conference

SIGIR '20
Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

