research-article

Generating Persuasive Visual Storylines for Promotional Videos

Authors:

Changgong Zhang,

Chunyan MiaoAuthors Info & Claims

CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management

Pages 901 - 910

https://doi.org/10.1145/3357384.3357906

Published: 03 November 2019 Publication History

Abstract

Video contents have become a critical tool for promoting products in E-commerce. However, the lack of automatic promotional video generation solutions makes large-scale video-based promotion campaigns infeasible. The first step of automatically producing promotional videos is to generate visual storylines, which is to select the building block footage and place them in an appropriate order. This task is related to the subjective viewing experience. It is hitherto performed by human experts and thus, hard to scale. To address this problem, we propose WundtBackpack, an algorithmic approach to generate storylines based on available visual materials, which can be video clips or images. It consists of two main parts, 1) the Learnable Wundt Curve to evaluate the perceived persuasiveness based on the stimulus intensity of a sequence of visual materials, which only requires a small volume of data to train; and 2) a clustering-based backpacking algorithm to generate persuasive sequences of visual materials while considering video length constraints. In this way, the proposed approach provides a dynamic structure to empower artificial intelligence (AI) to organize video footage in order to construct a sequence of visual stimuli with persuasive power. Extensive real-world experiments show that our approach achieves close to 10% higher perceived persuasiveness scores by human testers, and 12.5% higher expected revenue compared to the best performing state-of-the-art approach.

References

[1]

Harsh Agrawal, Arjun Chandrasekaran, Dhruv Batra, Devi Parikh, and Mohit Bansal. 2016. Sort Story: Sorting Jumbled Images and Captions into Stories. In EMNLP . 925--931.

[2]

J Scott Armstrong. 2010. Persuasive advertising: Evidence-based principles .Palgrave Macmillan.

[3]

Daniel E Berlyne. 1960. Conflict, arousal, and curiosity. McGraw-Hill Book Company.

[4]

Larry Cahill and James L McGaugh. 1995. A novel demonstration of enhanced memory associated with emotional arousal. Consciousness and cognition, Vol. 4, 4 (1995), 410--421.

[5]

Nick Cawthon and Andrew Vande Moere. 2006. A conceptual model for evaluating aesthetic effect within the user experience of information visualization. In IV. 374--382.

[6]

Chih-Chung Chang and Chih-Jen Lin. 2011. LIBSVM: a library for support vector machines. ACM transactions on intelligent systems and technology (TIST), Vol. 2, 3 (2011), 27.

Digital Library

[7]

Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In EMNLP. 1724--1734.

[8]

Jinsoo Choi, Tae-Hyun Oh, and In So Kweon. 2016. Video-story composition via plot analysis. In CVPR. 3122--3130.

[9]

Xiang Deng, Chaoran Cui, Huidi Fang, Xiushan Nie, and Yilong Yin. 2017. Personalized image aesthetics assessment. In CIKM. 2043--2046.

[10]

Yi Dong, Chang Liu, Zhiqi Shen, Yu Han, Zhanning Gao, Pan Wang, Changgong Zhang, Peiran Ren, and Xuanzong Xie. 2019. Personalized Video Summarization with Idiom Adaptation. In ACM MM .

[11]

Elizabeth C Hirschman. 1980. Innovativeness, novelty seeking, and consumer creativity. Journal of Consumer Research, Vol. 7, 3 (1980), 283--295.

[12]

Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation, Vol. 9, 8 (1997), 1735--1780.

[13]

Andrew G Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017).

[14]

X-S Hua, Lie Lu, and H-J Zhang. 2006. Photo2Video-A system for automatically converting photographic series into video. IEEE Transactions on circuits and systems for video technology, Vol. 16, 7 (2006), 803--819.

Digital Library

[15]

Hye-Rin Kim, Yeong-Seok Kim, Seon Joo Kim, and In-Kwon Lee. 2018. Building emotional machines: Recognizing image emotions through deep neural networks. IEEE Transactions on Multimedia, Vol. 20, 11 (2018), 2980--2992.

Digital Library

[16]

Alex Krizhevsky. 2014. One weird trick for parallelizing convolutional neural networks. arXiv preprint arXiv:1404.5997 (2014).

[17]

Kenneth C. Laudon and Carol Guercio Traver. 2017. E-commerce : business, technology, society. Upper Saddle River : Pearson, [2017].

[18]

Mark R Lepper and David Greene. 2015. The hidden costs of reward: New perspectives on the psychology of human motivation .Psychology Press.

[19]

James MacQueen et almbox. 1967. Some methods for classification and analysis of multivariate observations. In The Fifth Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1. 281--297.

[20]

Kathryn Merrick and Elanor Huntington. 2008. Attention focus in curious, reconfigurable robots. In ACRA .

[21]

Daniel James O'Keefe. 2016. Persuasion: Theory and Research. (2016).

[22]

Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic differentiation in PyTorch. In NIPS-W .

[23]

James A Russell. 1980. A circumplex model of affect. Journal of Personality and Social Psychology, Vol. 39, 6 (1980), 1161.

[24]

Rob Saunders. 2002. Curious Design Agents and Artificial Creativity-A Synthetic Approach to the Study of Creative Behaviour. PhD Thesis (2002).

[25]

Katharina Schwarz, Patrick Wieschollek, and Hendrik PA Lensch. 2018. Will people like your image? learning the aesthetic space. In WACV. 2048--2057.

[26]

Gunnar A Sigurdsson, Xinlei Chen, and Abhinav Gupta. 2016. Learning visual storylines with skipping recurrent neural networks. In ECCV . 71--88.

[27]

Hossein Talebi and Peyman Milanfar. 2018. NIMA: Neural image assessment. IEEE Transactions on Image Processing, Vol. 27, 8 (2018), 3998--4011.

[28]

Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, and Manohar Paluri. 2015. Learning spatiotemporal features with 3d convolutional networks. In ICCV . 4489--4497.

[29]

Dingding Wang, Tao Li, and Mitsunori Ogihara. 2012. Generating pictorial storylines via minimum-weight connected dominating set approximation in multi-view graphs. In AAAI. 1006--1013.

[30]

Guolong Wang, Junchi Yan, and Zheng Qin. 2018. Collaborative and Attentive Learning for Personalized Image Aesthetic Assessment. In IJCAI. 957--963.

[31]

Zhou Wang, Alan C Bovik, Hamid R Sheikh, Eero P Simoncelli, et almbox. 2004. Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing, Vol. 13, 4 (2004), 600--612.

Digital Library

[32]

Qiong Wu and Chunyan Miao. 2013. Curiosity: From psychology to computation. Comput. Surveys, Vol. 46, 2 (2013), 18.

Digital Library

[33]

Yue Wu, Xu Shen, Tao Mei, Xinmei Tian, Nenghai Yu, and Yong Rui. 2016. Monet: A system for reliving your memories by theme-based photo storytelling. IEEE Transactions on Multimedia, Vol. 18, 11 (2016), 2206--2216.

Digital Library

[34]

Han Yu, Zhiqi Shen, Chunyan Miao, Cyril Leung, Victor R. Lesser, and Qiang Yang. 2018. Building Ethics into Artificial Intelligence. In IJCAI. 5527--5533.

[35]

Guangyu Zhong, Yi-Hsuan Tsai, Sifei Liu, Zhixun Su, and Ming-Hsuan Yang. 2018. Learning Video-Story Composition via Recurrent Neural Network. In WACV . 1727--1735.

Cited By

Zhang JYu H(2023)EID: Facilitating Explainable AI Design Discussions in Team-Based SettingsInternational Journal of Crowd Science10.26599/IJCS.2022.91000347:2(47-54)Online publication date: Jun-2023
https://doi.org/10.26599/IJCS.2022.9100034
Zeng AYu HDa QZhan YYu YZhou JMiao C(2021)Improving search engine efficiency through contextual factor selectionAI Magazine10.1609/aimag.v42i2.1509942:2(50-58)Online publication date: 1-Jun-2021
https://dl.acm.org/doi/10.1609/aimag.v42i2.15099
Roy D(2021)Media Design and Technical Writing with Industry 4.0 Towards Developing Entrepreneurial Thinking in EFL Learners: A Pilot Study2021 9th International Conference on Information and Education Technology (ICIET)10.1109/ICIET51873.2021.9419630(98-109)Online publication date: 27-Mar-2021
https://doi.org/10.1109/ICIET51873.2021.9419630
Show More Cited By

Index Terms

Generating Persuasive Visual Storylines for Promotional Videos
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
2. Information systems
  1. World Wide Web
    1. Online advertising
      1. Display advertising
    2. Web applications
      1. Electronic commerce
        Online shopping

Recommendations

Selection of a best metric and evaluation of bottom-up visual saliency models

There are many ''machine vision'' models of the visual saliency mechanism, which controls the process of selecting and allocating attention to the most ''prominent'' locations in the scene and helps humans interact with the visual environment ...
Multi-modal co-attention relation networks for visual question answering
Abstract
The current mainstream visual question answering (VQA) models only model the object-level visual representations but ignore the relationships between visual objects. To solve this problem, we propose a Multi-Modal Co-Attention Relation Network (...
Fusion of visual salience maps for object acquisition

The paradigm of visual attention has been widely investigated and applied to many computer vision applications. In this study, the authors propose a new saliency‐based visual attention algorithm applied to object acquisition. The proposed algorithm ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management

November 2019

3373 pages

ISBN:9781450369763

DOI:10.1145/3357384

General Chairs:
Wenwu Zhu
Tsinghua University, China
,
Dacheng Tao
University of Massachusetts, USA
,
Xueqi Cheng
Institute of Computing Technology, CAS, China
,
Program Chairs:
Peng Cui
Tsinghua University, China
,
Elke Rundensteiner
Worcester Polytechnic Institute, USA
,
David Carmel
Amazon Research, USA
,
Qi He
LinkedIn, USA
,
Jeffrey Xu Yu
Chinese University of Hong Kong, China

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

CIKM '19

Sponsor:

CIKM '19: The 28th ACM International Conference on Information and Knowledge Management

November 3 - 7, 2019

Beijing, China

Acceptance Rates

CIKM '19 Paper Acceptance Rate 202 of 1,031 submissions, 20%;

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
350
Total Downloads

Downloads (Last 12 months)54
Downloads (Last 6 weeks)5

Reflects downloads up to 31 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Zhang JYu H(2023)EID: Facilitating Explainable AI Design Discussions in Team-Based SettingsInternational Journal of Crowd Science10.26599/IJCS.2022.91000347:2(47-54)Online publication date: Jun-2023
https://doi.org/10.26599/IJCS.2022.9100034
Zeng AYu HDa QZhan YYu YZhou JMiao C(2021)Improving search engine efficiency through contextual factor selectionAI Magazine10.1609/aimag.v42i2.1509942:2(50-58)Online publication date: 1-Jun-2021
https://dl.acm.org/doi/10.1609/aimag.v42i2.15099
Roy D(2021)Media Design and Technical Writing with Industry 4.0 Towards Developing Entrepreneurial Thinking in EFL Learners: A Pilot Study2021 9th International Conference on Information and Education Technology (ICIET)10.1109/ICIET51873.2021.9419630(98-109)Online publication date: 27-Mar-2021
https://doi.org/10.1109/ICIET51873.2021.9419630
Dong YLiu CShen ZGao ZWang PZhang CRen PXie XYu HHuang Q(2019)Domain Specific and Idiom Adaptive Video SummarizationProceedings of the ACM Multimedia Asia10.1145/3338533.3366603(1-6)Online publication date: 15-Dec-2019
https://dl.acm.org/doi/10.1145/3338533.3366603

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten