research-article

Retrieval-Augmented Hypergraph for Multimodal Social Media Popularity Prediction

Authors:

Zhangtao Cheng,

Goce Trajcevski,

Fan ZhouAuthors Info & Claims

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Pages 445 - 455

https://doi.org/10.1145/3637528.3672041

Published: 24 August 2024 Publication History

Abstract

Accurately predicting the popularity of multimodal user-generated content (UGC) is fundamental for many real-world applications such as online advertising and recommendation. Existing approaches generally focus on limited contextual information within individual UGCs, yet overlook the potential benefit of exploiting meaningful knowledge in relevant UGCs. In this work, we propose RAGTrans, an aspect-aware retrieval-augmented multi-modal hypergraph transformer that retrieves pertinent knowledge from a multi-modal memory bank and enhances UGC representations via neighborhood knowledge aggregation on multi-model hypergraphs. In particular, we initially retrieve relevant multimedia instances from a large corpus of UGCs via the aspect information and construct a knowledge-enhanced hypergraph based on retrieved relevant instances. This allows capturing meaningful contextual information across the data. We then design a novel bootstrapping hypergraph transformer on multimodal hypergraphs to strengthen UGC representations across modalities via customizing a propagation algorithm to effectively diffuse information across nodes and edges. Additionally, we propose a user-aware attention-based fusion module to comprise the enriched UGC representations for popularity prediction. Extensive experiments on real-world social media datasets demonstrate that RAGTrans outperforms state-of-the-art popularity prediction models across settings.

Supplemental Material

MP4 File

Retrieval-Augmented Hypergraph for Multimodal Social Media Popularity Prediction

Download
14.84 MB

References

[1]

Alessia Antelmi, Gennaro Cordasco, Mirko Polato, Vittorio Scarano, Carmine Spagnuolo, and Dingqi Yang. 2023. A Survey on Hypergraph Representation Learning. Comput. Surveys (2023).

[2]

Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E Hinton. 2016. Layer normalization. arXiv preprint arXiv:1607.06450 (2016).

[3]

DavidMBlei, AndrewY Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993--1022.

Digital Library

[4]

Alain Bretto. 2013. Hypergraph theory. An introduction. Mathematical Engineering. Cham: Springer (2013).

[5]

Ethem F Can, Hüseyin Oktay, and R Manmatha. 2013. Predicting retweet count using visual cues. In International Conference on Information and Knowledge Management (CIKM). 1481--1484.

Digital Library

[6]

Spencer Cappallo, Thomas Mensink, and Cees GM Snoek. 2015. Latent factors of visual popularity prediction. In ACM International Conference on Multimedia (MM). 195--202.

Digital Library

[7]

Jingyuan Chen, Xuemeng Song, Liqiang Nie, Xiang Wang, Hanwang Zhang, and Tat-Seng Chua. 2016. Micro tells macro: Predicting the popularity of micro-videos via a transductive model. In ACM International Conference on Multimedia (MM). 898--907.

Digital Library

[8]

Justin Cheng, Lada Adamic, P Alex Dow, Jon Michael Kleinberg, and Jure Leskovec. 2014. Can cascades be predicted?. In International Conference on World Wide Web (WWW). 925--936.

Digital Library

[9]

Zhangtao Cheng, JoojoWalker, Ting Zhong, and Fan Zhou. 2022. Modeling multiview interactions with contrastive graph learning for collaborative filtering. In 2022 International Joint Conference on Neural Networks (IJCNN). IEEE, 1--8.

[10]

Zhangtao Cheng, Wenxue Ye, Leyuan Liu, Wenxin Tai, and Fan Zhou. 2023. Enhancing Information Diffusion Prediction with Self-Supervised Disentangled User and Cascade Representations. In International Conference on Information and Knowledge Management (CIKM). 3808--3812.

[11]

Tsun-hin Cheung and Kin-man Lam. 2022. Crossmodal bipolar attention for multimodal classification on social media. Neurocomputing 514 (2022), 1--12.

Digital Library

[12]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In North American Chapter of the Association for Computational Linguistics (NAACL). 4171--4186.

[13]

Wenqi Fan, Yao Ma, Qing Li, Yuan He, Eric Zhao, Jiliang Tang, and Dawei Yin. 2019. Graph neural networks for social recommendation. In The world wide web conference (WWW). 417--426.

[14]

Yifan Feng, Haoxuan You, Zizhao Zhang, Rongrong Ji, and Yue Gao. 2019. Hypergraph neural networks. In AAAI Conference on Artificial Intelligence (AAAI), Vol. 33. 3558--3565.

Digital Library

[15]

Francesco Gelli, Tiberio Uricchio, Marco Bertini, Alberto Del Bimbo, and Shih-Fu Chang. 2015. Image popularity prediction in social media using sentiment and context features. In ACM International Conference on Multimedia (MM). 907--910.

Digital Library

[16]

Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. 2021. Transformer Feed-Forward Layers Are Key-Value Memories. In Conference on Empirical Methods in Natural Language Processing (EMNLP). 5484--5495.

[17]

Kai Han, Yunhe Wang, Hanting Chen, Xinghao Chen, Jianyuan Guo, Zhenhua Liu, Yehui Tang, An Xiao, Chunjing Xu, Yixing Xu, et al. 2022. A survey on vision transformer. IEEE transactions on pattern analysis and machine intelligence 45, 1 (2022), 87--110.

[18]

John A Hartigan, Manchek A Wong, et al. 1979. A k-means clustering algorithm. Applied statistics 28, 1 (1979), 100--108.

[19]

Junxian He, Chunting Zhou, Xuezhe Ma, Taylor Berg-Kirkpatrick, and Graham Neubig. 2022. Towards a Unified View of Parameter-Efficient Transfer Learning. In International Conference on Learning Representations (ICLR).

[20]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Computer Vision and Pattern Recognition (CVPR). 770--778.

[21]

Nicola J Hodges, A Mark Williams, Spencer J Hayes, and Gavin Breslin. 2007. What is modelled during observational learning? Journal of sports sciences 25, 5 (2007), 531--545.

[22]

Chih-Chung Hsu, Chia-Ming Lee, Xiu-Yu Hou, and Chi-Han Tsai. 2023. Gradient Boost Tree Network based on Extensive Feature Analysis for Popularity Prediction of Social Posts. In ACM International Conference on Multimedia (MM). 9451--9455.

Digital Library

[23]

Zheng Hu, Satoshi Nakagawa, Liang Luo, Yu Gu, and Fuji Ren. 2023. Celebrityaware Graph Contrastive Learning Framework for Social Recommendation. In International Conference on Information and Knowledge Management (CIKM). 793--802.

[24]

Liya Ji, Chan Ho Park, Zhefan Rao, and Qifeng Chen. 2023. Neural Image Popularity Assessment with Retrieval-augmented Transformer. In ACM International Conference on Multimedia (MM). 2427--2436.

[25]

Jianwen Jiang, Yuxuan Wei, Yifan Feng, Jingxuan Cao, and Yue Gao. 2019. Dynamic Hypergraph Neural Networks. In International Joint Conference on Artificial Intelligence (IJCAI). 2635--2641.

[26]

Guangyin Jin, Yuxuan Liang, Yuchen Fang, Zezhi Shao, Jincai Huang, Junbo Zhang, and Yu Zheng. 2023. Spatio-temporal graph neural networks for predictive learning in urban computing: A survey. IEEE Transactions on Knowledge and Data Engineering (TKDE) (2023).

[27]

Aditya Khosla, Atish Das Sarma, and Raffay Hamid. 2014. What makes an image popular?. In The Web Conference (WWW). 867--876.

Digital Library

[28]

Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations (ICLR).

[29]

Xin Lai, Yihong Zhang, and Wei Zhang. 2020. Hyfea: winning solution to social media popularity prediction for multimedia grand challenge 2020. In ACM International Conference on Multimedia (MM). 4565--4569.

Digital Library

[30]

Himabindu Lakkaraju and Jitendra Ajmera. 2011. Attention prediction on social media brand pages. In International Conference on Information and Knowledge Management (CIKM). 2157--2160.

Digital Library

[31]

Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, et al. 2020. Retrieval-augmented generation for knowledge-intensive nlp tasks. Advances in Neural Information Processing Systems (Neurips) 33 (2020), 9459--9474.

[32]

Liuwu Li, Runwei Situ, Junyan Gao, Zhenguo Yang, and Wenyin Liu. 2017. A hybrid model combining convolutional neural network with xgboost for predicting social media popularity. In ACM International Conference on Multimedia (MM). 1912--1917.

Digital Library

[33]

Xianming Li and Jing Li. 2023. Angle-optimized text embeddings. arXiv preprint arXiv:2309.12871 (2023).

[34]

Xiang Lisa Li and Percy Liang. 2021. Prefix-Tuning: Optimizing Continuous Prompts for Generation. In Conference on Empirical Methods in Natural Language Processing (EMNLP). 4582--4597.

[35]

Leyuan Liu, Junyi Chen, Zhangtao Cheng, Wenxin Tai, and Fan Zhou. 2023. Towards Trustworthy Rumor Detection with Interpretable Graph Structural Learning. In International Conference on Information and Knowledge Management (CIKM). 4089--4093.

[36]

Alexander Long, Wei Yin, Thalaiyasingam Ajanthan, Vu Nguyen, Pulak Purkait, Ravi Garg, Alan Blair, Chunhua Shen, and Anton van den Hengel. 2022. Retrieval augmented classification for long-tail visual recognition. In Computer Vision and Pattern Recognition (CVPR). 6959--6969.

[37]

Philip J McParlane, Yashar Moshfeghi, and Joemon M Jose. 2014. " Nobody comes here anymore, it's too crowded"; Predicting Image Popularity on Flickr. In ACM International Conference on Multimedia (MM). 385--391.

Digital Library

[38]

Alessandro Ortis, Giovanni Maria Farinella, and Sebastiano Battiato. 2019. Prediction of social image popularity dynamics. In Image Analysis and Processing (ICIAP). 572--582.

[39]

Nicolas Papernot and Patrick McDaniel. 2018. Deep k-nearest neighbors: Towards confident, interpretable and robust deep learning. arXiv preprint arXiv:1803.04765 (2018).

[40]

Henrique Pinto, Jussara M Almeida, and Marcos A Gonçalves. 2013. Using early view patterns to predict the popularity of youtube videos. In Web Search and Data Mining (WSDM). 365--374.

[41]

David Premack and Guy Woodruff. 1978. Does the chimpanzee have a theory of mind? Behavioral and brain sciences 1, 4 (1978), 515--526.

[42]

Yang Qian, Wang Xu, Xiao Liu, Haifeng Ling, Yuanchun Jiang, Yidong Chai, and Yezheng Liu. 2022. Popularity prediction for marketer-generated content: A text-guided attention neural network for multi-modal feature fusion. Information Processing & Management 59, 4 (2022), 102984.

Digital Library

[43]

Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. In International Conference on Machine Learning (ICML). 8748--8763.

[44]

Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In EMNLP. Association for Computational Linguistics (ACL), 3980--3990.

[45]

Stephen E Robertson, SteveWalker, Susan Jones, Micheline M Hancock-Beaulieu, Mike Gatford, et al. 1995. Okapi at TREC-3. Nist Special Publication Sp 109 (1995), 109.

[46]

Lifeng Sun, XiaoyanWang, ZhiWang, Hong Zhao, andWenwu Zhu. 2016. Socialaware video recommendation for online social groups. IEEE Transactions on Multimedia 19, 3 (2016), 609--618.

Digital Library

[47]

Xiangguo Sun, Hongzhi Yin, Bo Liu, Hongxu Chen, Jiuxin Cao, Yingxia Shao, and Nguyen Quoc Viet Hung. 2021. Heterogeneous hypergraph embedding for graph classification. In ACM International Conference on Web Search and Data Mining (WSDM). 725--733.

Digital Library

[48]

Gabor Szabo and Bernardo A Huberman. 2010. Predicting the popularity of online content. Commun. ACM 53, 8 (2010), 80--88.

Digital Library

[49]

Alexandru Tatar, Marcelo Dias De Amorim, Serge Fdida, and Panayotis Antoniadis. 2014. A survey on predicting the popularity of web content. Journal of Internet Services and Applications 5, 1 (2014), 1--20.

[50]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Conference on Neural Information Processing Systems (NIPS) 30 (2017).

[51]

Jing Wang, Shuo Yang, Hui Zhao, and Yue Yang. 2023. Social media popularity prediction with multimodal hierarchical fusion model. Computer Speech & Language 80 (2023), 101490.

Digital Library

[52]

WeChat. 2021. 2021 China University Computer Contest-WeChat Big Data Challenge. https://algo.weixin.qq.com/2021/problem-description.

[53]

Evan Weissburg, Arya Kumar, and Paramveer S Dhillon. 2022. Judging a book by its cover: Predicting the marginal impact of title on Reddit post popularity. In AAAI Conference on Web and Social Media, Vol. 16. 1098--1108.

[54]

Bo Wu, Wen-Huang Cheng, Yongdong Zhang, Qiushi Huang, Jintao Li, and Tao Mei. 2017. Sequential Prediction of Social Media Popularity with Deep Temporal Context Networks. In International Joint Conference on Artificial Intelligence (IJCAI). 3062--3068.

[55]

Bo Wu, Wen-Huang Cheng, Peiye Liu, Bei Liu, Zhaoyang Zeng, and Jiebo Luo. 2019. SMP Challenge: An Overview of Social Media Prediction Challenge 2019. In ACM International Conference on Multimedia (MM).

Digital Library

[56]

Bo Wu, Wen-Huang Cheng, Yongdong Zhang, and Tao Mei. 2016. Time matters: Multi-scale temporalization of social media popularity. In ACM International Conference on Multimedia (MM). 1336--1344.

Digital Library

[57]

Bo Wu, Tao Mei, Wen-Huang Cheng, and Yongdong Zhang. 2016. Unfolding temporal dynamics: Predicting social media popularity using multi-scale temporal decomposition. In AAAI Conference on Artificial Intelligence (AAAI), Vol. 30.

[58]

Lianwei Wu, Yuan Rao, Xiong Yang, Wanzhen Wang, and Ambreen Nazir. 2021. Evidence-aware hierarchical interactive attention networks for explainable claim verification. In International Joint Conference on Artificial Intelligence (IJCAI). 1388--1394.

[59]

Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. 2020. A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems (TNNLS) 32, 1 (2020), 4--24.

[60]

Jiayi Xie, Yaochen Zhu, and Zhenzhong Chen. 2021. Micro-video Popularity Prediction via Multimodal Variational Information Bottleneck. IEEE Transactions on Multimedia (2021).

[61]

Xovee Xu, Fan Zhou, Kunpeng Zhang, and Siyuan Liu. 2022. CCGL: Contrastive cascade graph learning. IEEE Transactions on Knowledge and Data Engineering (TKDE) 35, 5 (2022), 4539--4554.

[62]

Xovee Xu, Fan Zhou, Kunpeng Zhang, Siyuan Liu, and Goce Trajcevski. 2021. CasFlow: Exploring hierarchical structures and propagation uncertainty for cascade prediction. IEEE Transactions on Knowledge and Data Engineering (TKDE) 35, 4 (2021), 3484--3499.

Digital Library

[63]

Junliang Yu, Hongzhi Yin, Jundong Li, Qinyong Wang, Nguyen Quoc Viet Hung, and Xiangliang Zhang. 2021. Self-supervised multi-channel hypergraph convolutional network for social recommendation. In The Web Conference. 413--424.

Digital Library

[64]

Wei Zhang, Wen Wang, Jun Wang, and Hongyuan Zha. 2018. User-guided hierarchical attention network for multi-modal social image popularity prediction. In The Web Conference (WWW). 1277--1286.

Digital Library

[65]

Zhuoran Zhang, Shibiao Xu, Li Guo, and Wenke Lian. 2022. Multi-modal Variational Auto-Encoder Model for Micro-video Popularity Prediction. In International Conference on Communication and Information Processing. 9--16.

[66]

Fan Zhou, Xovee Xu, Goce Trajcevski, and Kunpeng Zhang. 2021. A survey of information cascade analysis: Models, predictions, and recent advances. ACM Computing Surveys (CSUR) 54, 2 (2021), 1--36.

Digital Library

Index Terms

Retrieval-Augmented Hypergraph for Multimodal Social Media Popularity Prediction

Index terms have been assigned to the content through auto-classification.

Recommendations

Does content determine information popularity in social media?: a case study of youtube videos' content and their popularity
CHI '14: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

We here investigate what drives the popularity of information on social media platforms. Focusing on YouTube, we seek to understand the extent to which content by itself determines a video's popularity. Using mechanical turk as experimental platform, we ...
The Ramsey number for hypergraph cycles I

Let C_n denote the 3-uniform hypergraph loose cycle, that is the hypergraph with vertices v₁.....,v_n and edges v₁v₂v₃, v₃v₄v₅, v₅v₆v₇,.....,v_n-1v_nv₁. We prove that every red-blue colouring of the edges of the complete 3-uniform hypergraph with N vertices ...
Popularity prediction with semantic retrieval for news recommendation
Abstract
News recommendation (NR) is critical for helping users to navigate the vast amount of information on online news platforms. However, the key challenges of tackling the cold-start problem, comprehensively modeling user interests and accurately ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 2024

6901 pages

ISBN:9798400704901

DOI:10.1145/3637528

General Chairs:
Ricardo Baeza-Yates
Northeastern University, USA
,
Francesco Bonchi
CENTAI / Eurecat, Italy

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Kashgar Science and Technology Bureau
National Natural Science Foundation of China

Conference

KDD '24

Sponsor:

KDD '24: The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona, Spain

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
306
Total Downloads

Downloads (Last 12 months)306
Downloads (Last 6 weeks)306

Reflects downloads up to 04 Oct 2024

Other Metrics

View Author Metrics

Citations

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents