Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

RoSGAS: Adaptive Social Bot Detection with Reinforced Self-supervised GNN Architecture Search

Published: 22 May 2023 Publication History

Abstract

Social bots are referred to as the automated accounts on social networks that make attempts to behave like humans. While Graph Neural Networks (GNNs) have been massively applied to the field of social bot detection, a huge amount of domain expertise and prior knowledge is heavily engaged in the state-of-the-art approaches to design a dedicated neural network architecture for a specific classification task. Involving oversized nodes and network layers in the model design, however, usually causes the over-smoothing problem and the lack of embedding discrimination. In this article, we propose RoSGAS, a novel Reinforced and Self-supervised GNN Architecture Search framework to adaptively pinpoint the most suitable multi-hop neighborhood and the number of layers in the GNN architecture. More specifically, we consider the social bot detection problem as a user-centric subgraph embedding and classification task. We exploit the heterogeneous information network to present the user connectivity by leveraging account metadata, relationships, behavioral features, and content features. RoSGAS uses a multi-agent deep reinforcement learning (RL), 31 pages. mechanism for navigating the search of optimal neighborhood and network layers to learn individually the subgraph embedding for each target user. A nearest neighbor mechanism is developed for accelerating the RL training process, and RoSGAS can learn more discriminative subgraph embedding with the aid of self-supervised learning. Experiments on five Twitter datasets show that RoSGAS outperforms the state-of-the-art approaches in terms of accuracy, training efficiency, and stability and has better generalization when handling unseen samples.

References

[1]
Norah Abokhodair, Daisy Yoo, and David W. McDonald. 2015. Dissecting a social botnet: Growth, content and influence in Twitter. CSCW. 839–851.
[2]
Seyed Ali Alhosseini, Raad Bin Tareaf, Pejman Najafi, and Christoph Meinel. 2019. Detect me if you can: Spam bot detection using inductive representation learning. WWW. 148–153.
[3]
Emily Alsentzer, Samuel Finlayson, Michelle Li, and Marinka Zitnik. 2020. Subgraph neural networks. NIPS 33 (2020), 8017–8029.
[4]
Albert-László Barabási and Réka Albert. 1999. Emergence of scaling in random networks. Science 286, 5439 (1999), 509–512.
[5]
Filippo Maria Bianchi, Daniele Grattarola, Lorenzo Livi, and Cesare Alippi. 2021. Graph neural networks with convolutional ARMAfilters. TPAMI (2021).
[6]
Adam Breuer, Roee Eilat, and Udi Weinsberg. 2020. Friend or faux: Graph-based early detection of fake accounts on social networks. WWW. 1287–1297.
[7]
Stefano Cresci. 2020. A decade of social bot detection. Commun. ACM 63, 10 (2020), 72–83.
[8]
Stefano Cresci, Roberto Di Pietro, Marinella Petrocchi, Angelo Spognardi, and Maurizio Tesconi. 2015. Fame for sale: Efficient detection of fake Twitter followers. Decis. Support Syst. 80 (2015), 56–71.
[9]
Stefano Cresci, Roberto Di Pietro, Marinella Petrocchi, Angelo Spognardi, and Maurizio Tesconi. 2017. The paradigm-shift of social spambots: Evidence, theories, and tools for the arms race. WWW. 963–972.
[10]
Quanyu Dai, Qiang Li, Jian Tang, and Dan Wang. 2018. Adversarial network embedding. In AAAI, Vol. 32.
[11]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. NAACL. 4171–4186.
[12]
Ming Ding, Jie Tang, and Jie Zhang. 2018. Semi-supervised learning on graphs with generative adversarial nets. CIKM. 913–922.
[13]
Yuxiao Dong, Nitesh V. Chawla, and Ananthram Swami. 2017. metapath2vec: Scalable representation learning for heterogeneous networks. KDD. 135–144.
[14]
Yingtong Dou, Zhiwei Liu, Li Sun, Yutong Deng, Hao Peng, and Philip S. Yu. 2020. Enhancing graph neural network-based fraud detectors against camouflaged fraudsters. CIKM. 315–324.
[15]
Shangbin Feng, Zhaoxuan Tan, Rui Li, and Minnan Luo. 2022. Heterogeneity-aware Twitter bot detection with relational graph transformers. AAAI.
[16]
Shangbin Feng, Herun Wan, Ningnan Wang, Jundong Li, and Minnan Luo. 2021. TwiBot-20: A comprehensive Twitter bot detection benchmark. CIKM. 4485–4494.
[17]
Shangbin Feng, Herun Wan, Ningnan Wang, and Minnan Luo. 2021. BotRGCN: Twitter bot detection with relational graph convolutional networks. In Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’21). 236–239.
[18]
Emilio Ferrara, Onur Varol, Clayton Davis, Filippo Menczer, and Alessandro Flammini. 2016. The rise of social bots. Commun. ACM 59, 7 (2016), 96–104.
[19]
Matthias Fey and Jan E. Lenssen. 2019. Fast graph representation learning with PyTorch geometric. ICLR Workshop.
[20]
Yang Gao, Hong Yang, Peng Zhang, Chuan Zhou, and Yue Hu. 2020. Graph neural architecture search. IJCAI, Vol. 20. 1403–1409.
[21]
Zafar Gilani, Ekaterina Kochmar, and Jon Crowcroft. 2017. Classification of Twitter accounts into automated agents and human users. ASONAM. 489–496.
[22]
Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. KDD. 855–864.
[23]
William L. Hamilton, Rex Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. NIPS. 1025–1035.
[24]
Yiming Hei, Renyu Yang, Hao Peng, Lihong Wang, Xiaolin Xu, Jianwei Liu, Hong Liu, Jie Xu, and Lichao Sun. 2021. Hawk: Rapid android malware detection through heterogeneous graph attention networks. TNNLS (2021), 1–15.
[25]
Thomas N. Kipf and Max Welling. 2016. Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 (2016).
[26]
Thomas N. Kipf and Max Welling. 2017. Semi-supervised classification with graph convolutional networks. ICLR.
[27]
Kwei-Herng Lai, Daochen Zha, Kaixiong Zhou, and Xia Hu. 2020. Policy-GNN: Aggregation optimization for graph neural networks. KDD. 461–471.
[28]
Gustav Larsson, Michael Maire, and Gregory Shakhnarovich. 2016. Learning representations for automatic colorization. ECCV. Springer, 577–593.
[29]
John Boaz Lee, Ryan A. Rossi, Xiangnan Kong, Sungchul Kim, Eunyee Koh, and Anup Rao. 2019. Graph convolutional networks with motif-based attention. CIKM. 499–508.
[30]
Qimai Li, Zhichao Han, and Xiao-Ming Wu. 2018. Deeper insights into graph convolutional networks for semi-supervised learning. 32nd AAAI Conference on Artificial Intelligence.
[31]
Ziqi Liu, Chaochao Chen, Xinxing Yang, Jun Zhou, Xiaolong Li, and Le Song. 2018. Heterogeneous graph neural networks for malicious account detection. CIKM. 2077–2085.
[32]
Michele Mazza, Stefano Cresci, Marco Avvenuti, Walter Quattrociocchi, and Maurizio Tesconi. 2019. Rtbust: Exploiting temporal patterns for botnet detection on Twitter. WebSci. 183–192.
[33]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, et al. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529–533.
[34]
Lev Muchnik, Sen Pei, Lucas C. Parra, Saulo D. S. Reis, José S. Andrade Jr., Shlomo Havlin, and Hernán A. Makse. 2013. Origins of power-law degree distribution in the heterogeneity of human activity in social networks. Scientific Reports 3, 1 (2013), 1–8.
[35]
Hao Peng, Jianxin Li, Qiran Gong, Yuanxin Ning, Senzhang Wang, and Lifang He. 2020. Motif-matching based subgraph-level attentional convolutional network for graph classification. AAAI, Vol. 34. 5387–5394.
[36]
Hao Peng, Renyu Yang, Zheng Wang, Jianxin Li, Lifang He, Philip Yu, Albert Zomaya, and Raj Ranjan. 2021. Lime: Low-cost incremental learning for dynamic heterogeneous information networks. IEEE Trans. Comput. (2021), 628–642.
[37]
Hao Peng, Ruitong Zhang, Yingtong Dou, Renyu Yang, Jingyi Zhang, and Philip S. Yu. 2021. Reinforced neighborhood selection guided multi-relational graph neural networks. ACM Trans. Inf. Syst. 40, 4, Article 69 (Dec.2021), 46 pages.
[38]
Hao Peng, Ruitong Zhang, Shaoning Li, Yuwei Cao, Shirui Pan, and Philip S. Yu. 2023. Reinforced, incremental and cross-lingual event detection from social messages. TPAMI 45, 1 (2023), 980–998.
[39]
Zhen Peng, Wenbing Huang, Minnan Luo, Qinghua Zheng, Yu Rong, Tingyang Xu, and Junzhou Huang. 2020. Graph representation learning via graphical mutual information maximization. WWW. 259–270.
[40]
Franco Scarselli, Marco Gori, Ah Chung Tsoi, Markus Hagenbuchner, and Gabriele Monfardini. 2008. The graph neural network model. IEEE Transactions on Neural Networks 20, 1 (2008), 61–80.
[41]
Junhong Shen and Lin F. Yang. 2021. Theoretically principled deep RL acceleration via nearest neighbor function approximation. AAAI, Vol. 35. 9558–9566.
[42]
Chuan Shi, Yitong Li, Jiawei Zhang, Yizhou Sun, and S. Yu Philip. 2016. A survey of heterogeneous information network analysis. TKDE 29, 1 (2016), 17–37.
[43]
Fan-Yun Sun, Jordan Hoffmann, Vikas Verma, and Jian Tang. 2019. Infograph: Unsupervised and semi-supervised graph-level representation learning via mutual information maximization. arXiv preprint arXiv:1908.01000 (2019).
[44]
Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. Line: Large-scale information network embedding. WWW. 1067–1077.
[45]
Julian R. Ullmann. 1976. An algorithm for subgraph isomorphism. Journal of the ACM (JACM) 23, 1 (1976), 31–42.
[46]
Aäron van den Oord, Nal Kalchbrenner, Lasse Espeholt, Koray Kavukcuoglu, Oriol Vinyals, and Alex Graves. 2016. Conditional image generation with PixelCNN decoders. NIPS. 4790–4798.
[47]
Onur Varol, Emilio Ferrara, Clayton Davis, Filippo Menczer, and Alessandro Flammini. 2017. Online human-bot interactions: Detection, estimation, and characterization. In Proceedings of the International AAAI Conference on Web and Social Media (ICWSM’17), Vol. 11.
[48]
Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. 2018. Graph attention networks. ICLR.
[49]
Petar Velickovic, William Fedus, William L. Hamilton, Pietro Liò, Yoshua Bengio, and R. Devon Hjelm. 2019. Deep graph infomax. ICLR (Poster) 2, 3 (2019), 4.
[50]
Jianyu Wang, Rui Wen, Chunming Wu, Yu Huang, and Jian Xion. 2019. Fdgars: Fraudster detection via graph convolutional networks in online app review system. WWW. 310–316.
[51]
Felix Wu, Amauri Souza, Tianyi Zhang, Christopher Fifty, Tao Yu, and Kilian Weinberger. 2019. Simplifying graph convolutional networks. ICML. 6861–6871.
[52]
Wenhan Xiong, Thien Hoang, and William Yang Wang. 2017. Deeppath: A reinforcement learning method for knowledge graph reasoning. EMNLP. 564–573.
[53]
Keyulu Xu, Chengtao Li, Yonglong Tian, Tomohiro Sonobe, Ken-ichi Kawarabayashi, and Stefanie Jegelka. 2018. Representation learning on graphs with jumping knowledge networks. ICML. PMLR, 5453–5462.
[54]
Carl Yang, Mengxiong Liu, Vincent W. Zheng, and Jiawei Han. 2018. Node, motif and subgraph: Leveraging network functional blocks through structural convolution. ASONAM. IEEE, 47–52.
[55]
Kai-Cheng Yang, Onur Varol, Clayton A. Davis, Emilio Ferrara, Alessandro Flammini, and Filippo Menczer. 2019. Arming the public with artificial intelligence to counter social bots. Comput. Hum. Behav. 1, 1 (2019), 48–61.
[56]
Kai-Cheng Yang, Onur Varol, Pik-Mai Hui, and Filippo Menczer. 2020. Scalable and generalizable social bot detection through data selection. AAAI, Vol. 34. 1096–1103.
[57]
Xiaoyu Yang, Yuefei Lyu, Tian Tian, Yifei Liu, Yudong Liu, and Xi Zhang. 2020. Rumor detection on social media with graph structured adversarial learning. IJCAI. 1417–1423.
[58]
Zihao Yuan, Qi Yuan, and Jiajing Wu. 2020. Phishing detection on ethereum via learning representation of transaction subgraphs. BlockSys. Springer, 178–191.
[59]
Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, and Viktor Prasanna. 2020. Graphsaint: Graph sampling based inductive learning method. ICLR.
[60]
Xingxing Zhang, Furu Wei, and Ming Zhou. 2019. HIBERT: Document level pre-training of hierarchical bidirectional transformers for document summarization. ACL. 5059–5069.
[61]
Xusheng Zhao, Qiong Dai, Jia Wu, Hao Peng, Mingsheng Liu, Xu Bai, Jianlong Tang, and Philip S. Yu. 2022. Multi-view tensor graph neural networks through reinforced aggregation. TKDE (2022), 1–14.
[62]
Zhiqiang Zhong, Cheng-Te Li, and Jun Pang. 2020. Reinforcement learning enhanced heterogeneous graph neural network. arXiv preprint arXiv:2010.13735 (2020).
[63]
Kaixiong Zhou, Qingquan Song, Xiao Huang, and Xia Hu. 2019. Auto-GNN: Neural architecture search of graph neural networks. arXiv preprint arXiv:1909.03184 (2019).

Cited By

View all
  • (2025)Temporal-spatial-fusion-based risk assessment on the adjacent building during deep excavationInformation Fusion10.1016/j.inffus.2024.102653114(102653)Online publication date: Feb-2025
  • (2024)Unsupervised Social Bot Detection via Structural Information TheoryACM Transactions on Information Systems10.1145/366052242:6(1-42)Online publication date: 19-Aug-2024
  • (2024)Multimodal Detection of Bots on X (Twitter) Using TransformersIEEE Transactions on Information Forensics and Security10.1109/TIFS.2024.343513819(7320-7334)Online publication date: 2024
  • Show More Cited By

Index Terms

  1. RoSGAS: Adaptive Social Bot Detection with Reinforced Self-supervised GNN Architecture Search

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on the Web
      ACM Transactions on the Web  Volume 17, Issue 3
      August 2023
      302 pages
      ISSN:1559-1131
      EISSN:1559-114X
      DOI:10.1145/3597636
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 22 May 2023
      Online AM: 02 December 2022
      Accepted: 20 October 2022
      Revised: 14 June 2022
      Received: 25 January 2022
      Published in TWEB Volume 17, Issue 3

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Graph neural network
      2. architecture search
      3. reinforcement learning

      Qualifiers

      • Research-article

      Funding Sources

      • National Key R&D Program of China
      • S&T Program of Hebei
      • NSFC
      • National Key R&D Program of China
      • UK EPSRC
      • UK Turing Pilot Project
      • UK Alan Turing PDEA Scheme

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)643
      • Downloads (Last 6 weeks)44
      Reflects downloads up to 02 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)Temporal-spatial-fusion-based risk assessment on the adjacent building during deep excavationInformation Fusion10.1016/j.inffus.2024.102653114(102653)Online publication date: Feb-2025
      • (2024)Unsupervised Social Bot Detection via Structural Information TheoryACM Transactions on Information Systems10.1145/366052242:6(1-42)Online publication date: 19-Aug-2024
      • (2024)Multimodal Detection of Bots on X (Twitter) Using TransformersIEEE Transactions on Information Forensics and Security10.1109/TIFS.2024.343513819(7320-7334)Online publication date: 2024
      • (2024)Multi-hop Attention-based Graph Pooling: A Personalized PageRank Perspective2024 Third International Conference on Distributed Computing and High Performance Computing (DCHPC)10.1109/DCHPC60845.2024.10454077(1-7)Online publication date: 14-May-2024
      • (2024)SEGCN: a subgraph encoding based graph convolutional network model for social bot detectionScientific Reports10.1038/s41598-024-54809-z14:1Online publication date: 19-Feb-2024
      • (2024)SStackGNN: Graph Data Augmentation Simplified Stacking Graph Neural Network for Twitter Bot DetectionInternational Journal of Computational Intelligence Systems10.1007/s44196-024-00496-717:1Online publication date: 29-Apr-2024
      • (2024)A hierarchical dual-view model for fake news detection guided by discriminative lexiconsInternational Journal of Machine Learning and Cybernetics10.1007/s13042-024-02322-0Online publication date: 23-Aug-2024
      • (2024)Automated catastrophic optical damage inspection of semiconductor laser chip based on multi-scale strip convolution aggregationInternational Journal of Machine Learning and Cybernetics10.1007/s13042-023-02079-y15:7(3027-3042)Online publication date: 31-Jan-2024
      • (2024)Coarse-to-fine label propagation with hybrid representation for deep semi-supervised bot detectionWireless Networks10.1007/s11276-024-03821-2Online publication date: 14-Aug-2024
      • (2024)BotScout: A Social Bot Detection Algorithm Based on Semantics, Attributes and NeighborhoodsAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5581-3_28(343-355)Online publication date: 1-Aug-2024
      • Show More Cited By

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      Full Text

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media