Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Advertisement

Privacy information propagation in online social networks - a case study based on Weibo data

  • Regular Contribution
  • Published:
International Journal of Information Security Aims and scope Submit manuscript

Abstract

The ever-increasing popularity of online social networks (OSNs) has made the propagation of privacy information in such networks a great concern. This paper aims to provide an in-depth study to reveal some main characteristics as well as the impacting factors on the propagation of privacy information in OSNs so as to establish a scientific basis for the development of privacy protection policies and mechanisms. Challenges in the construction of privacy information propagation models include a proper definition of privacy information and a precise characterization of the propagation. To realize the goals, in this study, we first provide a definition of privacy information and then propose a method for the reconstruction of the propagation paths of privacy information in Weibo (W-PIPPR), one of the most popular OSNs in China (https://weibo.com), based on which a dataset for privacy information propagation (PIPD-Weibo) has been constructed. In addition, we conducted an assessment on general perceptions of the sensitivity of various privacy attributes based on the questionnaire “What is your privacy?” that we designed and distributed. Analysis performed on PIPD-Weibo revealed the speed and scale as well as the topological structure of the propagation, showing that the influence of the privacy subjects as well as the sensitivity of private attributes is significant on the speed and scale of the propagation. Our study can not only provide some insight understanding of the propagation of privacy information in OSNs, but also contribute to accumulating empirical cases for the research on the propagation of privacy information in OSNs. Besides, our study has some practical implications on the design of software for privacy information propagation in OSNs and can aid the development of effective cybersecurity and privacy protection policies and strategies in OSNs.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data availability

We construct a Weibo-based dataset and make it publicly available for future research on the study of propagation of privacy information (PIPD-Weibo) that contains 30 instances of privacy information. PIPD-Weibo can be accessed via https://github.com/honglalala/PIPD--Weibo.

References

  1. Avgerou, A.D., Stamatiou, Y.C.: Privacy awareness diffusion in social networks. IEEE Secur. Priv. 13, 44–50 (2015). https://doi.org/10.1109/MSP.2015.136

    Article  Google Scholar 

  2. Wu, Y.: L Pan Privacy-aware personal information propagation management in social networks. Proc. - 2018 IEEE 3rd Int. Conf. Data Sci. Cyberspace DSC 2018 169–174 https://doi.org/10.1109/DSC.2018.00032 (2018)

  3. Cheng, Y., Ma, J., Liu, Z., Wu, Y., Wei, K., Dong, C.: A lightweight privacy preservation Scheme with efficient Reputation Management for Mobile Crowdsensing in Vehicular Networks. IEEE Trans. Dependable Secur. Comput. 20, 1771–1788 (2023). https://doi.org/10.1109/TDSC.2022.3163752

    Article  Google Scholar 

  4. Majeed, A., Hwang, S.O.: Rectification of syntactic and semantic privacy mechanisms. IEEE Secur. Priv. 21, 18–32 (2023). https://doi.org/10.1109/MSEC.2022.3188365

    Article  Google Scholar 

  5. Xie, Y., Tong, S., Zhou, P., Li, Y., Feng, D.: Efficient Storage Management for Social Network Events Based on Clustering and Hot/Cold Data classification. IEEE Trans. Comput. Soc. Syst. 10, 120–130 (2023). https://doi.org/10.1109/TCSS.2022.3146310

    Article  Google Scholar 

  6. Zhang, Z., Ren, F., Zhang, J., Su, S., Yan, Y., Wei, Q., Sun, L., Zhu, G., Guo, C.: When Behavior Analysis meets Social Network Alignment. IEEE Trans. Knowl. Data Eng. 35, 7590–7607 (2022). https://doi.org/10.1109/TKDE.2022.3197985

    Article  Google Scholar 

  7. Hsu, B.Y., Yeh, L.Y., Chang, M.Y., Shen, C.Y.: Willingness maximization for Ego Network Data extraction in multiple online Social Networks. IEEE Trans. Knowl. Data Eng. 35, 8672–8686 (2023). https://doi.org/10.1109/TKDE.2022.3207150

    Article  Google Scholar 

  8. Aristotle: Politics. China Renmin University Press (2003)

  9. Stephen, J.F.: Liberty, Equality, Fraternity. Henry Hold and Co, New York (1873)

    Google Scholar 

  10. Alan, F.: Westin: Privacy and Freedom. Atheneum (1968)

  11. Nissenbaum, H.: Privacy as contestual integrity. Washingt Law Rev. 79, 101–139 (2004)

    Google Scholar 

  12. Nissenbaum, H.: Privacy in Context: Technology, Policy, and the Integrity of Social life. Stanford University Press (2011)

  13. Sanfilippo, M.R., Shvartzshnaider, Y., Reyes, I., Nissenbaum, H., Egelman, S.: Disaster privacy/privacy disaster. J. Assoc. Inf. Sci. Technol. 71, 1002–1014 (2020). https://doi.org/10.1002/asi.24353

    Article  Google Scholar 

  14. Shvartzshnaider, Y., Wies, T., Pavlinovic, Z., Lakshminarayanan, Mittal, P., Balashankar, A., Nissenbaum, H.: Vaccine: Using contextual integrity for data leakage detection. In: Proceedings of the World Wide Web Conference 2019, San Francisco, CA, USA. pp. 1702–1712 (2019)

  15. Theodorakopoulos, G., Panaousis, E., Liang, K., Loukas, G.: On-the-fly privacy for location histograms. IEEE Trans. Dependable Secur. Comput. 19, 566–578 (2022). https://doi.org/10.1109/TDSC.2020.2980270

    Article  Google Scholar 

  16. Yu, H., Sun, H., Xu, D.: Research on the Dilemma and Countermeasures of Employees’ Right to Privacy Based on Big Data. In: Proceedings of 2021 2nd International Conference on Big Data and Informatization Education, Hangzhou, China. pp. 21–28 (2021)

  17. Perumal, S., Aramugam, R., Samy, G.N., Krishnasamy, K., Shanmugam, B.: Proposed customer’s sensitive information privacy model for financial institution. In: Proceedings of 2019 International Conference on Computing, Electronics and Communications Engineering, London, UK. pp. 203–207. IEEE (2019)

  18. Li, H., Xia, C., Wang, T., Wen, S., Chen, C., Xiang, Y.: Capturing Dynamics of Information Diffusion in SNS: A Survey of Methodology and techniques. ACM Comput. Surv. 55 (2021). https://doi.org/10.1145/3485273

  19. Zhu, H., Huang, C., Li, H.: Information diffusion model based on privacy setting in online social networking services. Comput. J. 58, 536–548 (2015). https://doi.org/10.1093/comjnl/bxu062

    Article  Google Scholar 

  20. Goel, S., Watts, D.J., Goldstein, D.G.: The structure of online diffusion networks. In: Proceedings of the ACM Conference on Electronic Commerce. pp. 623–638 (2012)

  21. Zhang, F., Tang, J., Liu, X., Hou, Z., Dong, Y., Zhang, J., Liu, X., Xie, R., Zhuang, K., Zhang, X., Lin, L., Yu, P.S.: Understanding WeChat user preferences and wow diffusion. IEEE Trans. Knowl. Data Eng. 34, 6033–6046 (2022). https://doi.org/10.1109/TKDE.2021.3064233

    Article  Google Scholar 

  22. Moosa, J., Awad, W., Kalganova, T.: Accuracy and Privacy Evaluation of detected communities using Attributed-Based Label Propagation Method. Conf. IT Innov. Knowl. Discov. ITIKD 2023. 1–6 (2023). (2023). Int https://doi.org/10.1109/ITIKD56332.2023.10100272

  23. Wu, X., Fu, L., Long, H., Yang, D., Lu, Y., Wang, X., Chen, G.: Adaptive diffusion of Sensitive Information in Online Social Networks. IEEE Trans. Knowl. Data Eng. 33, 3020–3034 (2021). https://doi.org/10.1109/TKDE.2020.2964242

    Article  Google Scholar 

  24. Hu, X., Zhu, T., Zhai, X., Wang, H., Zhou, W., Zhao, W.: Privacy data Diffusion modeling and preserving in Online Social Network. IEEE Trans. Knowl. Data Eng. 35, 6224–6237 (2023). https://doi.org/10.1109/TKDE.2022.3176948

    Article  Google Scholar 

  25. Hu, X., Zhu, T., Zhai, X., Zhou, W., Zhao, W.: Privacy data propagation and preservation in Social Media: A real-world case study. IEEE Trans. Knowl. Data Eng. 35, 4137–4150 (2023). https://doi.org/10.1109/TKDE.2021.3137326

    Article  Google Scholar 

  26. Li, Z., Lv, T., Zhang, X., Chen, X.: The effects of personal characteristics and interpersonal influence on privacy information diffusion in SNS. In: Proceedings of 2013 IEEE International Conference on Service Operations and Logistics, and Informatics, SOLI 2013. pp. 413–418. IEEE (2013)

  27. Zhu, T., Li, J., Hu, X., Xiong, P., Zhou, W.: The dynamic privacy-preserving mechanisms for Online Dynamic Social Networks. IEEE Trans. Knowl. Data Eng. 34, 2962–2974 (2022). https://doi.org/10.1109/TKDE.2020.3015835

    Article  Google Scholar 

  28. Caliskan-Islam, A., Walsh, J., Greenstadt, R.: Privacy detective: Detecting private information and collective privacy behavior in a large social network. Proc. ACM Conf. Comput. Commun. Secur. 35–46 (2014). https://doi.org/10.1145/2665943.2665958

  29. Yi, Y., Zhu, N., He, J., Jurcut, A.D., Zhao, B.: Toward pragmatic modeling of privacy information propagation in online social networks. Comput. Networks. 219 (2022). https://doi.org/10.1016/j.comnet.2022.109429

  30. Chen, B.: Weibo hand Slipped to like After Cancel Friends know? https://baijiahao.baidu.com/s?id=1761774757330971791픴=spider&for=pc

  31. Page, L.: The PageRank citation ranking: Bringing order to the web. (1999)

  32. Kim, S., Han, J., Yoo, S., Gerla, M.: How are social influencers connected in instagram? Lect Notes Comput. Sci. (Including Subser. Lect Notes Artif. Intell. Lect Notes Bioinformatics). 10540 LNCS, 257–264 (2017). https://doi.org/10.1007/978-3-319-67256-4_20

    Article  Google Scholar 

  33. Medler, J.T.: The types of Flatidae (Homoptera) in the Stockholm Museum described by Stål, Melichar, Jacobi and Walker. Insect Syst. Evol. 17, 323–337 (1986). https://doi.org/10.1163/187631286X00251

    Article  Google Scholar 

  34. Ebbinghaus, H.: Memory: A contribution to experimental psychology. Ann. Neurosci. 20, 155–156 (2013). https://doi.org/10.5214/ans.0972.7531.200408

    Article  Google Scholar 

  35. Cui, Z., Sun, X., Chen, H., Pan, L., Cui, L., Liu, S., Xu, G.: Dynamic recommendation based on Graph Diffusion and Ebbinghaus curve. IEEE Trans. Comput. Soc. Syst. PP. 1–10 (2023). https://doi.org/10.1109/TCSS.2023.3267611

  36. Barabási, A.L., Albert, R.: Emergence of scaling in random networks. Science. 286, 509–512 (1999). https://doi.org/10.1126/science.286.5439.509

    Article  MathSciNet  Google Scholar 

  37. Clauset, A., Shalizi, C.R., Newman, M.E.J.: Power-law distributions in empirical data. SIAM Rev. 51, 661–703 (2009). https://doi.org/10.1137/070710111

    Article  MathSciNet  Google Scholar 

  38. Kezer, M., Dienlin, T., Baruh, L.: Getting the privacy Calculus right: Analyzing the relations between privacy concerns, expected benefits, and Self-Disclosure using response surface analysis. Cyberpsychology. 16 (2022). https://doi.org/10.5817/CP2022-4-1

  39. Vgena, K., Kitsiou, A., Kalloniatis, C.: Understanding the role of users’ socio-location attributes and their privacy implications on social media. Inf. \& Comput. Secur. 30, 705–729 (2022)

    Article  Google Scholar 

  40. Bhroin, N.N.I., Dinh, T., Thiel, K., Lampert, C., Staksrud, E., Ólafsson, K.: The privacy Paradox by Proxy: Considering predictors of Sharenting. Media Commun. 10, 371–383 (2022). https://doi.org/10.17645/mac.v10i1.4858

    Article  Google Scholar 

  41. Khattar, P.: What you don’t know will Hurt You - fighting the privacy Paradox by Designing for privacy and enforcing Protective Technology. SSRN Electron. J. 18 (2023). https://doi.org/10.2139/ssrn.4380722

  42. Vgena, K., Kitsiou, A., Kalloniatis, C., Gritzalis, S.: Determining the role of Social Identity attributes to the protection of users’ privacy in Social Media. Futur Internet. 14, 1–18 (2022). https://doi.org/10.3390/fi14090249

    Article  Google Scholar 

Download references

Acknowledgements

The work presented in this paper has been supported by Beijing Natural Science Foundation (No. IS23054), Science Foundation of China University of Petroleum, Beijing (No. 2462024SZBH007) and by the 2023 International Cooperation Training Program for Innovative Talents (“Double First-class” Construction Special Program - “Artificial Intelligence + Internet of Things”) of the China Scholarship Council (CSC).

Author information

Authors and Affiliations

Authors

Contributions

Jingsha He and Nafei Zhu proposed the conceptualization and methodology, Yehong Luo wrote the manuscript text, conducted experiments, collated and analyzed the data, Lei Sun and Ziwen Wang did some experiments, Yuzi Yi and Xiangjun Ma collated the data, Jurcut, Anca Delia revised the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Jingsha He.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

Questionnaires are generally used as a method of research to uncover facts and to provide some insights into what individuals really think about something. The definition of privacy stresses that the judgment on privacy by humans is subjective, i.e., humans are the decision makers regarding what privacy is. Consequently, we decided to design a questionnaire and ask participants to tell us the importance of private attributes from their perspectives. The design of the questionnaire “What is your privacy?” consists mainly of three parts.

The first part aims to collect the basic information of the participants, which includes: age, gender, occupation, education level and marital status. Since the judgment of privacy is personal, participants of the survey should span across as many different walks of life as possible to ensure the diversity of the collected data. Different ages and different occupations also matter greatly. For example, politicians would generally prefer keeping their contact information confidential while teachers often publish such information on their websites. Gender, education and marital status would greatly influence the privacy awareness of individuals.

The second part aims to collect the privacy opinions of the participants on the set of private attributes that we include in the questionnaire based on the common belief that humans would generally agree on a common set of private attributes. Caliskan et al. categorized personal attributes into nine categories based on information posted in tweets [28]. In our design, we summarized the private attributes into eight categories after removing “neutral descriptions”. Participants could also augment the questionnaire with private attributes that they considered to be relevant.

The third part aims to collect information on how the participants feel about the sensitivity of each private attribute in the questionnaire. For each choice of the private attributes by a participant, an assessment of the likelihood of releasing the private attribute was sought. The likelihood in the questionnaire employed a 5-point scale from “I will not release it” to “I will definitely release it” with lower values indicating higher levels of the willingness to conceal and higher values indicating higher levels of the willingness to release. The general framework as well as the design of the questionnaire is shown in Fig. 14.

Fig. 14
figure 14

The general framework of the “What is your privacy?” questionnaire

We handed out 500 questionnaires to 500 volunteers and received 138 back among which 135 were valid. The 135 participants were from various regions of China consisting of 78 males and 57 females ranging from 18 to 50 years old. Among the 135 participants, 43 (31.34%) are between the ages of 18 and 25, 45 (35.07%) between 26 and 30, 28 (20.90%) between 31 and 40, and 19 (11.94%) between 41 and 50. In terms of the education levels of the 135 participants, 34 (26.12%) did not attend the college, 49 (32.84%) with a bachelor’s degree, 39 (31.34%) with a master’s degree, and 13 (9.70%) with a doctoral degree. In terms of the marital status, 94 (70.15%) were unmarried and 41 (29.85%) were married. In terms of the occupation, there were 25 (18.51%) students, 10 (7.41%) artists, 12 (8.89%) social workers, 29 (21.48%) researcher, 10 (7.41%) managers, 24 (17.78%) skilled workers, 14 (10.37%) transactional employees, and 11 (8.15%) freelancers. Some statistics on the received and valid questionnaires are summarized in Figs. 15 and 16

Fig. 15
figure 15

Number of participants who perceived the attributes as private attributes

Fig. 16
figure 16

Percentage of the participants who would choose the different posting possibilities

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Luo, Y., Zhu, N., Wang, Z. et al. Privacy information propagation in online social networks - a case study based on Weibo data. Int. J. Inf. Secur. 24, 32 (2025). https://doi.org/10.1007/s10207-024-00946-5

Download citation

  • Published:

  • DOI: https://doi.org/10.1007/s10207-024-00946-5

Keywords