Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Public Access

User Modeling on Demographic Attributes in Big Mobile Social Networks

Published: 11 July 2017 Publication History

Abstract

Users with demographic profiles in social networks offer the potential to understand the social principles that underpin our highly connected world, from individuals, to groups, to societies. In this article, we harness the power of network and data sciences to model the interplay between user demographics and social behavior and further study to what extent users’ demographic profiles can be inferred from their mobile communication patterns. By modeling over 7 million users and 1 billion mobile communication records, we find that during the active dating period (i.e., 18--35 years old), users are active in broadening social connections with males and females alike, while after reaching 35 years of age people tend to keep small, closed, and same-gender social circles. Further, we formalize the demographic prediction problem of inferring users’ gender and age simultaneously. We propose a factor graph-based WhoAmI method to address the problem by leveraging not only the correlations between network features and users’ gender/age, but also the interrelations between gender and age. In addition, we identify a new problem—coupled network demographic prediction across multiple mobile operators—and present a coupled variant of the WhoAmI method to address its unique challenges. Our extensive experiments demonstrate the effectiveness, scalability, and applicability of the WhoAmI methods. Finally, our study finds a greater than 80% potential predictability for inferring users’ gender from phone call behavior and 73% for users’ age from text messaging interactions.

References

[1]
Talayeh Aledavood, Eduardo López, Sam G. B. Roberts, Felix Reed-Tsochas, Esteban Moro Egido, Robin I. M. Dunbar, and Jari Saramäki. 2015. Channel-specific daily patterns in mobile phone communication. CoRR, abs/1507.04596 (2015).
[2]
Bin Bi, Milad Shokouhi, Michal Kosinski, and Thore Graepel. 2013. Inferring the demographics of search users: Social data meets search queries. In WWW’13. 131--140.
[3]
Vincent D. Blondel, Adeline Decuyper, and Gautier Krings. 2015. A survey of results on mobile phone datasets analysis. arXiv:1502.03406 (2015).
[4]
F. Calabrese, L. Ferrari, and V. Blondel. 2014. Urban sensing using mobile phones network data: A survey of research. ACM Comput. Surv. (2014).
[5]
Deepayan Chakrabarti, Stanislav Funiak, Jonathan Chang, and Sofus A. Macskassy. 2014. Joint inference of multiple label types in large networks. In ICML’14. 874--882.
[6]
Benjamin Cornwell. 2011. Age trends in daily social contact patterns. Res. Aging 33, 5 (2011), 598--631.
[7]
Yuxiao Dong, Fabio Pinelli, Yiannis Gkoufas, Zubair Nabi, Francesco Calabrese, and Nitesh V. Chawla. 2015a. Inferring unusual crowd events from mobile phone call detail records. In Machine Learning and Knowledge Discovery in Databases. Springer International Publishing, 474--492.
[8]
Yuxiao Dong, Jie Tang, Nitesh V. Chawla, Tiancheng Lou, Yang Yang, and Bai Wang. 2015b. Inferring social status and rich club effects in enterprise communication networks. PLoS ONE 10 (03 2015), e0119446.
[9]
Yuxiao Dong, Jie Tang, Tiancheng Lou, Bin Wu, and Nitesh V. Chawla. 2013. How long will she call me? Distribution, social theory and duration prediction. In Machine Learning and Knowledge Discovery in Databases. Springer, 16--31.
[10]
Yuxiao Dong, Yang Yang, Jie Tang, Yang Yang, and Nitesh V. Chawla. 2014. Inferring user demographics and social strategies in mobile social networks. In KDD’14. ACM, 15--24.
[11]
Yuxiao Dong, Jing Zhang, Jie Tang, Nitesh V. Chawla, and Bai Wang. 2015. CoupledLP: Link prediction in coupled networks. In KDD’15. ACM, 199--208.
[12]
Nan Du, Christos Faloutsos, Bai Wang, and Leman Akoglu. 2009. Large human communication networks: Patterns and a utility-driven generator. In KDD’09. ACM, 269--278.
[13]
Nathan Eagle, Alex (Sandy) Pentland, and David Lazer. 2009. Inferring social network structure using mobile phone data. Proc. Natl. Acad. Sci. U.S.A. 106, 36 (2009).
[14]
David Easley and Jon Kleinberg. 2010. Networks, Crowds, and Markets: Reasoning about a Highly Connected World. Cambridge University Press.
[15]
Mária Ercsey-Ravasz, Ryan N. Lichtenwalter, Nitesh V. Chawla, and Zoltán Toroczkai. 2012. Range-limited centrality measures in complex networks. Phys. Rev. E 85, 6 (Jun 2012), 066103.
[16]
Linton C. Freeman. 1982. Centered graphs and the structure of ego networks. Math. Soc. Sci. 3, 3 (1982), 291--304.
[17]
Huiji Gao, Jiliang Tang, Xia Hu, and Huan Liu. 2013. Modeling temporal effects of human mobile behavior on location-based social networks. In CIKM’13. 1673--1678.
[18]
Andrew Gelman and Jennifer Hill. 2006. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge University Press.
[19]
Marta C. Gonzalez, Cesar A. Hidalgo, and Albert-Laszlo Barabasi. 2008. Understanding individual human mobility patterns. Nature 453, 7196 (2008), 779--782.
[20]
Mark Granovetter. 1973. The strength of weak ties. Am. J. Sociology 78, 6 (1973), 1360--1380.
[21]
Mark Granovetter. 1985. Economic action and social structure: The problem of embeddedness. Amer. J. Sociol. (1985).
[22]
Susan C. Herring. 2003. Gender and power in on-line communication. In Handbook of Language and Gender. Wiley-Blackwell, 202.
[23]
Cesar A. Hidalgo and C. Rodriguez-Sickert. 2008. The dynamics of a mobile phone network. Physica A: Stat. Mech. Appl. 387, 12 (2008), 3017--3024.
[24]
Jian Hu, Hua-Jun Zeng, Hua Li, Cheng Niu, and Zheng Chen. 2007. Demographic prediction based on user’s browsing behavior. In WWW’07. 151--160.
[25]
Xia Hu and Huan Liu. 2012. Social status and role analysis of Palin’s email network. In WWW’12 Companion. ACM, 531--532.
[26]
Lauri Kovanen, Kimmo Kaski, János Kertész, and Jari Saramäki. 2013. Temporal motifs reveal homophily, gender-specific patterns, and group talk in call sequences. PNAS 110, 45 (2013), 18070--18075.
[27]
David Krackhardt. 1992. The Strength of Strong Ties. Cambridge, Harvard Business School Press, Hershey, USA.
[28]
Frank R. Kschischang, Brendan J. Frey, and Hans A. Loeliger. 2001. Factor graphs and the sum-product algorithm. IEEE Trans. Internet Technol. 47 (2001), 498--519.
[29]
P. F. Lazarsfeld and R. K. Merton. 1954. Friendship as a social process: A substantive and methodological analysis. Freedom and Control in Modern Society,Van Nostrand, New York (1954), 8--66.
[30]
Jure Leskovec and Eric Horvitz. 2008. Planetary-scale views on a large instant-messaging network. In WWW’08. ACM, 915--924.
[31]
Quim Llimona, Jordi Luque, Xavier Anguera, Zoraida Hidalgo, Souneil Park, and Nuria Oliver. 2015. Effect of gender and call duration on customer satisfaction in call center big data. In INTERSPEECH’15.
[32]
Tiancheng Lou, Jie Tang, John Hopcroft, Zhanpeng Fang, and Xiaowen Ding. 2013. Learning to predict reciprocity and triadic closure in social networks. ACM Trans. Knowl. Discov. Data 7, 2 (2013), 5:1--5:25.
[33]
Peter V. Marsden. 1987. Core discussion networks of americans. Amer. Sociol. Rev. (1987), 122--131.
[34]
Manfred Max-Neef, Antonio Elizalde, and Martin Hopenhayn. 1992. Development and human needs. Real-life Economics: Understanding Wealth Creation (1992), 197--213.
[35]
M. Mead. 1970. Culture and Commitment: A Study of the Generation Gap. Natural History Press.
[36]
L. Meng, Y. Hulovatyy, A. Striegel, and T. Milenković. 2016. On the interplay between individuals’ evolving interaction patterns and traits in dynamic multiplex social networks. IEEE Trans. Netw. Sci. Eng. 3, 1 (2016), 32--43.
[37]
Matthew Michelson and Sofus A. Macskassy. 2011. What blogs tell us about websites: A demographics study. In WSDM’11. ACM, 365--374.
[38]
Giovanna Miritello, Rubén Lara, Manuel Cebrian, and Esteban Moro. 2013. Limited communication capacity unveils strategies for human interaction. Sci. Rep. 3 (2013).
[39]
Alan Mislove, Sune Lehmann, Yong-Yeol Ahn, Jukka-Pekka Onnela, and J. Niels Rosenquist. 2011. Understanding the demographics of twitter users. In ICWSM’11.
[40]
Kaixiang Mo, Ben Tan, Erheng Zhong, and Qiang Yang. 2012. Your phone understands you. In Nokia MDC’12.
[41]
Kevin P. Murphy, Yair Weiss, and Michael I. Jordan. 1999. Loopy belief propagation for approximate inference: An empirical study. In UAI’99. 467--475.
[42]
J. P. Onnela, J. Saramäki, J. Hyvönen, G. Szabó, D. Lazer, K. Kaski, J. Kertész, and A.-L. Barabási. 2007. Structure and tie strengths in mobile communication networks. Proc. Natl. Acad. Sci. U.S.A. (2007).
[43]
Vasyl Palchykov, Kimmo Kaski, János Kertész, Albert-László Barabási, and Robin I. M. Dunbar. 2012. Sex differences in intimate relationships. Sci. Rep. 2:370 (2012).
[44]
Stephen W. Raudenbush and Anthony S. Bryk. 2002. Hierarchical Linear Models: Applications and Data Analysis Methods. Vol. 1. Sage.
[45]
Jari Saramaki and Esteban Moro. 2015. From seconds to months: multi-scale dynamics of mobile telephone calls. arXiv:1504.01479 (2015).
[46]
Mukund Seshadri, Sridhar Machiraju, Ashwin Sridharan, Jean Bolot, Christos Faloutsos, and Jure Leskovec. 2008. Mobile call graphs: Beyond power-law and lognormal distributions. In KDD’08. ACM, 596--604.
[47]
Xiaolin Shi, Lada A. Adamic, and Martin J. Strauss. 2007. Networks of strong ties. Physica A: Stat. Mech. Appl. 378, 1 (2007), 33--47.
[48]
Bai-En Shie, S. Yu Philip, and Vincent S. Tseng. 2013. Mining interesting user behavior patterns in mobile commerce environments. Appl. Intell. 38, 3 (2013), 418--435.
[49]
Zbigniew Smoreda and Christian Licoppe. 2000. Gender-specific use of the domestic telephone. Soc. Psych. Quart. 63, 3 (2000), 238--252.
[50]
Richard C. Sprinthall. 2011. Basic Statistical Analysis. Pearson.
[51]
Arkadiusz Stopczynski, Vedran Sekara, Piotr Sapiezynski, Andrea Cuttone, Jakob Eg Larsen, and Sune Lehmann. 2014. Measuring large-scale social networks with high resolution. PLOS One 9, 4 (2014), e95978.
[52]
Michael Szell and Stefan Thurner. 2013. How women organize social networks different from men. Sci. Rep. 3 (July 2013).
[53]
Jie Tang, Tiancheng Lou, Jon Kleinberg, and Sen Wu. 2016. Transfer learning to infer social ties across heterogeneous networks. ACM Trans. Inf. Syst. 34, 2, Article 7 (April 2016).
[54]
Jie Tang, Sen Wu, and Jimeng Sun. 2013. Confluence: Conformity influence in large social networks. In KDD’13. ACM, 347--355.
[55]
Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. 2008. ArnetMiner: Extraction and mining of academic social networks. In KDD’08. 990--998.
[56]
Dashun Wang, Dino Pedreschi, Chaoming Song, Fosca Giannotti, and Albert-Laszlo Barabasi. 2011. Human mobility, social ties, and link prediction. In KDD’11. ACM, 1100--1108.
[57]
Josh Ying, Yao-Jen Chang, Chi-Min Huang, and Vincent S. Tseng. 2012. Demographic prediction based on user’s mobile behaviors. In Nokia MDC’12.
[58]
Yuchen Zhao, Guan Wang, Philip S. Yu, Shaobo Liu, and Simon Zhang. 2013. Inferring social roles and statuses in social networks. In KDD’13. 695--703.
[59]
Yu Zheng. 2015. Trajectory data mining: An overview. ACM Trans. Intell. Syst. Technol. (TIST) 6, 3 (2015), 29.

Cited By

View all
  • (2024)Social demographics imputation based on similarity in multi-dimensional activity-travel pattern: A two-step approachTravel Behaviour and Society10.1016/j.tbs.2024.10084337(100843)Online publication date: Oct-2024
  • (2023)Malicious Activities Prediction Over Online Social Networking Using Ensemble ModelIntelligent Automation & Soft Computing10.32604/iasc.2023.02865036:1(461-479)Online publication date: 2023
  • (2023)Examining User Heterogeneity in Digital ExperimentsACM Transactions on Information Systems10.1145/357893141:4(1-34)Online publication date: 12-Jan-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Information Systems
ACM Transactions on Information Systems  Volume 35, Issue 4
Special issue: Search, Mining and their Applications on Mobile Devices
October 2017
461 pages
ISSN:1046-8188
EISSN:1558-2868
DOI:10.1145/3112649
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 July 2017
Accepted: 01 December 2016
Revised: 01 October 2016
Received: 01 June 2016
Published in TOIS Volume 35, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Gender and age
  2. computational social science
  3. demographic prediction
  4. ego networks
  5. mobile communication
  6. mobile phone data
  7. node attributes
  8. social tie and triad

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)107
  • Downloads (Last 6 weeks)25
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Social demographics imputation based on similarity in multi-dimensional activity-travel pattern: A two-step approachTravel Behaviour and Society10.1016/j.tbs.2024.10084337(100843)Online publication date: Oct-2024
  • (2023)Malicious Activities Prediction Over Online Social Networking Using Ensemble ModelIntelligent Automation & Soft Computing10.32604/iasc.2023.02865036:1(461-479)Online publication date: 2023
  • (2023)Examining User Heterogeneity in Digital ExperimentsACM Transactions on Information Systems10.1145/357893141:4(1-34)Online publication date: 12-Jan-2023
  • (2023)Item Recommendation on Shared Accounts Through User IdentificationSocial Media Processing10.1007/978-981-99-7596-9_5(63-76)Online publication date: 15-Nov-2023
  • (2022)HyperSoRec: Exploiting Hyperbolic User and Item Representations with Multiple Aspects for Social-aware RecommendationACM Transactions on Information Systems10.1145/346391340:2(1-28)Online publication date: 30-Apr-2022
  • (2022)Intelligent Geodemographic Clustering Based on Neural Network and Particle Swarm OptimizationIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2021.307235752:6(3746-3756)Online publication date: Jun-2022
  • (2022)Spatiotemporal Analysis of Mobile Phone Network Based on Self-Organizing Feature MapIEEE Internet of Things Journal10.1109/JIOT.2021.31272039:13(10948-10960)Online publication date: 1-Jul-2022
  • (2021) MyrrorBot: A Digital Assistant Based on Holistic User Models for Personalized Access to Online ServicesACM Transactions on Information Systems10.1145/344767939:4(1-34)Online publication date: 16-Aug-2021
  • (2021)Secure fine-grained friend-making scheme based on hierarchical management in mobile social networksInformation Sciences10.1016/j.ins.2020.12.012554(15-32)Online publication date: Apr-2021
  • (2020)Attributed Collaboration Network Embedding for Academic Relationship MiningACM Transactions on the Web10.1145/340973615:1(1-20)Online publication date: 24-Nov-2020
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media