Abstract
Personality analysis has been widely used in various social services such as mental healthcare, recommendation systems and so on because its natural explainability for AI applications in Web intelligence. With the penetration of Web2.0, traditional social researches have gradually turned to online social networks. However, for a long time, personality detection from online social texts has sunk into an embarrassing situation for the lack of large labeled datasets. Limited by supervised learning frameworks and small labeled datasets, prior works mainly detect one’s personality in the individual perspective, which may not well meet the challenges of massive un-labeled data in the near future. In this paper, we present a first look into group-level personality detection and we use an unsupervised feature learning method instead of supervised methods used in most related works. We propose AdaWalk, a new and novel model of group-level personality detection by learning the influence from text generated networks. The model uses different kernels to evaluate how much a given node should decide its walk path locally or globally. The advantage of AdaWalk is three-folded: a) the model is an unsupervised feature learning method, which means it relies less on annotations. b) by traversing the network, we can capture the influence in the group level, thus the analysis of one’s personality is not only based on individual records but also the information in groups. Therefore, AdaWalk can leverage small datasets more comprehensively. c) AdaWalk is scalable and can be easily transformed as distributed algorithms, which means it has more potential, compared with existing personality detection methods, to meet the massive data without annotations. We use AdaWalk to predict users’ Big Five personality scores in FIVE heterogeneous personality datasets. Compared with more than TEN famous related methods, AdaWalk outperforms the others, meanwhile verifying the significance of the group perspective and unsupervised feature learning methods in the application of personality analysis. To make our experiment repeatable, AdaWalk and related datasets are available at https://xiangguosun.strikingly.com.
Similar content being viewed by others
Notes
A taxonomy for personality traits. It describe one’s personality from five factors: Openness, Conscientiousness, Extraversion, Agreeableness and Neuroticism (or Stability).
The source code can be accessed only for the academic purpose. https://xiangguosun.strikingly.com
An open-source toolkit for Network Embedding. https://github.com/thunlp/OpenNE
References
Adelstein, J.S., Shehzad, Z., Mennes, M., DeYoung, C.G, Zuo, X.-N., Kelly, C., Margulies, D.S., Bloomfield, A., Gray, J.R., Castellanos, F.X., et al.: Personality is reflected in the brain’s intrinsic functional architecture. PloS one 6 (11), e27633 (2011)
Ahmed, A., Shervashidze, N., Narayanamurthy, S., Josifovski, V., Smola, A.J.: Distributed large-scale natural graph factorization. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 37–48. ACM (2013)
Amichai-Hamburger, Y., Vinitzky, G.: Social network use and personality. Comput. Hum. Behav. 26(6), 1289–1295 (2010)
Andrew Schwartz, H., Eichstaedt, J.C., Kern, M.L, Dziurzynski, L., Ramones, S.M., Agrawal, M., Shah, A., Kosinski, M., Stillwell, D., Seligman, M.E.P., et al.: Personality, gender, and age in the language of social media: the open-vocabulary approach. PloS one 8(9), e73791 (2013)
Bachrach, Y., Kosinski, M., Graepel, T., Kohli, P., Stillwell, D.: Personality and patterns of facebook usage. In: Proceedings of the 4th Annual ACM Web Science Conference, pp. 24–32. ACM (2012)
Bai, S., Gao, R., Zhu, T.: Determining personality traits from renren status usage behavior. In: Computational Visual Media, pp. 226–233. Springer (2012)
Barrick, M.R., Mount, M.K.: The big five personality dimensions and job performance: a meta-analysis. Person. Psychol. 44(1), 1–26 (1991)
Benet-Martinez, V., John, O.P: Los cinco grandes across cultures and ethnic groups: multitrait-multimethod analyses of the big five in Spanish and English. J. Person. Soc. Psychol. 75(3), 729 (1998)
Biel, J.-I., Tsiminaki, V., Dines, J., Gatica-Perez, D.: Hi youtube!: personality impressions and verbal content in social video. In: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, pp. 119–126. ACM (2013)
Cavallari, S., Zheng, V.W., Cai, H., Chang, K.C.-C., Cambria, E.: Learning community embedding with community detection and node embedding on graphs. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 377–386. ACM (2017)
Celli, F., Pianesi, F., Stillwell, D., Kosinski, M.: Workshop on computational personality recognition (shared task). In: Proceedings of the Workshop on Computational Personality Recognition (2013)
Chen, C-M, Chien, P-C, Lin, Y-C, Tsai, M-F, Yang, Y-H: Exploiting latent social listening representations for music recommendations. In: Proceedings of Ninth ACM International Conf. Recommender Syst. Poster (2015)
Costa, P.T., McCrae, R.R.: The revised neo personality inventory (neo-pi-r). The SAGE Handb. Person. Theory Assess. 2, 179–198 (2008)
Deyoung, C.G.: Toward a theory of the big five. Psychol. Inq. 21(1), 26–33 (2010)
Grover, A., Leskovec, J.: node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD international Conference on Knowledge Discovery and Data Mining, pp. 855–864. ACM (2016)
Guntuku, S.C., Zhou, J.T., Roy, S., Lin, W., Tsang, I.W.: ‘Who likes what and, why?’ insights into modeling users’ personality based on image ‘likes’. IEEE Trans. Affect. Comput. 9, 130–143 (2018)
John, O.P., Donahue, E.M., Kentle, R.L.: The big five inventory: versions 4a and 54, Institute of personality and social research. University of California, Berkeley (1991)
John, O.P., Naumann, L.P., Soto, C.J.: Paradigm shift to the integrative big five trait taxonomy. Handb. Person Theory Res. 3, 114–158 (2008)
Kampman, O., Barezi, E.J., Bertero, D., Fung, P.: Investigating audio, video, and text fusion methods for end-to-end automatic personality prediction. In: ACL (2), pp. 606–611. Association for Computational Linguistics (2018)
Kosinski, M., Stillwell, D., Graepel, T.: Private traits and attributes are predictable from digital records of human behavior. Proc. Natl. Acad. Sci. 110(15), 5802–5805 (2013)
Lambiotte, R., Kosinski, M.: Tracking the digital footprints of personality. Proc. IEEE 102(12), 1934–1939 (2014)
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: International Conference on Machine Learning, pp. 1188–1196 (2014)
Li, A., Zhang, F., Zhu, T.: Web use behaviors for identifying mental health status. In: International Conference on Brain and Health Informatics, pp. 348–358. Springer (2013)
Liu, X., Zhu, T.: Deep learning for constructing microblog behavior representation to identify social media user’s personality. PeerJ Comput. Sci. e81, 2 (2016)
Mairesse, F., Walker, M.A., Mehl, M.R., Moore, R.K.: Using linguistic cues for the automatic recognition of personality in conversation and text. J. Artif. Intell. Res. 30, 457–500 (2007)
Majumder, N., Poria, S., Gelbukh, A., Cambria, E.: Deep learning-based document modeling for personality detection from text. IEEE Intell. Syst. 32(2), 74–79 (2017)
McCallum, A.K, Nigam, K., Rennie, J., Seymore, K.: Automating the construction of internet portals with machine learning. Inf. Retr. 3(2), 127–163 (2000)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Nov, O., Ye, C.: Personality and technology acceptance: personal innovativeness in it, openness and resistance to change. In: Hawaii International Conference on System Sciences, Proceedings of the 41st Annual, pp. 448–448. IEEE (2008)
Ou, M., Cui, P., Pei, J., Zhang, Z., Zhu, W.: Asymmetric transitivity preserving graph embedding. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1105–1114. ACM (2016)
Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: bringing order to the Web. Technical report, Stanford InfoLab (1999)
Park, G., Andrew Schwartz, H., Eichstaedt, J.C, Kern, M.L., Kosinski, M., Stillwell, D., Ungar, L.H., Seligman, M.E.P.: Automatic personality assessment through social media language. J. Pers. Soc. Psychol. 108(6), 934 (2015)
Pennebaker, J.W., King, L.A.: Linguistic styles: language use as an individual difference. J. Person. Soc. Psychol. 77(6), 1296 (1999)
Pennebaker, J.W, Francis, M.E, Booth, R.J: Linguistic inquiry and word count: Liwc 2001. Mahway: Lawrence Erlbaum Assoc. 71(2001), 2001 (2001)
Perozzi, B., Al-Rfou, R., Skiena, S.: Deepwalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 701–710. ACM (2014)
Qian, Q., Huang, M., Zhao, H., Xu, J., Zhu, X.: Assigning personality/profile to a chatting machine for coherent conversation generation. In: IJCAI, pp. 4279–4285 (2018)
Qiu, L., Lin, H., Ramsay, J., Yang, F.: You are what you tweet: personality expression and perception on twitter. J. Res. Pers. 46(6), 710–718 (2012)
Qiu, L., Lu, J., Ramsay, J., Yang, S., Qu, W., Zhu, T.: Personality expression in Chinese language use. International Journal of Psychology (2016)
Rahwan, I., Cebrian, M., Obradovich, N., Bongard, J., Bonnefon, J.-F., Breazeal, C., Crandall, J.W., Christakis, N.A., Couzin, I.D., Jackson, M.O., et al.: Machine behaviour. Nature 568(7753), 477 (2019)
Rai, T.S.: High replicability in personality psychology. Science 364(6438), 348–348 (2019)
Rajaraman, A., Ullman, J.D.: Mining of massive datasets. Cambridge University Press, Cambridge (2011)
Rangel, F., Celli, F., Rosso, P., Potthast, M., Stein, B., Walter, D.: Overview of the 3rd author profiling task at PAN 2015. In: Cappellato, L., Ferro, N., Jones, G., San Juan, E. (eds.) CLEF Evaluation Labs and Workshop – Working Notes Papers, 8-11 September, Toulouse, France. CEUR-WS.org (2015)
Selfhout, M., Burk, W., Branje, S., Denissen, J., Van Aken, M., Meeus, W.: Emerging late adolescent friendship networks and big five personality traits: a social network approach. J. Person. 78(2), 509–538 (2010)
Sen, P., Namata, G., Bilgic, M., Getoor, L., Galligher, B., Eliassi-Rad, T.: Collective classification in network data. AI Mag. 29(3), 93 (2008)
Sun, G., Shen, J.: Facilitating social collaboration in mobile cloud-based learning: a teamwork as a service (taas) approach. IEEE Trans. Learn. Technol. 7(3), 207–220 (2014)
Sun, X., Liu, B., Cao, J., Luo, J., Shen, X.: Who am I? personality detection based on deep learning for texts. In: ICC, pp. 1–6. IEEE (2018)
Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: Line: Large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web, pp. 1067–1077. International World Wide Web Conferences Steering Committee (2015)
Wang, H., Zhang, F., Hou, M., Xie, X., Guo, M., Qi, L.: Shine: signed heterogeneous information network embedding for sentiment link prediction. In: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, pp. 592–600. ACM (2018)
Wei, H., Zhang, F., Yuan, N.J., Cao, C., Fu, H., Xie, X., Rui, Y., Ma, W.-Y.: Beyond the words predicting user personality from heterogeneous information. In: Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, pp. 305–314. ACM (2017)
Wu, Y., Kosinski, M., Stillwell, D.: Computer-based personality judgments are more accurate than those made by humans. Proc. Natl. Acad. Sci. 112(4), 1036–1040 (2015)
Zafarani, R., Liu, H.: Social computing data repository at asu (2009)
Zhao, S., Ding, G., Han, J., Gao, Y.: Personality-aware personalized emotion recognition from physiological signals. In: IJCAI, pp. 1660–1667 (2018)
Zheng, S., Qi, M., Wang, T., Chen, W., Yu, N., Ma, Z-M, Liu, T-Y: Asynchronous stochastic gradient descent with delay compensation. In: International Conference on Machine Learning, pp. 4120–4129 (2017)
Zibrek, K., Kokkinara, E., McDonnell, R.: The effect of realistic appearance of virtual characters in immersive environments - does the character’s personality play a role?. IEEE Trans. Vis. Comput. Graph. 24(4), 1681–1690 (2018)
Acknowledgments
This work is supported by National Key R&D Program of China 2017YFB1003000, National Natural Science Foundation of China under Grants No. 61972087, No. 61772133, No. 61472081, No. 61402104. Jiangsu Provincial Key Project BE2018706. Key Laboratory of Computer Network Technology of Jiangsu Province. Jiangsu Provincial Key Laboratory of Network and Information Security under Grants No. BM2003201, and Key Laboratory of Computer Network and Information Integration of Ministry of Education of China under Grants No. 93K-9.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article belongs to the Topical Collection: Computational Social Science as the Ultimate Web Intelligence
Guest Editors: Xiaohui Tao, Juan D. Velasquez, Jiming Liu, and Ning Zhong
Rights and permissions
About this article
Cite this article
Sun, X., Liu, B., Meng, Q. et al. Group-level personality detection based on text generated networks. World Wide Web 23, 1887–1906 (2020). https://doi.org/10.1007/s11280-019-00729-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-019-00729-2