Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2809936.2809937acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Community Topic Usage in Social Networks

Published: 18 October 2015 Publication History

Abstract

When studying large social media data sets, it is useful to reduce the dimensionality of both the network (e.g. by finding communities) and user-generated data such as text (e.g. using topic models). Algorithms exist for both these tasks, however their combination has received little attention and proposed models to date are not scalable (e.g.: [4]). One approach to such combined modelling is to perform community and topic modelling independently and later combine the results. In the case of overlapping communities, this combination requires a method for attributing each users topic usage to the communities in which she participates. This paper presents a Bayesian model for attributing individual documents to communities which balances the users proportional community membership with community topic coherence. Community topic usage is modelled with a Dirichlet distribution with fixed concentration parameter, leading to a well defined conjugate prior. Thought the prior is computationally expensive, the already reduced dimensionality in both topics and communities make a tractable algorithm feasible, even for large data sets. The model is applied to a corpus of tweets and twitter follower relations collected on hash tags used by people with eating disorders [14].

References

[1]
E. M. Airoldi, D. M. Blei, S. E. Fienberg, and E. P. Xing. Mixed membership stochastic blockmodels. In D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, editors, Advances in Neural Information Processing Systems 21, pages 33--40. Curran Associates, Inc., 2009.
[2]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. The Journal of Machine Learning Research, 3:993--1022, 2003.
[3]
A. Bruns and J. E. Burgess. The use of twitter hashtags in the formation of ad hoc publics. In Proceedings of the 6th European Consortium for Political Research (ECPR) General Conference 2011, University of Iceland, Reykjavik, 2011.
[4]
D. Duan, Y. Li, R. Li, Z. Lu, and A. Wen. Mei: mutual enhanced in finite generative model for simultaneous community and topic detection. In Discovery Science, pages 91--106. Springer, 2011.
[5]
P. K. Gopalan and D. M. Blei. Efficient discovery of overlapping communities in massive networks. Proceedings of the National Academy of Sciences, 110(36):14534--14539, 2013.
[6]
T. L. Griffiths and M. Steyvers. Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America, 101(Suppl 1):5228--5235, 2004.
[7]
I. Himelboim. Mapping twitter topic networks: From polarized crowds to community clusters. Blog post, Pew Research Center's Internet & American Life Project, 2014.
[8]
B. A. Huberman, D. M. Romero, and F. Wu. Social networks that matter: Twitter under the microscope. SSRN Scholarly Paper ID 1313405, Social Science Research Network, Rochester, NY, 2008.
[9]
A. Java, X. Song, T. Finin, and B. Tseng. Why we twitter: Understanding microblogging usage and communities. In Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis, WebKDD/SNA-KDD '07, pages 56--65, New York, NY, USA, 2007. ACM.
[10]
D. Li, Y. Ding, X. Shuai, J. Bollen, J. Tang, S. Chen, J. Zhu, and G. Rocha. Adding community and dynamic to topic models. Journal of Informetrics, 6(2):237--253, 2012.
[11]
D. Li, B. He, Y. Ding, J. Tang, C. Sugimoto, Z. Qin, E. Yan, J. Li, and T. Dong. Community-based topic modeling for social tagging. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, CIKM '10, pages 1565--1568,New York, NY, USA, 2010. ACM.
[12]
M. J. Paul and M. Dredze. You are what you tweet: Analyzing twitter for public health. In ICWSM, 2011.
[13]
S. Singer and J. Nelder. Nelder-mead algorithm. Scholarpedia, 4(7):2928, 2009.
[14]
I. Wood. A case study of collecting dynamic social data: The pro-ana twitter community. Australian Journal of Intelligent Information Processing Systems, 14(3), 2015.
[15]
I. Wood. Using topic models to measure social psychological characteristics in online social media. In Social Computing, Behavioral-Cultural Modeling, and Prediction, volume 9021 of Lecture Notes in Computer Science, pages 308--313, Washington, DC, USA, 2015. Springer International Publishing.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
TM '15: Proceedings of the 2015 Workshop on Topic Models: Post-Processing and Applications
October 2015
74 pages
ISBN:9781450337847
DOI:10.1145/2809936
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. author community membership
  2. bayesian inference
  3. community detection
  4. conjugate prior
  5. dirichlet distribution
  6. topic models

Qualifiers

  • Research-article

Funding Sources

Conference

CIKM'15
Sponsor:

Acceptance Rates

TM '15 Paper Acceptance Rate 8 of 12 submissions, 67%;
Overall Acceptance Rate 8 of 12 submissions, 67%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 139
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media