DOI: 10.1145/1081870.1081925

Modeling and predicting personal information dissemination behavior

Published: 21 August 2005

Abstract

In this paper, we propose a new way to automatically model and predict how people receive and disseminate information by analyzing the contacts and content of their personal communications. A personal profile, called CommunityNet, is established for each individual using a novel algorithm that incorporates contact, content, and time information simultaneously. It can be used for personal social capital management. Clusters of CommunityNets provide a view of informal networks for organization management. Our algorithm combines dynamic algorithms from the social network field with semantic content classification methods from the natural language processing and machine learning literature. We tested CommunityNets on the Enron email corpus and report experimental results covering filtering, prediction, and recommendation capabilities. We show that personal behavior and intention are somewhat predictable based on these models. For instance, "to whom a person is going to send a specific email" can be predicted from that person's personal social network and content analysis. Experimental results show that the prediction accuracy of the proposed adaptive algorithm is 58% better than that of social network-based predictions and 75% better than that of an aggregated model based on Latent Dirichlet Allocation with social network enhancement. We also discuss two online demo systems that we developed to allow interactive exploration of CommunityNet.
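
To make the recipient-prediction idea concrete, the following is a minimal illustrative sketch, not the CommunityNet algorithm itself, which the abstract does not specify. It assumes a toy scoring rule that mixes how often a sender has written to each contact with a bag-of-words cosine similarity between the new message and past messages to that contact. The function names, the `alpha` weight, and the sample messages are all hypothetical, and time information is ignored entirely.

```python
from collections import Counter, defaultdict
import math


def tokenize(text):
    # Lowercase, whitespace-split, keep purely alphabetic tokens.
    return [w for w in text.lower().split() if w.isalpha()]


def cosine(a, b):
    # Cosine similarity between two word-count vectors.
    dot = sum(a[w] * b[w] for w in a if w in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def rank_recipients(history, new_text, alpha=0.5):
    """Rank past recipients for a new message.

    history: list of (recipient, text) pairs sent by one person.
    alpha:   hypothetical weight; 1.0 uses contact frequency only,
             0.0 uses content similarity only.
    """
    contact = Counter()
    content = defaultdict(Counter)
    for recipient, text in history:
        contact[recipient] += 1
        content[recipient].update(tokenize(text))
    total = sum(contact.values())
    query = Counter(tokenize(new_text))
    scores = {
        r: alpha * contact[r] / total + (1 - alpha) * cosine(query, content[r])
        for r in contact
    }
    return sorted(scores, key=scores.get, reverse=True)


# Made-up sample data for illustration only.
history = [
    ("alice", "gas pipeline contract draft"),
    ("alice", "pipeline pricing update"),
    ("alice", "contract review meeting"),
    ("bob", "quarterly budget forecast"),
]

print(rank_recipients(history, "budget forecast revision"))
print(rank_recipients(history, "budget forecast revision", alpha=1.0))
```

In this toy data, contact frequency alone favors the most frequent contact, while adding the content term lets the less frequent contact rank first when the message matches that contact's past topics. This is the kind of gap between network-only and combined predictions that the abstract reports, though the numbers there come from the authors' own algorithm and the Enron corpus, not from this sketch.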

    Published In

    KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
    August 2005
    844 pages
    ISBN: 159593135X
    DOI: 10.1145/1081870

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. information dissemination
    2. personal information management
    3. user behavior modeling
