Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3038912.3052669acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
research-article

Modeling the Dynamics of Learning Activity on the Web

Published: 03 April 2017 Publication History
  • Get Citation Alerts
  • Abstract

    People are increasingly relying on social media and the Web to find solutions to their problems in a wide range of domains. In this setting, closely related problems often lead to the same characteristic learning pattern --- people sharing a similar problem visit closely related pieces of information, perform almost identical queries or, more generally, take a series of similar actions at a similar pace. In this paper, we introduce a novel modeling framework for clustering continuous-time grouped streaming data, the Hierarchical Dirichlet Hawkes process (HDHP), which allows us to automatically uncover a wide variety of learning patterns from detailed traces of learning activity. Our model allows for efficient inference, scaling to millions of actions and thousands of users. Experiments on real data from Stack Overflow reveal that our framework recovers meaningful learning patterns, accurately tracks users' interests and goals over time and achieves better predictive performance than the state of the art.

    References

    [1]
    O. Aalen, O. Borgan, and H. Gjessing. Survival and event history analysis: a process point of view. Springer Science & Business Media, 2008.
    [2]
    A. Ahmed, Q. Ho, C. H. Teo, J. Eisenstein, A. J. Smola, and E. P. Xing. Online Inference for the Infinite Topic-Cluster Model: Storylines from Streaming Text. In AISTATS, 2011.
    [3]
    A. Ahmed and E. Xing. Dynamic Non-Parametric Mixture Models and The Recurrent Chinese Restaurant Process with Applications to Evolutionary Clustering. In SDM, 2008.
    [4]
    A.-L. Barabási. The origin of bursts and heavy tails in human dynamics. Nature, 2005.
    [5]
    D. M. Blei and P. I. Frazier. Distance dependent chinese restaurant processes. The Journal of Machine Learning Research, 2011.
    [6]
    D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. The Journal of Machine Learning Research, 2003.
    [7]
    O. Cappé, S. J. Godsill, and E. Moulines. An overview of existing methods and recent advances in sequential Monte Carlo. Proceedings of the IEEE, 2007.
    [8]
    C. Carvalho, M. S. Johannes, H. F. Lopes, and N. Polson. Particle learning and smoothing. Statistical Science, 2010.
    [9]
    D. J. Daley and D. Vere-Jones. An introduction to the theory of point processes. Vol. II. Probability and its Applications (New York). Springer, second edition, 2008.
    [10]
    N. Du, M. Farajtabar, A. Ahmed, A. J. Smola, and L. Song. Dirichlet-hawkes processes with applications to clustering continuous-time document streams. In ACM SIGKDD, 2015.
    [11]
    M. Farajtabar, N. Du, M. Gomez-Rodriguez, I. Valera, L. Song, and H. Zha. Shaping social activity by incentivizing users. In NIPS, 2014.
    [12]
    T. S. Ferguson. A bayesian analysis of some nonparametric problems. The Annals of Statistics, 1973.
    [13]
    A. G. Hawkes. Spectra of some self-exciting and mutually exciting point processes. Biometrika, 1971.
    [14]
    T. Iwata, A. Shah, and Z. Ghahramani. Discovering latent influence in online social activities via shared cascade poisson processes. In ACM SIGKDD, 2013.
    [15]
    J. Sethuraman. A constructive definition of Dirichlet priors. Statistica Sinica, 1994.
    [16]
    A. Smith, A. Doucet, N. de Freitas, and N. Gordon. Sequential Monte Carlo methods in practice. Springer Science & Business Media, 2013.
    [17]
    M. A. Stephens. Goodness of fit with special reference to tests for exponentiality. Stanford University, 1978.
    [18]
    Y. W. Teh and M. I. Jordan. Hierarchical Bayesian nonparametric models with applications. In Bayesian Nonparametrics: Principles and Practice. Cambridge University Press, 2010.
    [19]
    Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei. Hierarchical Dirichlet processes. Journal of the American Statistical Association, 2006.
    [20]
    I. Valera and M. Gomez-Rodriguez. Modeling adoption and usage of competing products. In ICDM, 2015.
    [21]
    Q. Zhao, M. A. Erdogdu, H. Y. He, A. Rajaraman, and J. Leskovec. Seismic: A self-exciting point process model for predicting tweet popularity. In ACM SIGKDD, 2015.
    [22]
    K. Zhou, H. Zha, and L. Song. Learning triggering kernels for multi-dimensional hawkes processes. In ICML, 2013.

    Cited By

    View all
    • (2023)Hawkes Processes With Stochastic Exogenous Effects for Continuous-Time Interaction ModellingIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.316164945:2(1848-1861)Online publication date: 1-Feb-2023
    • (2023)Dirichlet-Survival Process: Scalable Inference of Topic-Dependent Diffusion NetworksAdvances in Information Retrieval10.1007/978-3-031-28238-6_47(562-570)Online publication date: 17-Mar-2023
    • (2023)Multivariate Powered Dirichlet-Hawkes ProcessAdvances in Information Retrieval10.1007/978-3-031-28238-6_4(47-61)Online publication date: 17-Mar-2023
    • Show More Cited By

    Index Terms

    1. Modeling the Dynamics of Learning Activity on the Web

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      WWW '17: Proceedings of the 26th International Conference on World Wide Web
      April 2017
      1678 pages
      ISBN:9781450349130

      Sponsors

      • IW3C2: International World Wide Web Conference Committee

      In-Cooperation

      Publisher

      International World Wide Web Conferences Steering Committee

      Republic and Canton of Geneva, Switzerland

      Publication History

      Published: 03 April 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. continuous-time data clustering
      2. learning activity modeling
      3. user interest tracking

      Qualifiers

      • Research-article

      Conference

      WWW '17
      Sponsor:
      • IW3C2

      Acceptance Rates

      WWW '17 Paper Acceptance Rate 164 of 966 submissions, 17%;
      Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)7
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 12 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Hawkes Processes With Stochastic Exogenous Effects for Continuous-Time Interaction ModellingIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.316164945:2(1848-1861)Online publication date: 1-Feb-2023
      • (2023)Dirichlet-Survival Process: Scalable Inference of Topic-Dependent Diffusion NetworksAdvances in Information Retrieval10.1007/978-3-031-28238-6_47(562-570)Online publication date: 17-Mar-2023
      • (2023)Multivariate Powered Dirichlet-Hawkes ProcessAdvances in Information Retrieval10.1007/978-3-031-28238-6_4(47-61)Online publication date: 17-Mar-2023
      • (2023)Properties of Reddit News Topical InteractionsComplex Networks and Their Applications XI10.1007/978-3-031-21127-0_2(16-28)Online publication date: 4-Jan-2023
      • (2022)Online neural sequence detection with hierarchical dirichlet point processProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3600752(6654-6665)Online publication date: 28-Nov-2022
      • (2022)Inference, Prediction, & Entropy-Rate Estimation of Continuous-Time, Discrete-Event ProcessesEntropy10.3390/e2411167524:11(1675)Online publication date: 17-Nov-2022
      • (2022)Mining Reaction and Diffusion Dynamics in Social ActivitiesProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557396(1521-1531)Online publication date: 17-Oct-2022
      • (2022)Interactions in Information SpreadCompanion Proceedings of the Web Conference 202210.1145/3487553.3524190(313-317)Online publication date: 25-Apr-2022
      • (2022)Powered Dirichlet–Hawkes process: challenging textual clustering using a flexible temporal priorKnowledge and Information Systems10.1007/s10115-022-01731-364:11(2921-2944)Online publication date: 1-Aug-2022
      • (2021)Using Dirichlet Marked Hawkes Processes for Insider Threat DetectionDigital Threats: Research and Practice10.1145/34579083:1(1-19)Online publication date: 22-Oct-2021
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media