Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1143844.1143859acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Dynamic topic models

Published: 25 June 2006 Publication History
  • Get Citation Alerts
  • Abstract

    A family of probabilistic time series models is developed to analyze the time evolution of topics in large document collections. The approach is to use state space models on the natural parameters of the multinomial distributions that represent the topics. Variational approximations based on Kalman filters and nonparametric wavelet regression are developed to carry out approximate posterior inference over the latent topics. In addition to giving quantitative, predictive models of a sequential corpus, dynamic topic models provide a qualitative window into the contents of a large document collection. The models are demonstrated by analyzing the OCR'ed archives of the journal Science from 1880 through 2000.

    References

    [1]
    Aitchison, J. (1982). The statistical analysis of compositional data. Journal of the Royal Statistical Society, Series B, 44(2):139--177.]]
    [2]
    Blei, D., Ng, A., and Jordan, M. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3:993--1022.]]
    [3]
    Blei, D. M. and Lafferty, J. D. (2006). Correlated topic models. In Weiss, Y., Schölkopf, B., and Platt, J., editors, Advances in Neural Information Processing Systems 18. MIT Press, Cambridge, MA.]]
    [4]
    Buntine, W. and Jakulin, A. (2004). Applying discrete PCA in data analysis. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, pages 59--66. AUAI Press.]]
    [5]
    Erosheva, E. (2002). Grade of membership and latent structure models with application to disability survey data. PhD thesis, Carnegie Mellon University, Department of Statistics.]]
    [6]
    Fei-Fei, L. and Perona, P. (2005). A Bayesian hierarchical model for learning natural scene categories. IEEE Computer Vision and Pattern Recognition.]]
    [7]
    Griffiths, T. and Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Science, 101:5228--5235.]]
    [8]
    Kalman, R. (1960). A new approach to linear filtering and prediction problems. Transaction of the AMSE: Journal of Basic Engineering, 82:35--45.]]
    [9]
    McCallum, A., Corrada-Emmanuel, A., and Wang, X. (2004). The author-recipient-topic model for topic and role discovery in social networks: Experiments with Enron and academic email. Technical report, University of Massachusetts, Amherst.]]
    [10]
    Pritchard, J., Stephens, M., and Donnelly, P. (2000). Inference of population structure using multilocus genotype data. Genetics, 155:945--959.]]
    [11]
    Rosen-Zvi, M., Griffiths, T., Steyvers, M., and Smith, P. (2004). The author-topic model for authors and documents. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, pages 487--494. AUAI Press.]]
    [12]
    Sivic, J., Rusell, B., Efros, A., Zisserman, A., and Freeman, W. (2005). Discovering objects and their location in images. In International Conference on Computer Vision (ICCV 2005).]]
    [13]
    Snelson, E. and Ghahramani, Z. (2006). Sparse Gaussian processes using pseudo-inputs. In Weiss, Y., Schölkopf, B., and Platt, J., editors, Advances in Neural Information Processing Systems 18, Cambridge, MA. MIT Press.]]
    [14]
    Wasserman, L. (2006). All of Nonparametric Statistics. Springer.]]
    [15]
    West, M. and Harrison, J. (1997). Bayesian Forecasting and Dynamic Models. Springer.]]

    Cited By

    View all
    • (2024)Quantifying the Activities of Local Assembly Members in Japan: Recent Advances and an Approach Using the BERTopic ModelInterdisciplinary Information Sciences10.4036/iis.2024.R.0330:1(68-101)Online publication date: 2024
    • (2024)Revisiting Probabilistic Latent Semantic Analysis: Extensions, Challenges and InsightsTechnologies10.3390/technologies1201000512:1(5)Online publication date: 3-Jan-2024
    • (2024)The Integration of Complex Systems Science and Community-Based Research: A Scoping ReviewSystems10.3390/systems1203008812:3(88)Online publication date: 9-Mar-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICML '06: Proceedings of the 23rd international conference on Machine learning
    June 2006
    1154 pages
    ISBN:1595933832
    DOI:10.1145/1143844
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 June 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Article

    Acceptance Rates

    ICML '06 Paper Acceptance Rate 140 of 548 submissions, 26%;
    Overall Acceptance Rate 140 of 548 submissions, 26%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)799
    • Downloads (Last 6 weeks)81
    Reflects downloads up to 26 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Quantifying the Activities of Local Assembly Members in Japan: Recent Advances and an Approach Using the BERTopic ModelInterdisciplinary Information Sciences10.4036/iis.2024.R.0330:1(68-101)Online publication date: 2024
    • (2024)Revisiting Probabilistic Latent Semantic Analysis: Extensions, Challenges and InsightsTechnologies10.3390/technologies1201000512:1(5)Online publication date: 3-Jan-2024
    • (2024)The Integration of Complex Systems Science and Community-Based Research: A Scoping ReviewSystems10.3390/systems1203008812:3(88)Online publication date: 9-Mar-2024
    • (2024)Visual Analysis Method for Traffic Trajectory with Dynamic Topic Movement Patterns Based on the Improved Markov Decision ProcessElectronics10.3390/electronics1303046713:3(467)Online publication date: 23-Jan-2024
    • (2024)Dynamic insights into research trends and trajectories in early reading: an analytical exploration via dynamic topic modelingFrontiers in Psychology10.3389/fpsyg.2024.132649415Online publication date: 7-Feb-2024
    • (2024)Sparseness-constrained nonnegative tensor factorization for detecting topics at different time scalesFrontiers in Applied Mathematics and Statistics10.3389/fams.2024.128707410Online publication date: 22-Jul-2024
    • (2024)NEW DIRECTIONS IN QUANTUM COMPUTING TECHNOLOGYSSRN Electronic Journal10.2139/ssrn.4871417Online publication date: 2024
    • (2024)Analyzing public demands on China’s online government inquiry platform: A BERTopic-Based topic modeling studyPLOS ONE10.1371/journal.pone.029685519:2(e0296855)Online publication date: 15-Feb-2024
    • (2024)Assessing the evolution of research topics in a biological field using plant science as an examplePLOS Biology10.1371/journal.pbio.300261222:5(e3002612)Online publication date: 23-May-2024
    • (2024)How are texts analyzed in blockchain research? A systematic literature reviewFinancial Innovation10.1186/s40854-023-00501-610:1Online publication date: 29-Feb-2024
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media