Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1645953.1646223acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
poster

Topic and keyword re-ranking for LDA-based topic modeling

Published: 02 November 2009 Publication History

Abstract

Topic-based text summaries promise to help average users quickly understand a text collection and derive insights. Recent research has shown that the Latent Dirichlet Allocation (LDA) model is one of the most effective approaches to topic analysis. However, the LDA-based results may not be ideal for human understanding and consumption. In this paper, we present several topic and keyword re-ranking approaches that can help users better understand and consume the LDA-derived topics in their text analysis. Our methods process the LDA output based on a set of criteria that model a user's information needs. Our evaluation demonstrates the usefulness of the methods in summarizing several large-scale, real world data sets.

References

[1]
C. M. Bishop. Pattern Recognition and Machine Learning (Information Science and Statistics). Springer, 2006.
[2]
D. Blei and J. Lafferty. Topic Models, chapter: Topic Models. Taylor and Francis, 2009. (in Press).
[3]
D. Blei, A. Ng, and M. Jordan. Latent Dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003.
[4]
S. Deerwester, S. Dumais, T. Landauer, G. Furnas, and R. Harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391--407, 1990.
[5]
T. Griffiths and M. Steyvers. Finding scientific topics. Proceedings of the National Academy of Sciences, 101:5228--5235, 2004.
[6]
X. He, D. Cai, and P. Niyogi. Laplacian score for feature selection. In Proceedings of NIPS, 2005.
[7]
T. Hofmann. Probabilistic latent semantic analysis. In Proceedings of UAI, pages 289--296, 1999.
[8]
M. Lan, C. Tan, H. Low, and S. Sung. A comprehensive comparative study on term weighting schemes for text categorization with support vector machines. In WWW (Special interest tracks and posters), pages 1032--1033, 2005.
[9]
S. Liu, M. X. Zhou, S. Pan, W. Qian, W. Cai, and X. Lian. Interactive, topics-based visual text summarization and analysis. In Proceedings of the CIKM, 2009.
[10]
A. K. McCallum. Mallet: A machine learning for language toolkit. 2002.
[11]
P. Mitra, C. Murthy, and S. Pal. Unsupervised feature selection using feature similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(3):301--312, 2002.
[12]
M. Sahami. Using Machine Learning to Improve Information Access. PhD thesis, Department of Computer Science, Stanford University, USA, 1998.
[13]
G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513--523, 1988.
[14]
Y. Yang and J. Pedersen. A comparative study on feature selection in text categorization. In Proceedings of ICML, pages 412--420, 1997.

Cited By

View all
  • (2025)Hierarchical Deep Document ModelIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.348752337:1(351-364)Online publication date: Jan-2025
  • (2024)Happiness and Sadness in Adolescents’ Instagram Direct Messaging: A Neural Topic Modeling ApproachSocial Media + Society10.1177/2056305124122965510:1Online publication date: 23-Feb-2024
  • (2024)Bert-BiLSTM Model for Sentiment Analysis Using Contextual Embeddings and Bidirectional Dependencies2024 International Symposium on Internet of Things and Smart Cities (ISITSC)10.1109/ISITSC64373.2024.00022(88-93)Online publication date: 21-Jun-2024
  • Show More Cited By

Index Terms

  1. Topic and keyword re-ranking for LDA-based topic modeling

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
      November 2009
      2162 pages
      ISBN:9781605585123
      DOI:10.1145/1645953
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 02 November 2009

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. topic and keyword re-ranking
      2. topic model

      Qualifiers

      • Poster

      Conference

      CIKM '09
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)120
      • Downloads (Last 6 weeks)16
      Reflects downloads up to 25 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)Hierarchical Deep Document ModelIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.348752337:1(351-364)Online publication date: Jan-2025
      • (2024)Happiness and Sadness in Adolescents’ Instagram Direct Messaging: A Neural Topic Modeling ApproachSocial Media + Society10.1177/2056305124122965510:1Online publication date: 23-Feb-2024
      • (2024)Bert-BiLSTM Model for Sentiment Analysis Using Contextual Embeddings and Bidirectional Dependencies2024 International Symposium on Internet of Things and Smart Cities (ISITSC)10.1109/ISITSC64373.2024.00022(88-93)Online publication date: 21-Jun-2024
      • (2024)A Review: Comprehensive study on societal Analysis for health care system Using topic modeling Paradigms2024 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)10.1109/ASSIC60049.2024.10507910(1-5)Online publication date: 27-Jan-2024
      • (2024)Utilizing cognitive signals generated during human reading to enhance keyphrase extraction from microblogsInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10361461:2Online publication date: 1-Mar-2024
      • (2024)Enhancing AI Research for Breast Cancer: A Comprehensive Review of Tumor-Infiltrating Lymphocyte DatasetsJournal of Imaging Informatics in Medicine10.1007/s10278-024-01043-837:6(2996-3008)Online publication date: 28-May-2024
      • (2024)Automated Topic Exploration in a Cultural Heritage CorpusMetadata and Semantic Research10.1007/978-3-031-65990-4_21(229-240)Online publication date: 31-Jul-2024
      • (2023)Analyzing Trends and Topics of Sinhala Hate Speech on Twitter: A Time Series Approach2023 IEEE 17th International Conference on Industrial and Information Systems (ICIIS)10.1109/ICIIS58898.2023.10253497(67-72)Online publication date: 25-Aug-2023
      • (2023)Passenger intelligence as a competitive opportunity: unsupervised text analytics for discovering airline-specific insights from online reviewsAnnals of Operations Research10.1007/s10479-022-05162-9333:2-3(1045-1075)Online publication date: 13-Jan-2023
      • (2022)Predicting Duplicate in Bug Report Using Topic-Based Duplicate Learning With Fine Tuning-Based BERT AlgorithmIEEE Access10.1109/ACCESS.2022.322623810(129666-129675)Online publication date: 2022
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media