Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2556195.2556200acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
tutorial

Multilingual probabilistic topic modeling and its applications in web mining and search

Published: 24 February 2014 Publication History

Abstract

Multilingual topic models are a fairly novel group of unsupervised, language-independent and generative machine learning models. This tutorial covers all key aspects of their probabilistic framework and demonstrates how to easily integrate these models into frameworks for cross-lingual and multilingual Web mining and search.

References

[1]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3:993--1022, 2003.
[2]
X. Ni, J.-T. Sun, J. Hu, and Z. Chen. Cross lingual text classification by mining multilingual topics from Wikipedia. In WSDM, pages 375--384, 2011.
[3]
I. Vulić, W. De Smet, and M.-F. Moens. Cross-language information retrieval models based on latent topic models trained with document-aligned comparable corpora. Information Retrieval, 16(3):331--368, 2013.
[4]
I. Vulić and M.-F. Moens. A unified framework for monolingual and cross-lingual relevance modeling based on probabilistic topic models. In ECIR, pages 98--109, 2013.
[5]
S. Zoghbi, I. Vulić, and M.-F. Moens. I pinned it. Where can i buy one like it? Automatically linking Pinterest pins to online Webshops. In DUBMOD, pages 9--12, 2013.

Cited By

View all
  • (2018)Students search interest model over an organisation based on web log dataInternational Journal of Business Intelligence and Data Mining10.5555/3192182.319218513:1-3(26-39)Online publication date: 15-Dec-2018
  • (2018)A more time-efficient gibbs sampling algorithm based on SparseLDA for latent dirichlet allocationIntelligent Data Analysis10.3233/IDA-17360922:6(1227-1257)Online publication date: 12-Dec-2018
  • (2017)ECIR 2016 Workshop on Modeling, Learning and Mining for Cross/Multilinguality (MultiLingMine '16)ACM SIGIR Forum10.1145/3053408.305342450:2(89-95)Online publication date: 14-Feb-2017
  • Show More Cited By

Index Terms

  1. Multilingual probabilistic topic modeling and its applications in web mining and search

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WSDM '14: Proceedings of the 7th ACM international conference on Web search and data mining
      February 2014
      712 pages
      ISBN:9781450323512
      DOI:10.1145/2556195
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 24 February 2014

      Check for updates

      Author Tags

      1. comparable data
      2. cross-lingual text processing
      3. multilingual data mining
      4. multilingual topic models
      5. probabilistic topic modeling

      Qualifiers

      • Tutorial

      Conference

      WSDM 2014

      Acceptance Rates

      WSDM '14 Paper Acceptance Rate 64 of 355 submissions, 18%;
      Overall Acceptance Rate 498 of 2,863 submissions, 17%

      Upcoming Conference

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 10 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2018)Students search interest model over an organisation based on web log dataInternational Journal of Business Intelligence and Data Mining10.5555/3192182.319218513:1-3(26-39)Online publication date: 15-Dec-2018
      • (2018)A more time-efficient gibbs sampling algorithm based on SparseLDA for latent dirichlet allocationIntelligent Data Analysis10.3233/IDA-17360922:6(1227-1257)Online publication date: 12-Dec-2018
      • (2017)ECIR 2016 Workshop on Modeling, Learning and Mining for Cross/Multilinguality (MultiLingMine '16)ACM SIGIR Forum10.1145/3053408.305342450:2(89-95)Online publication date: 14-Feb-2017
      • (2016)MultiLingMine 2016: Modeling, Learning and Mining for Cross/MultilingualityAdvances in Information Retrieval10.1007/978-3-319-30671-1_83(869-873)Online publication date: 2016

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media