Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2484028.2484098acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Modeling term dependencies with quantum language models for IR

Published: 28 July 2013 Publication History

Abstract

Traditional information retrieval (IR) models use bag-of-words as the basic representation and assume that some form of independence holds between terms. Representing term dependencies and defining a scoring function capable of integrating such additional evidence is theoretically and practically challenging. Recently, Quantum Theory (QT) has been proposed as a possible, more general framework for IR. However, only a limited number of investigations have been made and the potential of QT has not been fully explored and tested. We develop a new, generalized Language Modeling approach for IR by adopting the probabilistic framework of QT. In particular, quantum probability could account for both single and compound terms at once without having to extend the term space artificially as in previous studies. This naturally allows us to avoid the weight-normalization problem, which arises in the current practice by mixing scores from matching compound terms and from matching single terms. Our model is the first practical application of quantum probability to show significant improvements over a robust bag-of-words baseline and achieves better performance on a stronger non bag-of-words baseline.

References

[1]
J. Bai, Y. Chang, H. Cui, Z. Zheng, G. Sun, and X. Li. Investigation of partial query proximity in web search. In Proc. of WWW, pages 1183--1184, 2008.
[2]
M. Bendersky and W. B. Croft. Modeling higher-order term dependencies in information retrieval using query hypergraphs. In Proc. of SIGIR, pages 941--950, 2012.
[3]
M. Bendersky, D. Metzler, and W. B. Croft. Parametrized concept weighting in verbose queries. In Proc. of SIGIR, pages 605--614, 2011.
[4]
R. Blume-Kohout. Hedged maximum likelihood estimation. Phys. Rev. Lett., 105:200504, 2010.
[5]
R. Blume-Kohout. Optimal, reliable estimation of quantum states. New J. Phys., 12:043034, 2010.
[6]
O. Chapelle, D. Metzler, Y. Zhang, P. Grinspan. Expected reciprocal rank for graded relevance In Proc. of CIKM, 2009.
[7]
S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman. Indexing by latent semantic analysis. JASIST, 41:391--407, 1990.
[8]
J. L. Fagan. Automatic phrase indexing for document retrieval. In Proc. of SIGIR, pages 91--101, 1987.
[9]
J. Gao, J. Y. Nie, G. Wu, and G. Cao. Dependence language model for information retrieval. In Proc. of SIGIR, pages 170--177, 2004.
[10]
A. Gleason. Measures on the closed subspaces of a hilbert space. Journ. Math. Mech., 6:885--893, 1957.
[11]
K. S. Jones, S. Walker, and S. E. Robertson. A probabilistic model of information retrieval: development and comparative experiments. Inf. Proc. Manag., pages 779--840, 2000.
[12]
M. Lease. An improved markov random field model for supporting verbose queries. In Proc. of SIGIR, pages 476--483, 2009.
[13]
C. Lee, G. G. Lee, and M. G. Jang. Dependency structure applied to language modeling for information retrieval. ETRI, 28(3):337--346, 2006.
[14]
Y. Lv and C. Zhai. Positional language models for information retrieval. In Proc. of SIGIR, pages 299--306, 2009.
[15]
A. I. Lvovsky. Iterative maximum-likelihood reconstruction in quantum homodyne tomography. Journ. Opt. B6, pages S556--S559, 2004.
[16]
M. Melucci. Deriving a quantum information retrieval basis. The Computer Journal, 2012.
[17]
M. Melucci and K. Rijsbergen. Quantum mechanics and information retrieval. Advanced Topics in Information Retrieval, 33:125--155, 2011.
[18]
D. Metzler and W. Bruce Croft. Linear feature-based models for information retrieval. Inf. Retr., 10(3):257--274, 2007.
[19]
D. Metzler and W. B. Croft. A markov random field model for term dependencies. In Proc. of SIGIR, pages 472--479, 2005.
[20]
D. Metzler, T. Strohman, Y. Zhou, and W. B. Croft. Indri at TREC 2005: Terabyte Track. In Proc. of TREC, 2005.
[21]
M. Mitra, C. Buckley, A. Singhal, and C. Cardie. An analysis of statistical and syntactic phrases. In Proc of RIAO, pages 200--217, 1997.
[22]
W. Morgan, W. Greiff, and J. Henderson. Direct maximization of average precision by hill-climbing, with a comparison to a maximum entropy approach. In Proc. of HLT-NAACL, pages 93--96, 2004.
[23]
M. A. Nielsen and I. L. Chuang. Quantum Computation and Quantum Information. Cambridge University Press, 2004.
[24]
J. H. Park, W. B. Croft, and D. A. Smith. A quasi-synchronous dependence model for information retrieval. In Proc. of CIKM, pages 17--26, 2011.
[25]
B. Piwowarski, I. Frommholz, M. Lalmas, and K. van Rijsbergen. What can quantum theory bring to information retrieval. In Proc. of CIKM, pages 59--68, 2010.
[26]
M. Pretti. A message-passing algorithm with damping. J. Stat. Mech., page P11008, 2005.
[27]
J. Reháček, Z. Hradil, E. Knill, A. I. Lvovsky. Diluted maximum-likelihood algorithm for quantum tomography. Phys. Rev. A, 75:042108, 2007.
[28]
G. Salton, C. S. Yang, and C. T. Yu. A Theory of Term Importance in Automatic Text Analysis. JASIST, 26(1):33--44, 1975.
[29]
M. D. Smucker, J. Allan, and B. Carterette. A comparison of statistical significance tests for information retrieval evaluation. In Proc. of CIKM, pages 623--632, 2007.
[30]
F. Song and W. B. Croft. A general language model for information retrieval. In Proc. of SIGIR, pages 279--280, 1999.
[31]
M. Srikanth and R. Srihari. Biterm language models for document retrieval. In Proc. of SIGIR, pages 425--426, 2002.
[32]
H. Umegaki. Conditional expectation in an operator algebra. Kodai Mathematical Seminar Reports, 14(2):59--85, 1962.
[33]
K. van Rijsbergen. The Geometry of Information Retrieval. Cambridge University Press, 2004.
[34]
M. K. Warmuth and D. Kuzmin. Bayesian generalized probability calculus for density matrices. Machine Learning, 78(1-2):63--101, 2009.
[35]
C. Zhai. Statistical language models for information retrieval a critical review. Found. Trends Inf. Retr., 2(3):137--213, 2008.
[36]
J. Zhao and Y. Yun. A proximity language model for information retrieval. In Proc. of SIGIR, pages 291--298, 2009.
[37]
X. Zhao, P. Zhang, D. Song, and Y. Hou. A novel re-ranking approach inspired by quantum measurement. In Proc. of ECIR, pages 721--724, 2011.
[38]
G. Zuccon and L. Azzopardi. Using the quantum probability ranking principle to rank interdependent documents. In Proc. of ECIR, page 357--369, 2010.

Cited By

View all
  • (2024)A Quantum Probability Driven Framework for Joint Multi-Modal Sarcasm, Sentiment and Emotion AnalysisIEEE Transactions on Affective Computing10.1109/TAFFC.2023.327914515:1(326-341)Online publication date: Jan-2024
  • (2024)Research on Distributional Compositional Categorical Model in Both Classical and Quantum Natural Language Processing2024 IEEE/ACIS 27th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)10.1109/SNPD61259.2024.10673943(66-71)Online publication date: 5-Jul-2024
  • (2024)Quantum-inspired Neural Network Based on Stochastic Liouville-von Neumann Equation for Sentiment Classification2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650170(1-8)Online publication date: 30-Jun-2024
  • Show More Cited By

Index Terms

  1. Modeling term dependencies with quantum language models for IR

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '13: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
    July 2013
    1188 pages
    ISBN:9781450320344
    DOI:10.1145/2484028
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 July 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. density matrices
    2. language modeling
    3. retrieval models

    Qualifiers

    • Research-article

    Conference

    SIGIR '13
    Sponsor:

    Acceptance Rates

    SIGIR '13 Paper Acceptance Rate 73 of 366 submissions, 20%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)74
    • Downloads (Last 6 weeks)13
    Reflects downloads up to 04 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Quantum Probability Driven Framework for Joint Multi-Modal Sarcasm, Sentiment and Emotion AnalysisIEEE Transactions on Affective Computing10.1109/TAFFC.2023.327914515:1(326-341)Online publication date: Jan-2024
    • (2024)Research on Distributional Compositional Categorical Model in Both Classical and Quantum Natural Language Processing2024 IEEE/ACIS 27th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)10.1109/SNPD61259.2024.10673943(66-71)Online publication date: 5-Jul-2024
    • (2024)Quantum-inspired Neural Network Based on Stochastic Liouville-von Neumann Equation for Sentiment Classification2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650170(1-8)Online publication date: 30-Jun-2024
    • (2024)Quantum-inspired Neural Network with Lindblad Master Equation for Sentiment Analysis2024 6th International Conference on Natural Language Processing (ICNLP)10.1109/ICNLP60986.2024.10692739(119-123)Online publication date: 22-Mar-2024
    • (2024)Quantum Topic Model: Topic Modeling Using Variational Quantum CircuitsICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10447349(5895-5899)Online publication date: 14-Apr-2024
    • (2024)Quantum Natural Language Processing: A Comprehensive SurveyIEEE Access10.1109/ACCESS.2024.342070712(99578-99598)Online publication date: 2024
    • (2024)Quantum representation based preference evolution network for e-commerce recommendationPhysica A: Statistical Mechanics and its Applications10.1016/j.physa.2024.130155(130155)Online publication date: Oct-2024
    • (2024)QSIM: A Quantum-inspired hierarchical semantic interaction model for text classificationNeurocomputing10.1016/j.neucom.2024.128658(128658)Online publication date: Oct-2024
    • (2024)Quantum-inspired language models based on unitary transformationInformation Processing & Management10.1016/j.ipm.2024.10374161:4(103741)Online publication date: Jul-2024
    • (2024)Quantum transfer learning for acceptability judgementsQuantum Machine Intelligence10.1007/s42484-024-00141-86:1Online publication date: 4-Mar-2024
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media