Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3132847.3132914acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

DeepRank: A New Deep Architecture for Relevance Ranking in Information Retrieval

Published: 06 November 2017 Publication History

Abstract

This paper concerns a deep learning approach to relevance ranking in information retrieval (IR). Existing deep IR models such as DSSM and CDSSM directly apply neural networks to generate ranking scores, without explicit understandings of the relevance. According to the human judgement process, a relevance label is generated by the following three steps: 1) relevant locations are detected; 2) local relevances are determined; 3) local relevances are aggregated to output the relevance label. In this paper we propose a new deep learning architecture, namely DeepRank, to simulate the above human judgment process. Firstly, a detection strategy is designed to extract the relevant contexts. Then, a measure network is applied to determine the local relevances by utilizing a convolutional neural network (CNN) or two-dimensional gated recurrent units (2D-GRU). Finally, an aggregation network with sequential integration and term gating mechanism is used to produce a global relevance score. DeepRank well captures important IR characteristics, including exact/semantic matching signals, proximity heuristics, query term importance, and diverse relevance requirement. Experiments on both benchmark LETOR dataset and a large scale clickthrough data show that DeepRank can significantly outperform learning to ranking methods, and existing deep learning methods.

References

[1]
Christopher JC Burges. 2010. From ranknet to lambdarank to lambdamart: An overview. Learning Vol. 11 (2010), 23--581.
[2]
Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li. 2007. Learning to rank: from pairwise approach to listwise approach ICML. ACM, 129--136.
[3]
Olivier Chapelle and Yi Chang. 2011. Yahoo! learning to rank challenge overview. In Proceedings of the Learning to Rank Challenge. 1--24.
[4]
Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In EMNLP. 1724--1734.
[5]
Carsten Eickhoff, Sebastian Dungs, and Vu Tran. 2015. An eye-tracking study of query reformulation. In SIGIR. ACM, 13--22.
[6]
Hui Fang, Tao Tao, and ChengXiang Zhai. 2004. A formal study of information retrieval heuristics SIGIR. ACM, 49--56.
[7]
Yoav Freund, Raj Iyer, Robert E Schapire, and Yoram Singer. 2003. An efficient boosting algorithm for combining preferences. JMLR, Vol. 4, Nov (2003), 933--969.
[8]
Fredric C Gey. 1994. Inferring probability of relevance using the method of logistic regression SIGIR. Springer, 222--231.
[9]
Rich Caruana Steve Lawrence Lee Giles. 2001. Overfitting in Neural Nets: Backpropagation, Conjugate Gradient, and Early Stopping NIPS, Vol. Vol. 13. MIT Press, 402.
[10]
Alan Graves, Abdel-rahman Mohamed, and Geoffrey Hinton. 2013. Speech recognition with deep recurrent neural networks ICASSP. IEEE, 6645--6649.
[11]
Jiafeng Guo, Yixing Fan, Qingyao Ai, and W Bruce Croft. 2016. A deep relevance matching model for ad-hoc retrieval CIKM. ACM, 55--64.
[12]
Baotian Hu, Zhengdong Lu, Hang Li, and Qingcai Chen. 2014. Convolutional neural network architectures for matching natural language sentences NIPS. 2042--2050.
[13]
Po-Sen Huang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Acero, and Larry Heck. 2013. Learning deep structured semantic models for web search using clickthrough data CIKM. ACM, 2333--2338.
[14]
Thorsten Joachims. 2002. Optimizing search engines using clickthrough data. SIGKDD. ACM, 133--142.
[15]
Thorsten Joachims. 2006. Training linear SVMs in linear time. In SIGIR. ACM, 217--226.
[16]
Diederik Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[17]
Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature, Vol. 521, 7553 (2015), 436--444.
[18]
Tie-Yan Liu. 2009. Learning to rank for information retrieval. Foundations and Trends in Information Retrieval, Vol. 3, 3 (2009), 225--331.
[19]
Yuanhua Lv and ChengXiang Zhai. 2009. Positional language models for information retrieval SIGIR. ACM, 299--306.
[20]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality NIPS. 3111--3119.
[21]
Shuzi Niu, Jiafeng Guo, Yanyan Lan, and Xueqi Cheng. 2012. Top-k learning to rank: labeling, ranking and evaluation SIGIR. ACM, 751--760.
[22]
Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. 1999. The PageRank citation ranking: Bringing order to the web. Technical Report. Stanford InfoLab.
[23]
Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, and Xueqi Cheng. 2016 a. A study of matchpyramid models on ad-hoc retrieval Neu-IR '16 SIGIR Workshop on Neural Information Retrieval.
[24]
Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, Shengxian Wan, and Xueqi Cheng. 2016 b. Text matching as image recognition. In AAAI. AAAI Press, 2793--2799.
[25]
Tao Qin, Tie-Yan Liu, Jun Xu, and Hang Li. 2010. LETOR: A benchmark collection for research on learning to rank for information retrieval. Information Retrieval Vol. 13, 4 (2010), 346--374.
[26]
Stephen Robertson. 2000. Evaluation in information retrieval. Lectures on information retrieval. Springer, 81--92.
[27]
Stephen E Robertson and Steve Walker. 1994. Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In SIGIR. Springer-Verlag New York, Inc., 232--241.
[28]
Aliaksei Severyn and Alessandro Moschitti. 2015. Learning to rank short text pairs with convolutional deep neural networks Proceedings of SIGIR. ACM, 373--382.
[29]
Yelong Shen, Xiaodong He, Jianfeng Gao, Li Deng, and Grégoire Mesnil. 2014. Learning semantic representations using convolutional neural networks for web search WWW. WWW, 373--374.
[30]
Mark D Smucker, James Allan, and Ben Carterette. 2007. A comparison of statistical significance tests for information retrieval evaluation CIKM. ACM, 623--632.
[31]
Tao Tao and ChengXiang Zhai. 2007. An exploration of proximity measures in information retrieval SIGIR. ACM, 295--302.
[32]
Shengxian Wan, Yanyan Lan, Jiafeng Guo, Jun Xu, Liang Pang, and Xueqi Cheng. 2016. Match-SRNN: Modeling the Recursive Matching Structure with Spatial RNN IJCAI. 2922--2928.
[33]
Ho Chung Wu, Robert WP Luk, Kam-Fai Wong, and KL Kwok. 2007. A retrospective study of a hybrid document-context based retrieval model. Information processing & management Vol. 43, 5 (2007), 1308--1331.
[34]
Jun Xu and Hang Li. 2007. Adarank: a boosting algorithm for information retrieval SIGIR. ACM, 391--398.

Cited By

View all
  • (2024)DeepRankAdvanced Interdisciplinary Applications of Deep Learning for Data Science10.4018/979-8-3693-4759-1.ch002(31-50)Online publication date: 27-Dec-2024
  • (2024)CRAS: Curriculum Regularization and Adaptive Semi-Supervised Learning with Noisy LabelsApplied Sciences10.3390/app1403120814:3(1208)Online publication date: 31-Jan-2024
  • (2024)Unstructured Data Fusion for Schema and Data ExtractionProceedings of the ACM on Management of Data10.1145/36549842:3(1-26)Online publication date: 30-May-2024
  • Show More Cited By

Index Terms

  1. DeepRank: A New Deep Architecture for Relevance Ranking in Information Retrieval

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
    November 2017
    2604 pages
    ISBN:9781450349185
    DOI:10.1145/3132847
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 06 November 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. deep learning
    2. information retrieval
    3. ranking
    4. text matching

    Qualifiers

    • Research-article

    Funding Sources

    • National Natural Science Foundation of China (NSFC)
    • Youth Innovation Promotion Association CAS
    • 973 Program of China

    Conference

    CIKM '17
    Sponsor:

    Acceptance Rates

    CIKM '17 Paper Acceptance Rate 171 of 855 submissions, 20%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)106
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 03 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)DeepRankAdvanced Interdisciplinary Applications of Deep Learning for Data Science10.4018/979-8-3693-4759-1.ch002(31-50)Online publication date: 27-Dec-2024
    • (2024)CRAS: Curriculum Regularization and Adaptive Semi-Supervised Learning with Noisy LabelsApplied Sciences10.3390/app1403120814:3(1208)Online publication date: 31-Jan-2024
    • (2024)Unstructured Data Fusion for Schema and Data ExtractionProceedings of the ACM on Management of Data10.1145/36549842:3(1-26)Online publication date: 30-May-2024
    • (2024)Parametric CAD Primitive Retrieval via Multi-Modal Fusion and Deep HashingProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658041(1061-1069)Online publication date: 30-May-2024
    • (2024)RLocator: Reinforcement Learning for Bug LocalizationIEEE Transactions on Software Engineering10.1109/TSE.2024.345259550:10(2695-2708)Online publication date: Oct-2024
    • (2024)TRGNN: Text-Rich Graph Neural Network for Few-Shot Document Filtering2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650066(1-9)Online publication date: 30-Jun-2024
    • (2024)SkillSync: Evaluating Candidates for Communication and Problem-Solving Proficiency2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT)10.1109/ICCCNT61001.2024.10724605(1-6)Online publication date: 24-Jun-2024
    • (2024)Anatomization of Neural Networks based models for Semantic Analysis of Tabular dataset2024 IEEE 9th International Conference for Convergence in Technology (I2CT)10.1109/I2CT61223.2024.10544176(1-8)Online publication date: 5-Apr-2024
    • (2024)Evaluating Transferability in Retrieval Tasks: An Approach Using MMD and Kernel Methods2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.02113(22390-22400)Online publication date: 16-Jun-2024
    • (2024)PDAM-FAQ: Paraphrasing-Based Data Augmentation and Mixed-Feature Semantic Matching for Low-Resource FAQsIEEE Access10.1109/ACCESS.2024.351608812(190054-190066)Online publication date: 2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media