DeepRank: A New Deep Architecture for Relevance Ranking in Information Retrieval

Authors:

Xueqi Cheng

CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management

Pages 257 - 266

https://doi.org/10.1145/3132847.3132914

Published: 06 November 2017

Abstract

This paper concerns a deep learning approach to relevance ranking in information retrieval (IR). Existing deep IR models such as DSSM and CDSSM directly apply neural networks to generate ranking scores, without an explicit understanding of relevance. In the human judgment process, a relevance label is generated in three steps: 1) relevant locations are detected; 2) local relevances are determined; 3) local relevances are aggregated to output the relevance label. In this paper we propose a new deep learning architecture, DeepRank, to simulate this human judgment process. First, a detection strategy is designed to extract the relevant contexts. Then, a measure network determines the local relevances using a convolutional neural network (CNN) or two-dimensional gated recurrent units (2D-GRU). Finally, an aggregation network with sequential integration and a term gating mechanism produces a global relevance score. DeepRank captures important IR characteristics, including exact/semantic matching signals, proximity heuristics, query term importance, and diverse relevance requirements. Experiments on both the benchmark LETOR dataset and large-scale clickthrough data show that DeepRank significantly outperforms learning-to-rank methods and existing deep learning methods.
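The three steps named in the abstract (detection, local measurement, aggregation) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `window` parameter, and the `term_weights` dictionary are hypothetical, the overlap score merely stands in for the CNN or 2D-GRU measure network, and a position-discounted sum stands in for the sequential integration. Query terms are assumed to be distinct.

```python
import math
from collections import defaultdict


def detect_contexts(query, doc, window=2):
    """Step 1 (detection): collect a token window around each exact query-term match."""
    contexts = defaultdict(list)
    query_terms = set(query)
    for pos, token in enumerate(doc):
        if token in query_terms:
            start = max(0, pos - window)
            contexts[token].append((pos, doc[start:pos + window + 1]))
    return contexts


def local_relevance(query, context):
    """Step 2 (measure): fraction of query terms present in the context.

    Placeholder only; the paper uses a CNN or 2D-GRU for this step.
    """
    present = set(context)
    return sum(1 for term in query if term in present) / len(query)


def term_gates(query, term_weights):
    """Softmax over per-term weights, a simple form of term gating."""
    exps = [math.exp(term_weights.get(term, 0.0)) for term in query]
    total = sum(exps)
    return {term: e / total for term, e in zip(query, exps)}


def deep_rank_sketch(query, doc, term_weights, window=2):
    """Step 3 (aggregation): combine local scores per query term, then gate them."""
    contexts = detect_contexts(query, doc, window)
    gates = term_gates(query, term_weights)
    score = 0.0
    for term in query:
        # Assumption: earlier matches count more; this discount replaces
        # the sequential integration described in the abstract.
        term_score = sum(
            local_relevance(query, ctx) / (1.0 + math.log1p(pos))
            for pos, ctx in contexts.get(term, [])
        )
        score += gates[term] * term_score
    return score


if __name__ == "__main__":
    query = ["deep", "ranking"]
    doc = "deep ranking models use deep networks".split()
    print(deep_rank_sketch(query, doc, term_weights={"deep": 0.0, "ranking": 1.0}))
```

In this sketch a document with no exact query-term match receives a score of zero, because no contexts are detected; how the actual model treats such documents is not stated in the abstract.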

Index Terms

1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking

Published In

CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management

November 2017

2604 pages

ISBN: 9781450349185

DOI: 10.1145/3132847

General Chairs:
Ee-Peng Lim (Singapore Management University, Singapore)
Marianne Winslett (University of Illinois at Urbana-Champaign, USA, and Advanced Digital Sciences Center, Singapore)

Program Chairs:
Mark Sanderson (RMIT, Australia)
Ada Fu (Chinese University of Hong Kong, Hong Kong)
Jimeng Sun (Georgia Tech, USA)
Shane Culpepper (RMIT, Australia)
Eric Lo (Chinese University of Hong Kong, Hong Kong)
Joyce Ho (Emory University, USA)
Debora Donato (Mix Tech, Inc., USA)
Rakesh Agrawal (Data Insights Laboratories, USA)
Yu Zheng (Microsoft Research Asia, China)
Carlos Castillo (Qatar Computing Research Institute, Qatar)
Aixin Sun (Nanyang Technological University, Singapore)
Vincent S. Tseng (National Cheng Kung University, Taiwan)
Chenliang Li (Wuhan University, China)

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China (NSFC)
Youth Innovation Promotion Association CAS
973 Program of China

Conference

CIKM '17

CIKM '17: ACM Conference on Information and Knowledge Management

November 6 - 10, 2017

Singapore, Singapore

Acceptance Rates

CIKM '17 Paper Acceptance Rate 171 of 855 submissions, 20%;

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Bibliometrics & Citations

Article Metrics

Total Citations: 140
Total Downloads: 1,427
Downloads (Last 12 months): 106
Downloads (Last 6 weeks): 6

Reflects downloads up to 03 Feb 2025

Citations

Cited By

Rakhshani H, Abolqasemi A, Keyhanipour A (2024). DeepRank. Advanced Interdisciplinary Applications of Deep Learning for Data Science, pp. 31-50. Online publication date: 27-Dec-2024.
https://doi.org/10.4018/979-8-3693-4759-1.ch002
Higashimoto R, Yoshida S, Muneyasu M (2024). CRAS: Curriculum Regularization and Adaptive Semi-Supervised Learning with Noisy Labels. Applied Sciences 14:3 (1208). Online publication date: 31-Jan-2024.
https://doi.org/10.3390/app14031208
Chen K, Koudas N (2024). Unstructured Data Fusion for Schema and Data Extraction. Proceedings of the ACM on Management of Data 2:3 (1-26). Online publication date: 30-May-2024.
https://dl.acm.org/doi/10.1145/3654984
Xu M, Lou Y, Ma W, Li X, Zhou X, Gurrin C, Kongkachandra R, Schoeffmann K, Dang-Nguyen D, Rossetto L, Satoh S, Zhou L (2024). Parametric CAD Primitive Retrieval via Multi-Modal Fusion and Deep Hashing. Proceedings of the 2024 International Conference on Multimedia Retrieval, pp. 1061-1069. Online publication date: 30-May-2024.
https://dl.acm.org/doi/10.1145/3652583.3658041
Chakraborty P, Alfadel M, Nagappan M (2024). RLocator: Reinforcement Learning for Bug Localization. IEEE Transactions on Software Engineering 50:10 (2695-2708). Online publication date: Oct-2024.
https://doi.org/10.1109/TSE.2024.3452595
Mu H, Zhang S, Wang Y, Sun Y, Xu H (2024). TRGNN: Text-Rich Graph Neural Network for Few-Shot Document Filtering. 2024 International Joint Conference on Neural Networks (IJCNN), pp. 1-9. Online publication date: 30-Jun-2024.
https://doi.org/10.1109/IJCNN60899.2024.10650066
Varma N, Mattaparty S, Nikitha D, Numan M (2024). SkillSync: Evaluating Candidates for Communication and Problem-Solving Proficiency. 2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT), pp. 1-6. Online publication date: 24-Jun-2024.
https://doi.org/10.1109/ICCCNT61001.2024.10724605
Bharath B, Bollineni J, Mandala S, Ganguly T, J A, Shukla N, Joseph R (2024). Anatomization of Neural Networks based models for Semantic Analysis of Tabular dataset. 2024 IEEE 9th International Conference for Convergence in Technology (I2CT), pp. 1-8. Online publication date: 5-Apr-2024.
https://doi.org/10.1109/I2CT61223.2024.10544176
Dai M, Raffiee A, Jain A, Correa J (2024). Evaluating Transferability in Retrieval Tasks: An Approach Using MMD and Kernel Methods. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 22390-22400. Online publication date: 16-Jun-2024.
https://doi.org/10.1109/CVPR52733.2024.02113
Wang D, Wang L, Tang K, Bo Q (2024). PDAM-FAQ: Paraphrasing-Based Data Augmentation and Mixed-Feature Semantic Matching for Low-Resource FAQs. IEEE Access 12 (190054-190066). Online publication date: 2024.
https://doi.org/10.1109/ACCESS.2024.3516088
