research-article

Adversarial Sampling and Training for Semi-Supervised Information Retrieval

Authors:

Yi ChangAuthors Info & Claims

WWW '19: The World Wide Web Conference

Pages 1443 - 1453

https://doi.org/10.1145/3308558.3313416

Published: 13 May 2019 Publication History

Abstract

Ad-hoc retrieval models with implicit feedback often have problems, e.g., the imbalanced classes in the data set. Too few clicked documents may hurt generalization ability of the models, whereas too many non-clicked documents may harm effectiveness of the models and efficiency of training. In addition, recent neural network-based models are vulnerable to adversarial examples due to the linear nature in them. To solve the problems at the same time, we propose an adversarial sampling and training framework to learn ad-hoc retrieval models with implicit feedback. Our key idea is (i) to augment clicked examples by adversarial training for better generalization and (ii) to obtain very informational non-clicked examples by adversarial sampling and training. Experiments are performed on benchmark data sets for common ad-hoc retrieval tasks such as Web search, item recommendation, and question answering. Experimental results indicate that the proposed approaches significantly outperform strong baselines especially for high-ranked documents, and they outperform IRGAN in NDCG@5 using only 5% of labeled data for the Web search task.

References

[1]

Alexey Borisov, Ilya Markov, Maarten de Rijke, and Pavel Serdyukov. 2016. A neural click model for web search. In Proceedings of the 25th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 531-541.

Digital Library

[2]

Chris Burges, Tal Shaked, Erin Renshaw, Ari Lazier, Matt Deeds, Nicole Hamilton, and Greg Hullender. 2005. Learning to rank using gradient descent. In Proceedings of the 22nd international conference on Machine learning. ACM, 89-96.

Digital Library

[3]

Christopher JC Burges. 2010. From ranknet to lambdarank to lambdamart: An overview. Learning11, 23-581 (2010), 81.

[4]

Christopher J Burges, Robert Ragno, and Quoc V Le. 2007. Learning to rank with nonsmooth cost functions. In Advances in neural information processing systems. 193-200.

Digital Library

[5]

Jingtao Ding, Fuli Feng, Xiangnan He, Guanghui Yu, Yong Li, and Depeng Jin. 2018. An improved sampler for bayesian personalized ranking by leveraging view data. In Companion of the The Web Conference 2018 on The Web Conference 2018. International World Wide Web Conferences Steering Committee, 13-14.

Digital Library

[6]

Minwei Feng, Bing Xiang, Michael R Glass, Lidan Wang, and Bowen Zhou. 2015. Applying deep learning to answer selection: A study and an open task. arXiv preprint arXiv:1508.01585(2015).

[7]

Xavier Glorot, Antoine Bordes, and Yoshua Bengio. 2011. Deep sparse rectifier neural networks. In Proceedings of the fourteenth international conference on artificial intelligence and statistics. 315-323.

[8]

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in neural information processing systems. 2672-2680.

Digital Library

[9]

Ian J Goodfellow, Jonathon Shlens, and Christian Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572(2014).

[10]

Mihajlo Grbovic, Nemanja Djuric, Vladan Radosavljevic, Fabrizio Silvestri, and Narayan Bhamidipati. 2015. Context-and content-aware embeddings for query rewriting in sponsored search. In Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval. ACM, 383-392.

Digital Library

[11]

Jiafeng Guo, Yixing Fan, Qingyao Ai, and W Bruce Croft. 2016. A deep relevance matching model for ad-hoc retrieval. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 55-64.

Digital Library

[12]

F Maxwell Harper and Joseph A Konstan. 2016. The movielens datasets: History and context. Acm transactions on interactive intelligent systems (tiis)5, 4(2016), 19.

Digital Library

[13]

Xiangnan He, Zhankui He, Xiaoyu Du, and Tat-Seng Chua. 2018. Adversarial Personalized Ranking for Recommendation. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. ACM, 355-364.

Digital Library

[14]

Balázs Hidasi and Alexandros Karatzoglou. 2018. Recurrent neural networks with top-k gains for session-based recommendations. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. ACM, 843-852.

Digital Library

[15]

Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation9, 8 (1997), 1735-1780.

Digital Library

[16]

Po-Sen Huang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Acero, and Larry Heck. 2013. Learning deep structured semantic models for web search using clickthrough data. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management. ACM, 2333-2338.

Digital Library

[17]

Thorsten Joachims, Laura Granka, Bing Pan, Helene Hembrooke, and Geri Gay. 2005. Accurately interpreting clickthrough data as implicit feedback. In Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 154-161.

Digital Library

[18]

Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer8(2009), 30-37.

Digital Library

[19]

Tie-Yan Liu 2009. Learning to rank for information retrieval. Foundations and Trends® in Information Retrieval3, 3(2009), 225-331.

Digital Library

[20]

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems. 3111-3119.

Digital Library

[21]

Takeru Miyato, Andrew M Dai, and Ian Goodfellow. 2016. Adversarial training methods for semi-supervised text classification. arXiv preprint arXiv:1605.07725(2016).

[22]

Takeru Miyato, Shin-ichi Maeda, Shin Ishii, and Masanori Koyama. 2018. Virtual adversarial training: a regularization method for supervised and semi-supervised learning. IEEE transactions on pattern analysis and machine intelligence (2018).

[23]

Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, Ken Nakae, and Shin Ishii. 2015. Distributional smoothing with virtual adversarial training. arXiv preprint arXiv:1507.00677(2015).

[24]

Dae Hoon Park and Rikio Chiba. 2017. A neural language model for query auto-completion. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1189-1192.

Digital Library

[25]

Tao Qin and Tie-Yan Liu. 2013. Introducing LETOR 4.0 datasets. arXiv preprint arXiv:1306.2597(2013).

[26]

Filip Radlinski and Thorsten Joachims. 2005. Query chains: learning to rank from implicit feedback. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining. ACM, 239-248.

Digital Library

[27]

Jinfeng Rao, Hua He, and Jimmy Lin. 2016. Noise-contrastive estimation for answer selection with deep neural networks. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 1913-1916.

Digital Library

[28]

Steffen Rendle and Christoph Freudenthaler. 2014. Improving pairwise learning for item recommendation from implicit feedback. In Proceedings of the 7th ACM international conference on Web search and data mining. ACM, 273-282.

Digital Library

[29]

Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence. AUAI Press, 452-461.

Digital Library

[30]

Cicero dos Santos, Ming Tan, Bing Xiang, and Bowen Zhou. 2016. Attentive pooling networks. arXiv preprint arXiv:1602.03609(2016).

[31]

Aliaksei Severyn and Alessandro Moschitti. 2016. Modeling relational information in question-answer pairs with convolutional neural networks. arXiv preprint arXiv:1604.01178(2016).

[32]

Xuehua Shen, Bin Tan, and ChengXiang Zhai. 2005. Context-sensitive information retrieval using implicit feedback. In Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 43-50.

Digital Library

[33]

Alessandro Sordoni, Yoshua Bengio, Hossein Vahabi, Christina Lioma, Jakob Grue Simonsen, and Jian-Yun Nie. 2015. A hierarchical recurrent encoder-decoder for generative context-aware query suggestion. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, 553-562.

Digital Library

[34]

Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, and Rob Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199(2013).

[35]

Di Wang and Eric Nyberg. 2015. A long short-term memory model for answer sentence selection in question answering. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Vol. 2. 707-712.

[36]

Jun Wang, Lantao Yu, Weinan Zhang, Yu Gong, Yinghui Xu, Benyou Wang, Peng Zhang, and Dell Zhang. 2017. Irgan: A minimax game for unifying generative and discriminative information retrieval models. In Proceedings of the 40th International ACM SIGIR conference on Research and Development in Information Retrieval. ACM, 515-524.

Digital Library

[37]

David Warde-Farley and Ian Goodfellow. 2016. 11 adversarial perturbations of deep neural networks. Perturbations, Optimization, and Statistics(2016), 311.

[38]

Kai Xu, Dae Hoon Park, Chang Yi, and Charles Sutton. 2018. Interpreting Deep Classifier by Visual Distillation of Dark Knowledge. arXiv preprint arXiv:1803.04042(2018).

[39]

Xiao Yang, Miaosen Wang, Wei Wang, Madian Khabsa, and Ahmed Awadallah. 2018. Adversarial Training for Community Question Answer Selection Based on Multi-scale Matching. arXiv preprint arXiv:1804.08058(2018).

[40]

Fajie Yuan, Guibing Guo, Joemon M Jose, Long Chen, Haitao Yu, and Weinan Zhang. 2016. Lambdafm: learning optimal ranking with factorization machines using lambda surrogates. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 227-236.

Digital Library

[41]

Quanshi Zhang, Ying Nian Wu, and Song-Chun Zhu. 2017. Interpretable convolutional neural networks. arXiv preprint arXiv:1710.009352, 3 (2017), 5.

[42]

Weinan Zhang, Tianqi Chen, Jun Wang, and Yong Yu. 2013. Optimizing top-n collaborative filtering via dynamic negative item sampling. In Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval. ACM, 785-788.

Digital Library

Cited By

Qiu YDong HChen JHe X(2024)LightAD: accelerating AutoDebias with adaptive samplingJUSTC10.52396/JUSTC-2022-010054:4(0405)Online publication date: 2024
https://doi.org/10.52396/JUSTC-2022-0100
Zheng CJiang GYan XYin PZhou QCheng J(2024)GE2: A General and Efficient Knowledge Graph Embedding Learning SystemProceedings of the ACM on Management of Data10.1145/36549862:3(1-27)Online publication date: 30-May-2024
https://dl.acm.org/doi/10.1145/3654986
Yang ZDing MHuang TCen YSong JXu BDong YTang J(2024)Does Negative Sampling Matter? a Review With Insights Into its Theory and ApplicationsIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.337147346:8(5692-5711)Online publication date: Aug-2024
https://doi.org/10.1109/TPAMI.2024.3371473
Show More Cited By

Recommendations

Inductive Semi-supervised Multi-Label Learning with Co-Training
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

In multi-label learning, each training example is associated with multiple class labels and the task is to learn a mapping from the feature space to the power set of label space. It is generally demanding and time-consuming to obtain labels for training ...
Domain Adaptation for Speaker Verification Based on Self-supervised Learning with Adversarial Training
MultiMedia Modeling
Abstract
Speaker verification models trained on a single domain have difficulty keeping performance on new domain data. Adversarial training maps different domain data to the same subspace to handle this problem. However, adversarial training only uses ...
Improvements to adversarial training for text classification tasks

Although deep learning models show powerful performance, they are still easily deceived by adversarial samples. Some methods for generating adversarial samples have the drawback of high time loss, which is problematic for adversarial training, and the ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '19: The World Wide Web Conference

May 2019

3620 pages

ISBN:9781450366748

DOI:10.1145/3308558

Editors:
Ling Liu
Georgia Tech, USA
,
Ryen White
Microsoft Research, USA

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

IW3C2: International World Wide Web Conference Committee

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

WWW '19

WWW '19: The Web Conference

May 13 - 17, 2019

CA, San Francisco, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

57
Total Citations
View Citations
631
Total Downloads

Downloads (Last 12 months)53
Downloads (Last 6 weeks)5

Reflects downloads up to 09 Aug 2024

Other Metrics

View Author Metrics

Citations

Cited By

Qiu YDong HChen JHe X(2024)LightAD: accelerating AutoDebias with adaptive samplingJUSTC10.52396/JUSTC-2022-010054:4(0405)Online publication date: 2024
https://doi.org/10.52396/JUSTC-2022-0100
Zheng CJiang GYan XYin PZhou QCheng J(2024)GE2: A General and Efficient Knowledge Graph Embedding Learning SystemProceedings of the ACM on Management of Data10.1145/36549862:3(1-27)Online publication date: 30-May-2024
https://dl.acm.org/doi/10.1145/3654986
Yang ZDing MHuang TCen YSong JXu BDong YTang J(2024)Does Negative Sampling Matter? a Review With Insights Into its Theory and ApplicationsIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.337147346:8(5692-5711)Online publication date: Aug-2024
https://doi.org/10.1109/TPAMI.2024.3371473
Zhao YLiu DWan CLiu XNie JLiu J(2024)JMS-QA: A Joint Hierarchical Architecture for Mental Health Question AnsweringIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2023.332929532(352-363)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TASLP.2023.3329295
Chen RFan JWu M(2024)ABNSExpert Systems with Applications: An International Journal10.1016/j.eswa.2024.123868250:COnline publication date: 15-Sep-2024
https://dl.acm.org/doi/10.1016/j.eswa.2024.123868
Chen LGong ZXie HZhou M(2024)False Negative Sample Aware Negative Sampling for RecommendationAdvances in Knowledge Discovery and Data Mining10.1007/978-981-97-2262-4_16(195-206)Online publication date: 25-Apr-2024
https://doi.org/10.1007/978-981-97-2262-4_16
Liu ZLuo W(2023)FMGAN: A Filter-Enhanced MLP Debias Recommendation Model Based on Generative Adversarial NetworkApplied Sciences10.3390/app1313797513:13(7975)Online publication date: 7-Jul-2023
https://doi.org/10.3390/app13137975
Yao FLiu QHou MTong SHuang ZChen ESha JWang SElkind E(2023)Exploiting non-interactive exercises in cognitive diagnosisProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/266(2367-2405)Online publication date: 19-Aug-2023
https://dl.acm.org/doi/10.24963/ijcai.2023/266
Chen XTan DGupta PBromuri S(2023)Pair-wise selective classification with dynamic sampling for shipment importer predictionProceedings of the 2023 15th International Conference on Machine Learning and Computing10.1145/3587716.3587741(152-157)Online publication date: 17-Feb-2023
https://dl.acm.org/doi/10.1145/3587716.3587741
Wu XYang LGong JZhou CLin TLiu XYu PFrommholz IHopfgartner FLee MOakes MLalmas MZhang MSantos R(2023)Dimension Independent Mixup for Hard Negative Sample in Collaborative FilteringProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614845(2785-2794)Online publication date: 21-Oct-2023
https://dl.acm.org/doi/10.1145/3583780.3614845
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents