Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3397271.3401153acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Employing Personal Word Embeddings for Personalized Search

Published: 25 July 2020 Publication History
  • Get Citation Alerts
  • Abstract

    Personalized search is a task to tailor the general document ranking list based on user interests to better satisfy the user's information need. Many personalized search models have been proposed and demonstrated their capability to improve search quality. The general idea of most approaches is to build a user interest profile according to the user's search history, and then re-rank the documents based on the matching scores between the created user profile and candidate documents. In this paper, we propose to solve the problem of personalized search in an alternative way. We know that there are many ambiguous words in natural language such as 'Apple', and people with different knowledge backgrounds and interests have personalized understandings of these words. Therefore, for different users, such a word should own different semantic representations. Motivated by this idea, we design a personalized search model based on personal word embeddings, referred to as PEPS. Specifically, we train personal word embeddings for each user in which the representation of each word is mainly decided by the user's personal data. Then, we obtain the personalized word and contextual representations of the query and documents with an attention function. Finally, we use a matching model to calculate the matching score between the personalized query and document representations. Experiments on two datasets verify that our model can significantly improve state-of-the-art personalization models.

    Supplementary Material

    M4V File (3397271.3401153.m4v)
    Presentation video

    References

    [1]
    Wasi Uddin Ahmad, Kai-Wei Chang, and Hongning Wang. 2018. Multi-Task Learning for Document Ranking and Query Suggestion. In 6th International Conference on Learning Representations, ICLR 2018, .
    [2]
    Wasi Uddin Ahmad, Kai-Wei Chang, and Hongning Wang. 2019. Context Attentive Document Ranking and Query Suggestion. In Proceedings of SIGIR 2019.
    [3]
    Nawal Ould Amer, Philippe Mulhem, and Mathias Gé ry. 2016. Toward Word Embedding for Personalized Information Retrieval. CoRR, Vol. abs/1606.06991 (2016).
    [4]
    Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In 3rd International Conference on Learning Representations, ICLR 2015 .
    [5]
    Paul N. Bennett, Filip Radlinski, Ryen W. White, and Emine Yilmaz. 2011. Inferring and using location metadata to personalize web search. In Proceeding of the 34th International ACM SIGIR 2011. 135--144.
    [6]
    Paul N. Bennett, Ryen W. White, Wei Chu, Susan T. Dumais, Peter Bailey, Fedor Borisyuk, and Xiaoyuan Cui. 2012 Modeling the impact of short- and long-term behavior on search personalization. In ACM SIGIR '12. 185--194.
    [7]
    David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2001. Latent Dirichlet Allocation. In Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, NIPS 2001. 601--608.
    [8]
    Christopher J. C. Burges, Tal Shaked, Erin Renshaw, Ari Lazier, Matt Deeds, Nicole Hamilton, and Gregory N. Hullender. 2005. Learning to rank using gradient descent. In (ICML 2005). 89--96.
    [9]
    Chris J. C. Burges, Krysta M. Svore, Qiang Wu, and Jianfeng Gao. 2008. Ranking, Boosting, and Model Adaptation. Technical Report MSR-TR-2008--109. 18 pages.
    [10]
    Fei Cai, Shangsong Liang, and Maarten de Rijke. 2014. Personalized document re-ranking based on Bayesian probabilistic matrix factorization. In The 37th International ACM SIGIR '14. 835--838.
    [11]
    Mark James Carman, Fabio Crestani, Morgan Harvey, and Mark Baillie. 2010. Towards query log based personalization using topic models. In Proceedings of the 19th ACM CIKM 2010. 1849--1852.
    [12]
    Kyunghyun Cho, Bart van Merrienboer, cC aglar Gü lcc ehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. In Proceedings of EMNLP 2014. 1724--1734.
    [13]
    Kevyn Collins-Thompson, Paul N. Bennett, Ryen W. White, Sebastian de la Chica, and David Sontag. 2011. Personalizing web search results by reading level. In Proceedings of the 20th ACM CIKM 2011. 403--412.
    [14]
    Nick Craswell, Onno Zoeter, Michael J. Taylor, and Bill Ramsey. 2008. An experimental comparison of click position-bias models. In Proceedings of the WSDM 2008. 87--94.
    [15]
    Zhuyun Dai, Chenyan Xiong, Jamie Callan, and Zhiyuan Liu. 2018. Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search. In Proceedings of WSDM 2018. 126--134.
    [16]
    Zhicheng Dou, Ruihua Song, and Ji-Rong Wen. 2007. A large-scale evaluation and analysis of personalized search strategies. In Proceedings of the 16th International Conference on World Wide Web, WWW 2007.
    [17]
    Songwei Ge, Zhicheng Dou, Zhengbao Jiang, Jian-Yun Nie, and Ji-Rong Wen. 2018. Personalizing Search Results Using Hierarchical RNN with Query-aware Attention. In Proceedings of the CIKM 2018.
    [18]
    Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron C. Courville, and Yoshua Bengio. 2014. Generative Adversarial Networks. CoRR, Vol. abs/1406.2661 (2014).
    [19]
    Morgan Harvey, Fabio Crestani, and Mark James Carman. 2013. Building user profiles from topic models for personalised search. In 22nd ACM CIKM'13. 2309--2314.
    [20]
    Guangneng Hu. 2019. Personalized Neural Embeddings for Collaborative Filtering with Text. In Proceedings of the NAACL-HLT 2019. 2082--2088.
    [21]
    Jyun-Yu Jiang and Wei Wang. 2018. RIN: Reformulation Inference Network for Context-Aware Query Suggestion. In Proceedings of the CIKM 2018. 197--206.
    [22]
    Thorsten Joachims, Laura A. Granka, Bing Pan, Helene Hembrooke, and Geri Gay. 2005. Accurately interpreting clickthrough data as implicit feedback. In SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR. 154--161.
    [23]
    Cheng Li, Mingyang Zhang, Michael Bendersky, Hongbo Deng, Donald Metzler, and Marc Najork. 2019. Multi-view Embedding-based Synonyms for Email Search. In Proceedings of SIGIR 2019. 575--584.
    [24]
    Shuqi Lu, Zhicheng Dou, Xu Jun, Jian-Yun Nie, and Ji-Rong Wen. 2019. PSGAN: A Minimax Game for Personalized Search with Limited and Noisy Click Data. In Proceedings of SIGIR 2019. 555--564.
    [25]
    Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. In ICLR 2013, Proceedings.
    [26]
    Daisuke Oba, Naoki Yoshinaga, Shoetsu Sato, Satoshi Akasaki, and Masashi Toyoda. 2019. Modeling Personal Biases in Language Use by Inducing Personalized Word Embeddings. In Proceedings of the NAACL-HLT 2019. 2102--2108.
    [27]
    Stephen E. Robertson and Hugo Zaragoza. 2009. The Probabilistic Relevance Framework: BM25 and Beyond. Foundations and Trends in Information Retrieval, Vol. 3, 4 (2009), 333--389.
    [28]
    Sameendra Samarawickrama, Shanika Karunasekera, Aaron Harwood, and Ramamohanarao Kotagiri. 2017. Search Result Personalization in Twitter Using Neural Word Embeddings. In Big Data Analytics and Knowledge Discovery - 19th International Conference, DaWaK 2017. 244--258.
    [29]
    Ahu Sieg, Bamshad Mobasher, and Robin D. Burke. 2007. Web search personalization with ontological user profiles. In Proceedings of the CIKM 2007.
    [30]
    Yang Song, Hongning Wang, and Xiaodong He. 2014. Adapting deep RankNet for personalized search. In WSDM 2014. 83--92.
    [31]
    Alessandro Sordoni, Yoshua Bengio, Hossein Vahabi, Christina Lioma, Jakob Grue Simonsen, and Jian-Yun Nie. 2015. A Hierarchical Recurrent Encoder-Decoder for Generative Context-Aware Query Suggestion. In Proceedings of CIKM 2015. 553--562.
    [32]
    Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to Sequence Learning with Neural Networks. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13 2014, Montreal, Quebec, Canada. 3104--3112.
    [33]
    Jaime Teevan, Susan T. Dumais, and Daniel J. Liebling. 2008. To personalize or not to personalize: modeling queries with variation in user intent. In Proceedings of SIGIR 2008, Singapore, July 20-24, 2008. 163--170.
    [34]
    Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. In Advances in Neural Information Processing Systems 30.
    [35]
    Maksims Volkovs. 2015. Context Models For Web Search Personalization. CoRR, Vol. abs/1502.00527 (2015).
    [36]
    Thanh Vu, Dat Quoc Nguyen, Mark Johnson, Dawei Song, and Alistair Willis. 2017. Search Personalization with Embeddings. In Advances in Information Retrieval - 39th European Conference on IR Research, ECIR 2017.
    [37]
    Thanh Tien Vu, Alistair Willis, Son Ngoc Tran, and Dawei Song. 2015. Temporal Latent Topic User Profiles for Search Personalisation. In Advances in Information Retrieval - 37th European Conference on IR Research, ECIR 2015. 605--616.
    [38]
    Hongning Wang, Xiaodong He, Ming-Wei Chang, Yang Song, Ryen W. White, and Wei Chu. 2013. Personalized ranking model adaptation for web search. In The 36th International ACM SIGIR conference on research and development in Information Retrieval, SIGIR '13, 2013.
    [39]
    Jun Wang, Arjen P. de Vries, and Marcel J. T. Reinders. 2006. A User-Item Relevance Model for Log-Based Collaborative Filtering. In Advances in Information Retrieval, 28th European Conference on IR Research, ECIR 2006. 37--48.
    [40]
    Ryen W. White, Wei Chu, Ahmed Hassan Awadallah, Xiaodong He, Yang Song, and Hongning Wang. 2013. Enhancing personalized search by mining and modeling task behavior. In 22nd International World Wide Web Conference, WWW '13. 1411--1420.
    [41]
    Chenyan Xiong, Zhuyun Dai, Jamie Callan, Zhiyuan Liu, and Russell Power. 2017. End-to-End Neural Ad-hoc Ranking with Kernel Pooling. In Proceedings of SIGIR 2017. 55--64.

    Cited By

    View all
    • (2024)Intent-Oriented Dynamic Interest Modeling for Personalized Web SearchACM Transactions on Information Systems10.1145/363981742:4(1-30)Online publication date: 8-Jan-2024
    • (2024)Integrated Personalized and Diversified Search Based on Search LogsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.329100636:2(694-707)Online publication date: 1-Feb-2024
    • (2024)Maximizing Social Influence With Minimum Information AlterationIEEE Transactions on Emerging Topics in Computing10.1109/TETC.2023.329238412:2(419-431)Online publication date: Apr-2024
    • Show More Cited By

    Index Terms

    1. Employing Personal Word Embeddings for Personalized Search

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
      July 2020
      2548 pages
      ISBN:9781450380164
      DOI:10.1145/3397271
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 25 July 2020

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. personal word embedding
      2. personalized search
      3. user interest

      Qualifiers

      • Research-article

      Funding Sources

      • National Natural Science Foundation of China
      • Beijing Outstanding Young Scientist Program

      Conference

      SIGIR '20
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)81
      • Downloads (Last 6 weeks)9

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Intent-Oriented Dynamic Interest Modeling for Personalized Web SearchACM Transactions on Information Systems10.1145/363981742:4(1-30)Online publication date: 8-Jan-2024
      • (2024)Integrated Personalized and Diversified Search Based on Search LogsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.329100636:2(694-707)Online publication date: 1-Feb-2024
      • (2024)Maximizing Social Influence With Minimum Information AlterationIEEE Transactions on Emerging Topics in Computing10.1109/TETC.2023.329238412:2(419-431)Online publication date: Apr-2024
      • (2024)How to personalize and whether to personalize? Candidate documents decideKnowledge and Information Systems10.1007/s10115-024-02138-yOnline publication date: 27-May-2024
      • (2024)On-Device Query Auto-completion for Email SearchAdvances in Information Retrieval10.1007/978-3-031-56027-9_18(295-309)Online publication date: 20-Mar-2024
      • (2023)Personalized and Diversified: Ranking Search Results in an Integrated WayACM Transactions on Information Systems10.1145/363198942:3(1-25)Online publication date: 9-Nov-2023
      • (2023)CDSM: Cascaded Deep Semantic Matching on Textual Graphs Leveraging Ad-hoc Neighbor SelectionACM Transactions on Intelligent Systems and Technology10.1145/357320414:2(1-24)Online publication date: 16-Feb-2023
      • (2023)Incorporating Explicit Subtopics in Personalized SearchProceedings of the ACM Web Conference 202310.1145/3543507.3583488(3364-3374)Online publication date: 30-Apr-2023
      • (2023)Personalized Dynamic Attention Multi-task Learning model for document retrieval and query generationExpert Systems with Applications10.1016/j.eswa.2022.119026213(119026)Online publication date: Mar-2023
      • (2022)A GNN-based Multi-task Learning Framework for Personalized Video SearchProceedings of the Fifteenth ACM International Conference on Web Search and Data Mining10.1145/3488560.3498507(1386-1394)Online publication date: 11-Feb-2022
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media