Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3121050.3121069acmconferencesArticle/Chapter ViewAbstractPublication PagesictirConference Proceedingsconference-collections
research-article
Public Access

Investigating per Topic Upper Bound for Session Search Evaluation

Published: 01 October 2017 Publication History

Abstract

Session search is a complex Information Retrieval (IR) task. As a result, its evaluation is also complex. A great number of factors need to be considered in the evaluation of session search. They include document relevance, document novelty, aspect-related novelty discounting, and user's efforts in examining the documents. Due to increased complexity, most existing session search evaluation metrics are NP-hard. Consequently, the optimal value, i.e. the upper bound, of a metric highly varies with the actual search topics. In Cranfield-like settings such as the Text REtrieval Conference (TREC), scores for systems are usually averaged across all search topics. With undetermined upper bound values, however, it could be unfair to compare IR systems across different topics. This paper addresses the problem by investigating the actual per topic upper bounds of existing session search metrics. Through decomposing the metrics, we derive the upper bounds via mathematical optimization. We show that after being normalized by the bounds, the NP-hard session search metrics are then able to provide robust comparison across various search topics. The new normalized metrics are experimented on official runs submitted to the TREC 2016 Dynamic Domain (DD) Track.

References

[1]
Lars Backstrom and Jure Leskovec 2011. Supervised random walks: predicting and recommending links in social networks Proceedings of the fourth ACM international conference on Web search and data mining. ACM, 635--644.
[2]
Shumeet Baluja, Rohan Seth, D Sivakumar, Yushi Jing, Jay Yagnik, Shankar Kumar, Deepak Ravichandran, and Mohamed Aly. 2008. Video suggestion and discovery for youtube: taking random walks through the view graph Proceedings of the 17th international conference on World Wide Web. ACM, 895--904.
[3]
Roi Blanco and Christina Lioma 2012. Graph-based term weighting for information retrieval. Information retrieval Vol. 15, 1 (2012), 54--92.
[4]
Jiajun Bu, Shulong Tan, Chun Chen, Can Wang, Hao Wu, Lijun Zhang, and Xiaofei He. 2010. Music recommendation by unified hypergraph: combining social media information and music content. In Proceedings of the international conference on Multimedia. ACM, 391--400.
[5]
Pablo Castells, Miriam Fernández, David Vallet, Phivos Mylonas, and Yannis Avrithis. 2005. Self-tuning personalized information retrieval in an ontology-based framework On the Move to Meaningful Internet Systems 2005: OTM 2005 Workshops. Springer, 977--986.
[6]
Kevyn Collins-Thompson and Jamie Callan 2005. Query expansion using random walk models. In Proceedings of the 14th ACM international conference on Information and knowledge management. ACM, 704--711.
[7]
Nick Craswell and Martin Szummer 2007. Random walks on the click graph. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 239--246.
[8]
Gideon Dror, Yahoo Labs, Noam Koenigstein, Yehuda Koren, and Markus Weimer 2012. The Yahoo! music dataset and KDDCup'11. In JMLR Workshop and Conference Proceedings: Proceedings of KDD Cup 2011 Competition. 3--18.
[9]
Carsten Eickhoff, Jaime Teevan, Ryen White, and Susan Dumais. 2014. Lessons from the journey: A query log analysis of within-session learning Proceedings of the 7th ACM international conference on Web search and data mining. ACM, 223--232.
[10]
Yasuhiro Fujiwara, Makoto Nakatsuji, Hiroaki Shiokawa, Takeshi Mishima, and Makoto Onizuka. 2013. Efficient ad-hoc search for personalized pagerank. Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. ACM, 445--456.
[11]
Susan Gauch, Jason Chaffee, and Alexander Pretschner. 2003. Ontology-based personalized search and browsing. Web Intelligence and Agent Systems Vol. 1, 3--4 (2003), 219--234.
[12]
Chun Guo and Xiaozhong Liu 2015. Automatic Feature Generation on Heterogeneous Graph for Music Recommendation Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 807--810.
[13]
Aniko Hannak, Piotr Sapiezynski, Arash Molavi Kakhki, Balachander Krishnamurthy, David Lazer, Alan Mislove, and Christo Wilson. 2013. Measuring personalization of web search. In Proceedings of the 22nd international conference on World Wide Web. International World Wide Web Conferences Steering Committee, 527--538.
[14]
Yifan Hu, Y. Koren, and C. Volinsky 2008. Collaborative Filtering for Implicit Feedback Datasets Eighth IEEE International Conference on Data Mining. 263--272.

Cited By

View all
  • (2021)How Am I Doing?: Evaluating Conversational Search Systems OfflineACM Transactions on Information Systems10.1145/345116039:4(1-22)Online publication date: 17-Aug-2021
  • (2020)Cascade or RecencyProceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3397271.3401163(389-398)Online publication date: 25-Jul-2020
  • (2019)Jointly Modeling Relevance and Sensitivity for Search Among Sensitive ContentProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331256(615-624)Online publication date: 18-Jul-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICTIR '17: Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval
October 2017
348 pages
ISBN:9781450344906
DOI:10.1145/3121050
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2017

Permissions

Request permissions for this article.

Check for updates

Badges

  • Best Student Paper

Author Tags

  1. evaluation
  2. normalization
  3. session search

Qualifiers

  • Research-article

Funding Sources

Conference

ICTIR '17
Sponsor:

Acceptance Rates

ICTIR '17 Paper Acceptance Rate 27 of 54 submissions, 50%;
Overall Acceptance Rate 235 of 527 submissions, 45%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)47
  • Downloads (Last 6 weeks)10
Reflects downloads up to 26 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2021)How Am I Doing?: Evaluating Conversational Search Systems OfflineACM Transactions on Information Systems10.1145/345116039:4(1-22)Online publication date: 17-Aug-2021
  • (2020)Cascade or RecencyProceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3397271.3401163(389-398)Online publication date: 25-Jul-2020
  • (2019)Jointly Modeling Relevance and Sensitivity for Search Among Sensitive ContentProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331256(615-624)Online publication date: 18-Jul-2019
  • (2019)Memory-Augmented Dialogue Management for Task-Oriented Dialogue SystemsACM Transactions on Information Systems10.1145/331761237:3(1-30)Online publication date: 8-Jul-2019
  • (2019)A Markovian Approach to Evaluate Session-Based IR SystemsAdvances in Information Retrieval10.1007/978-3-030-15712-8_40(621-635)Online publication date: 14-Apr-2019
  • (2019)Meta-evaluation of Dynamic Search: How Do Metrics Capture Topical Relevance, Diversity and User Effort?Advances in Information Retrieval10.1007/978-3-030-15712-8_39(607-620)Online publication date: 7-Apr-2019
  • (2018)Report on the 2017 ACM SIGIR International Conference Theory of Information Retrieval (ICTIR?17)ACM SIGIR Forum10.1145/3190580.319059151:3(78-87)Online publication date: 22-Feb-2018

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media