Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1835449.1835538acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

A user behavior model for average precision and its generalization to graded judgments

Published: 19 July 2010 Publication History

Abstract

We explore a set of hypothesis on user behavior that are potentially at the origin of the (Mean) Average Precision (AP) metric. This allows us to propose a more realistic version of AP where users click non-deterministically on relevant documents and where the number of relevant documents in the collection needs not be known in advance. We then depart from the assumption that a document is either relevant or irrelevant and we use instead relevance judgment similar to editorial labels used for Discounted Cumulated Gain (DCG). We assume that clicked documents provide users with a certain level of "utility" and that a user ends a search when she gathered enough utility. Based on the query logs of a commercial search engine we show how to evaluate the utility associated with a label from the record of past user interactions with the search engine and we show how the two different user models can be evaluated based on their ability to predict accurately future clicks. Finally, based on these user models, we propose a measure that captures the relative quality of two rankings.

References

[1]
G. Dupret. User models to compare and evaluate web IR metrics. In Proceedings of SIGIR 2009 Workshop on The Future of IR Evaluation, 2009.
[2]
G. Dupret and C. Liao. Estimating intrinsic document relevance from clicks. In Proceedings of the 3rd WSDM conference, 2010.
[3]
D. Kelly. Methods for Evaluating Interactive Information Retrieval Systems with Users, volume 3 of Foundations and Trends in Information Retrieval. 2009.
[4]
A. Moffat and J. Zobel. Rank-biased precision for measurement of retrieval effectiveness. ACM Trans. Inf. Syst., 27(1):1--27, 2008.
[5]
S. Robertson. A new interpretation of average precision. In Proceedings of SIGIR'08, pages 689--690, New York, NY, USA, 2008. ACM.
[6]
E. M. Voorhees and D. Harman, editors. TREC: Experiment and Evaluation in Information Retrieval. MIT press, 2005.
[7]
K. Wang, T. Walker, and Z. Zheng. Pskip: estimating relevance ranking quality from web search clickthrough data. In Proceedings of the 15th ACM SIGKDD, pages 1355--1364, New York, NY, USA, 2009. ACM. More complete references for this work can be found in {1}.

Cited By

View all
  • (2024)Semantic aware representation learning for optimizing image retrieval systems in radiologyPattern Recognition10.1016/j.patcog.2024.111060(111060)Online publication date: Oct-2024
  • (2024)User-oriented metrics for search engine deterministic sort ordersInformation Processing & Management10.1016/j.ipm.2023.10354761:1(103547)Online publication date: Jan-2024
  • (2024)How much freedom does an effectiveness metric really have?Journal of the Association for Information Science and Technology10.1002/asi.24874Online publication date: 15-Feb-2024
  • Show More Cited By

Index Terms

  1. A user behavior model for average precision and its generalization to graded judgments

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
      July 2010
      944 pages
      ISBN:9781450301534
      DOI:10.1145/1835449
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 19 July 2010

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. metrics
      2. search engines
      3. statistical model
      4. user behavior
      5. user model

      Qualifiers

      • Research-article

      Conference

      SIGIR '10
      Sponsor:

      Acceptance Rates

      SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;
      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)10
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 23 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Semantic aware representation learning for optimizing image retrieval systems in radiologyPattern Recognition10.1016/j.patcog.2024.111060(111060)Online publication date: Oct-2024
      • (2024)User-oriented metrics for search engine deterministic sort ordersInformation Processing & Management10.1016/j.ipm.2023.10354761:1(103547)Online publication date: Jan-2024
      • (2024)How much freedom does an effectiveness metric really have?Journal of the Association for Information Science and Technology10.1002/asi.24874Online publication date: 15-Feb-2024
      • (2022)A Flexible Framework for Offline Effectiveness MetricsProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531924(578-587)Online publication date: 6-Jul-2022
      • (2022)Batch Evaluation Metrics in Information Retrieval: Measures, Scales, and MeaningIEEE Access10.1109/ACCESS.2022.321166810(105564-105577)Online publication date: 2022
      • (2022)Hierarchical Average Precision Training for Pertinent Image RetrievalComputer Vision – ECCV 202210.1007/978-3-031-19781-9_15(250-266)Online publication date: 23-Oct-2022
      • (2020)A Prototype System of Search: Finding Short Material for Science Education in Long and High-Definition Documentary VideosArtificial Intelligence Supported Educational Technologies10.1007/978-3-030-41099-5_7(115-132)Online publication date: 30-Apr-2020
      • (2013)Improving the Compression Efficiency for News Web Service Using Semantic Relations Among WebpagesInternational Journal of Cognitive Informatics and Natural Intelligence10.4018/ijcini.20130401047:2(49-64)Online publication date: 1-Apr-2013
      • (2013)Report on the SIGIR 2013 workshop on modeling user behavior for information retrieval evaluation (MUBE 2013)ACM SIGIR Forum10.1145/2568388.256840347:2(84-95)Online publication date: 21-Jan-2013
      • (2013)Users versus modelsProceedings of the 22nd ACM international conference on Information & Knowledge Management10.1145/2505515.2507665(659-668)Online publication date: 27-Oct-2013
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media