Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2983323.2983805acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Agents, Simulated Users and Humans: An Analysis of Performance and Behaviour

Published: 24 October 2016 Publication History

Abstract

Most of the current models that are used to simulate users in Interactive Information Retrieval (IIR) lack realism and agency. Such models generally make decisions in a stochastic manner, without recourse to the actual information encountered or the underlying information need. In this paper, we develop a more sophisticated model of the user that includes their cognitive state within the simulation. The cognitive state maintains data about what the simulated user knows, has done and has seen, along with representations of what it considers attractive and relevant. Decisions to inspect or judge are then made based upon the simulated user's current state, rather than stochastically. In the context of ad-hoc topic retrieval, we evaluate the quality of the simulated users and agents by comparing their behaviour and performance against 48 human subjects under the same conditions, topics, time constraints, costs and search engine. Our findings show that while naive configurations of simulated users and agents substantially outperform our human subjects, their search behaviour is notably different from actual searchers. However, more sophisticated search agents can be tuned to act more like actual searchers providing greater realism. This innovation advances the state of the art in simulation, from simulated users towards autonomous agents. It provides a much needed step forward enabling the creation of more realistic simulations, while also motivating the development of more advanced cognitive agents and tools to help support and augment human searchers. Future work will focus not only on the pragmatics of tuning and training such agents for topic retrieval, but will also look at developing agents for other tasks and contexts such as collaborative search and slow search.

References

[1]
G. Amati and C.J. Van Rijsbergen. Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM TOIS, 20 (4): 357--389, 2002.
[2]
R. Armstrong, D. Freitag, T. Joachims, and T. Mitchell. Webwatcher: A learning apprentice for the world wide web. In Proc. AAAI Spring Symp. on Info. Gathering from Heterogeneous, Distributed Environments, pages 6--12, 1995.
[3]
L. Azzopardi. Query side evaluation: An empirical analysis of effectiveness and effort. In Proceedings of the 32nd ACM SIGIR, pages 556--563, 2009.
[4]
L. Azzopardi. The economics in interactive information retrieval. In Proc. 34th ACM SIGIR, pages 15--24, 2011.
[5]
L. Azzopardi, M. de Rijke, and K. Balog. Building simulated queries for known-item topics: An analysis using six european languages. In Proc. 30th ACM SIGIR, pages 455--462, 2007.
[6]
L. Azzopardi, K. Järvelin, J. Kamps, and M.D. Smucker. Report on the sigir 2010 workshop on the simulation of interaction. SIGIR Forum, 44 (2): 35--47, 2011.
[7]
K. Balog. Task-completion engines: A vision with a plan. In Proc. 1st SCST, 2015.
[8]
F. Baskaya, H. Keskustalo, and K. Järvelin. Time drives interaction: Simulating sessions in diverse searching environments. In Proc. 35th ACM SIGIR, pages 105--114, 2012.
[9]
F. Baskaya, H. Keskustalo, and K. Järvelin. Modeling behavioral factors in interactive information retrieval. In Proc. 22nd ACM CIKM, pages 2297--2302, 2013.
[10]
M.J. Bates. The design of browsing and berrypicking techniques for the online search interface. Online Information Review, 13 (5): 407--424, 1989.
[11]
N.J. Belkin. Anomalous states of knowledge as a basis for IR. Canadian J. of Info. Sci., (5): 133--143, 1980.
[12]
N.J. Belkin. The cognitive viewpoint in information science. J. Inf. Sci., 16 (1): 11--15, 1990.
[13]
K.D. Bollacker, S. Lawrence, and C.L. Giles. Citeseer: An autonomous web agent for automatic retrieval and identification of interesting publications. In Proc. 2nd AGENTS, pages 116--123, 1998.
[14]
P. Borlund. The iir evaluation model: a framework for evaluation of iir systems. Info. research, 8 (3), 2003.
[15]
G. Buchanan and F. Loizides. Investigating document triage on paper and electronic media. In Proc. 11th ECDL, pages 416--427, 2007.
[16]
B. Carterette, E. Kanoulas, and E. Yilmaz. Simulating simple user behavior for system effectiveness evaluation. In Proc. 20th ACM CIKM, pages 611--620, 2011.
[17]
B. Carterette, A. Bah, and M. Zengin. Dynamic test collections for retrieval evaluation. In Proc. 5th ACM ICTIR, pages 91--100, 2015.
[18]
A. Chuklin, I. Markov, and M. de Rijke. Click Models for Web Search. SLoICRS. 2015.
[19]
C.L.A. Clarke, L. Freund, M.D. Smucker, and E. Yilmaz. Report on the sigir 2013 mube workshop. SIGIR Forum, 47 (2): 84--95, 2013.
[20]
W.S. Cooper. On selecting a measure of retrieval effectiveness part ii. implementation of the philosophy. J. of the American Soc. for Info. Sci., 24 (6): 413--424, 1973.
[21]
D. D'Aloisi, V. Giannini, and F.U. Bordoni. The info agent: an interface for supporting users in intelligent retrieval. In In Proc. ERCIM Workshop, pages 143--155, 1995.
[22]
P. Edwards, C.L. Green, P.C. Lockier, and T.C. Lukins. Exploiting learning technologies for world wide web agents. In IEEE Coll. on Intelligent WWW Agents, pages 3--1, 1997.
[23]
C. Eickhoff, S. Dungs, and V. Tran. An eye-tracking study of query reformulation. In Proc. 38th ACM SIGIR, pages 13--22, 2015.
[24]
O. Etzioni and D. Weld. A softbot-based interface to the internet. Commun. ACM, 37 (7): 72--76, 1994.
[25]
W-T Fu and P. Pirolli. Snif-act: A cognitive model of user navigation on the world wide web. Human Computer Interaction, 22 (4): 355--412, 2007.
[26]
F. Guo, C. Liu, A. Kannan, T. Minka, M. Taylor, Y. Wang, and C. Faloutsos. Click chain model in web search. In Proc. 18th WWW, pages 11--20, 2009.
[27]
D. Harman. Relevance feedback revisited. In Proc. 15th ACM SIGIR, pages 1--10, New York, NY, USA, 1992. ACM.
[28]
P. Ingwersen. Information Retrieval Interaction. 1992.
[29]
P. Ingwersen and K. Järvelin. The Turn: Integration of Information Seeking and Retrieval in Context. 2005.
[30]
B.J. Jansen and S.Y. Rieh. The seventeen theoretical constructs of information searching and information retrieval. JASIST, 61 (8): 1517--1534, 2010.
[31]
K. Järvelin. Interactive relevance feedback with graded relevance and sentence extraction: Simulated user experiments. In Proc. 18th ACM CIKM, pages 2053--2056, 2009.
[32]
K. Järvelin and J. Kekäläinen. Cumulative gain-based evaluation of IR techniques. ACM TOIS, 20 (4): 422--446, 2002.
[33]
T. Joachims. Optimizing search engines using clickthrough data. In Proc. 8th ACM SIGKDD, pages 133--142, 2002.
[34]
C. Jordan, C. Watters, and Q. Gao. Using controlled query generation to evaluate blind relevance feedback algorithms. In Proc. 6th ACM/IEEE-CS JCDL, pages 286--295, 2006.
[35]
H. Keskustalo, K. Järvelin, and A. Pirkola. The effects of relevance feedback quality and quantity in interactive relevance feedback: A simulation based on user modeling. In Advances in IR, volume 3936 of LNCS, pages 191--204. 2006.
[36]
H. Keskustalo, K. Järvelin, and A. Pirkola. Evaluating the effectiveness of relevance feedback based on a user simulation model: Effects of a user scenario on cumulated gain value. Information Retrieval, 11 (3): 209--228, 2008.
[37]
H. Keskustalo, K. Järvelin, A. Pirkola, T. Sharma, and M. Lykke. Test collection-based ir evaluation needs extension toward sessions -- a case of extremely short queries. In Proc. 5th AIRS, pages 63--74, 2009.
[38]
D.H. Kraft and T. Lee. Stopping rules and their effect on expected search length. IPM, 15 (1): 47--58, 1979.
[39]
K. Lang. Newsweeder: Learning to filter netnews. In Proc. 12th Intl. Machine Learning Conference, 1995.
[40]
Y. Lashkari. The webhound personalized document filtering system. Technical report, 1995.
[41]
A. Leuski. Relevance and reinforcement in interactive browsing. In Proc. 9th ACM CIKM, pages 119--126, 2000.
[42]
H. Lieberman. Letizia: An agent that assists web browsing. In Proc. 14th IJCAI, pages 924--929, 1995.
[43]
H. Lieberman. Autonomous interface agents. In Proc. 15th SIGCHI, pages 67--74, 1997.
[44]
J. Luo, S. Zhang, and H. Yang. Win-win search: Dual-agent stochastic game in session search. In Proc. 37th ACM SIGIR, pages 587--596, 2014.
[45]
J. Luo, S. Zhang, X. Dong, and H Yang. Proc. 37th ECIR, chapter Designing States, Actions, and Rewards for Using POMDP in Session Search, pages 526--537. 2015.
[46]
G. Marchionini. Information-seeking strategies of novices using a full-text electronic encyclopedia. J. Am. Soc. Inf. Sci., 40 (1): 54--66, 1989.
[47]
D. Maxwell and L. Azzopardi. Stuck in traffic: How temporal delays affect search behaviour. In Proc. 5th IIiX, pages 155--164, 2014.
[48]
D. Maxwell and L. Azzopardi. Simulating interactive information retrieval. In Proc. 39th ACM SIGIR, pages 1141--1144, 2016.
[49]
D. Maxwell, L. Azzopardi, K. Järvelin, and H. Keskustalo. An initial investigation into fixed and adaptive stopping strategies. In Proc. 38th ACM SIGIR, pages 903--906, 2015.
[50]
D. Maxwell, L. Azzopardi, K. Järvelin, and H. Keskustalo. Searching and stopping: An analysis of stopping rules and strategies. In Proc. 24th ACM CIKM, pages 313--322, 2015.
[51]
E. Meij, W. Weerkamp, and M. de Rijke. A query model based on normalized log-likelihood. In Proc. 18th ACM CIKM, pages 1903--1906, 2009.
[52]
T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. CoRR, 2013.
[53]
A. Moffat, P. Thomas, and F. Scholer. Users versus models: What observation tells us about effectiveness metrics. In Proc. 22nd ACM CIKM, pages 659--668, 2013.
[54]
T. Pääkkönen, K. Järvelin, J. Kekäläinen, H. Keskustalo, F. Baskaya, D. Maxwell, and L. Azzopardi. Exploring behavioral dimensions in session effectiveness. In Proc. 6th CLEF, pages 178--189, 2015.
[55]
B.J. Rhodes and T. Starner. Remembrance agent: A continuously running automated information retrieval system. In Proc. 1st PAAM, pages 487--495, 1996.
[56]
I. Ruthven. Re-examining the potential effectiveness of interactive query expansion. In Proc. 26th ACM SIGIR, pages 213--220, 2003.
[57]
M.D. Smucker. An analysis of user strategies for examining and processing ranked lists of documents. In Proc. of 5th HCIR, 2011.
[58]
J. Tague, M. Nelson, and H. Wu. Problems in the simulation of bibliographic retrieval systems. In Proc. 3rd ACM SIGIR, pages 236--255, 1980.
[59]
P. Thomas, A. Moffat, P. Bailey, and F. Scholer. Modeling decision points in user search behavior. In Proc. 5th IIiX, pages 239--242, 2014.
[60]
S. Verberne, M. Sappelli, K. Järvelin, and W. Kraaij. User simulations for interactive search: Evaluating personalized query suggestion. In Advances in Information Retrieval, volume 9022 of LNCS. 2015.
[61]
E. Voorhees and D. Harman. TREC: Experiment and Evaluation in Information Retrieval. The MIT press, 2005.
[62]
R.W. White, J.M. Jose, C.J. van Rijsbergen, and I. Ruthven. A simulated study of implicit feedback models. In Advances in Information Retrieval, volume 2997 of LNCS, pages 311--326. 2004.
[63]
R.W. White, S.T. Dumais, and J. Teevan. Characterizing the influence of domain expertise on web search behavior. In Proc 2nd ACM WSDM, pages 132--141, 2009.
[64]
W. Wu, D. Kelly, and A. Sud. Using information scent and need for cognition to understand online search behavior. In Proc 37th ACM SIGIR, pages 557--566, 2014.
[65]
G. Zuccon, B. Koopman, P. Bruza, and L. Azzopardi. Integrating and evaluating neural word embeddings in information retrieval. In Proc. 20th ADCS, pages 12:1--12:8, 2015.

Cited By

View all
  • (2024)Towards Detecting and Mitigating Cognitive Bias in Spoken Conversational SearchAdjunct Proceedings of the 26th International Conference on Mobile Human-Computer Interaction10.1145/3640471.3680245(1-10)Online publication date: 21-Sep-2024
  • (2024)Evaluating Generative Ad Hoc Information RetrievalProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657849(1916-1929)Online publication date: 10-Jul-2024
  • (2024)Tutorial on User Simulation for Evaluating Information Access Systems on the WebCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3641243(1254-1257)Online publication date: 13-May-2024
  • Show More Cited By

Index Terms

  1. Agents, Simulated Users and Humans: An Analysis of Performance and Behaviour

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
    October 2016
    2566 pages
    ISBN:9781450340731
    DOI:10.1145/2983323
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 October 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. agents
    2. autonomous agents
    3. continuation strategies
    4. interactive information retrieval
    5. querying strategies
    6. simulation
    7. stopping strategies
    8. user modeling

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    CIKM'16
    Sponsor:
    CIKM'16: ACM Conference on Information and Knowledge Management
    October 24 - 28, 2016
    Indiana, Indianapolis, USA

    Acceptance Rates

    CIKM '16 Paper Acceptance Rate 160 of 701 submissions, 23%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)34
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 03 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Towards Detecting and Mitigating Cognitive Bias in Spoken Conversational SearchAdjunct Proceedings of the 26th International Conference on Mobile Human-Computer Interaction10.1145/3640471.3680245(1-10)Online publication date: 21-Sep-2024
    • (2024)Evaluating Generative Ad Hoc Information RetrievalProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657849(1916-1929)Online publication date: 10-Jul-2024
    • (2024)Tutorial on User Simulation for Evaluating Information Access Systems on the WebCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3641243(1254-1257)Online publication date: 13-May-2024
    • (2023)User Simulation for Evaluating Information Access SystemsProceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3624918.3629549(302-305)Online publication date: 26-Nov-2023
    • (2023)Validating Synthetic Usage Data in Living Lab EnvironmentsJournal of Data and Information Quality10.1145/3623640Online publication date: 24-Sep-2023
    • (2023)Metaphorical User Simulators for Evaluating Task-oriented Dialogue SystemsACM Transactions on Information Systems10.1145/359651042:1(1-29)Online publication date: 22-May-2023
    • (2023)Tutorial on User Simulation for Evaluating Information Access SystemsProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615296(5200-5203)Online publication date: 21-Oct-2023
    • (2023)Simulating Users in Interactive Web Table RetrievalProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615187(3875-3879)Online publication date: 21-Oct-2023
    • (2023)A Synthetic Search Session Generator for Task-Aware Information Seeking and RetrievalProceedings of the Sixteenth ACM International Conference on Web Search and Data Mining10.1145/3539597.3573041(1156-1159)Online publication date: 27-Feb-2023
    • (2023)User Behavior Simulation for Search Result Re-rankingACM Transactions on Information Systems10.1145/351146941:1(1-35)Online publication date: 20-Jan-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media