Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–23 of 23 results for author: White, R W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.12923  [pdf, other

    cs.IR cs.AI cs.HC

    Panmodal Information Interaction

    Authors: Chirag Shah, Ryen W. White

    Abstract: The emergence of generative artificial intelligence (GenAI) is transforming information interaction. For decades, search engines such as Google and Bing have been the primary means of locating relevant information for the general population. They have provided search results in the same standard format (the so-called "10 blue links"). The recent ability to chat via natural language with AI-based a… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  2. arXiv:2404.04268  [pdf

    cs.IR cs.AI cs.CY cs.SI

    The Use of Generative Search Engines for Knowledge Work and Complex Tasks

    Authors: Siddharth Suri, Scott Counts, Leijie Wang, Chacha Chen, Mengting Wan, Tara Safavi, Jennifer Neville, Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Sathish Manivannan, Nagu Rangan, Longqi Yang

    Abstract: Until recently, search engines were the predominant method for people to access online information. The recent emergence of large language models (LLMs) has given machines new capabilities such as the ability to generate new digital artifacts like text, images, code etc., resulting in a new tool, a generative search engine, which combines the capabilities of LLMs with a traditional search engine.… ▽ More

    Submitted 19 March, 2024; originally announced April 2024.

    Comments: 32 pages, 3 figures, 4 tables

    ACM Class: J.4

  3. arXiv:2403.12173  [pdf, other

    cs.CL cs.AI cs.IR

    TnT-LLM: Text Mining at Scale with Large Language Models

    Authors: Mengting Wan, Tara Safavi, Sujay Kumar Jauhar, Yujin Kim, Scott Counts, Jennifer Neville, Siddharth Suri, Chirag Shah, Ryen W White, Longqi Yang, Reid Andersen, Georg Buscher, Dhruv Joshi, Nagu Rangan

    Abstract: Transforming unstructured text into structured and meaningful forms, organized by useful category labels, is a fundamental step in text mining for downstream analysis and application. However, most existing methods for producing label taxonomies and building text-based label classifiers still rely heavily on domain expertise and manual curation, making the process expensive and time-consuming. Thi… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 9 pages main content, 8 pages references and appendix

  4. arXiv:2311.01235  [pdf, other

    cs.IR cs.AI

    Advancing the Search Frontier with AI Agents

    Authors: Ryen W. White

    Abstract: As many of us in the information retrieval (IR) research community know and appreciate, search is far from being a solved problem. Millions of people struggle with tasks on search engines every day. Often, their struggles relate to the intrinsic complexity of their task and the failure of search systems to fully understand the task and serve relevant results. The task motivates the search, creatin… ▽ More

    Submitted 2 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 11 pages, 6 figures, Accepted for publication in Communications of the ACM

  5. arXiv:2310.20111  [pdf, other

    cs.CL

    Making Large Language Models Better Data Creators

    Authors: Dong-Ho Lee, Jay Pujara, Mohit Sewak, Ryen W. White, Sujay Kumar Jauhar

    Abstract: Although large language models (LLMs) have advanced the state-of-the-art in NLP significantly, deploying them for downstream applications is still challenging due to cost, responsiveness, control, or concerns around privacy and security. As such, trainable models are still the preferred option in some cases. However, these models still require human-labeled data for optimal performance, which is e… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 main conference. 12 pages, 5 figures, 6 tables. Code is available at https://github.com/microsoft/llm-data-creation

  6. arXiv:2309.13063  [pdf, other

    cs.IR cs.AI cs.CL

    Using Large Language Models to Generate, Validate, and Apply User Intent Taxonomies

    Authors: Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Scott Counts, Sarkar Snigdha Sarathi Das, Ali Montazer, Sathish Manivannan, Jennifer Neville, Xiaochuan Ni, Nagu Rangan, Tara Safavi, Siddharth Suri, Mengting Wan, Leijie Wang, Longqi Yang

    Abstract: Log data can reveal valuable information about how users interact with Web search services, what they want, and how satisfied they are. However, analyzing user intents in log data is not easy, especially for emerging forms of Web search such as AI-driven chat. To understand user intents from log data, we need a way to label them with meaningful categories that capture their diversity and dynamics.… ▽ More

    Submitted 9 May, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Report number: MSR-TR-2023-32

  7. arXiv:2308.16095  [pdf, other

    cs.CY cs.SI

    Food Choice Mimicry on a Large University Campus

    Authors: Kristina Gligoric, Arnaud Chiolero, Emre Kıcıman, Ryen W. White, Eric Horvitz, Robert West

    Abstract: Social influence is a strong determinant of food consumption, which in turn influences health. Although consistent observations have been made on the role of social factors in driving similarities in food consumption, much less is known about the precise governing mechanisms. We study social influence on food choice through carefully designed causal analyses, leveraging the sequential nature of sh… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  8. arXiv:2308.08155  [pdf, other

    cs.AI cs.CL

    AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

    Authors: Qingyun Wu, Gagan Bansal, Jieyu Zhang, Yiran Wu, Beibin Li, Erkang Zhu, Li Jiang, Xiaoyun Zhang, Shaokun Zhang, Jiale Liu, Ahmed Hassan Awadallah, Ryen W White, Doug Burger, Chi Wang

    Abstract: AutoGen is an open-source framework that allows developers to build LLM applications via multiple agents that can converse with each other to accomplish tasks. AutoGen agents are customizable, conversable, and can operate in various modes that employ combinations of LLMs, human inputs, and tools. Using AutoGen, developers can also flexibly define agent interaction behaviors. Both natural language… ▽ More

    Submitted 3 October, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: 43 pages (10 pages for the main text, 3 pages for references, and 30 pages for appendices)

  9. arXiv:2301.05046  [pdf, other

    cs.IR

    Taking Search to Task

    Authors: Chirag Shah, Ryen W. White, Paul Thomas, Bhaskar Mitra, Shawon Sarkar, Nicholas Belkin

    Abstract: The importance of tasks in information retrieval (IR) has been long argued for, addressed in different ways, often ignored, and frequently revisited. For decades, scholars made a case for the role that a user's task plays in how and why that user engages in search and what a search system should do to assist. But for the most part, the IR community has been too focused on query processing and assu… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

  10. arXiv:2208.03443  [pdf, other

    cs.HC

    Imagining Future Digital Assistants at Work: A Study of Task Management Needs

    Authors: Yonchanok Khaokaew, Indigo Holcombe-James, Mohammad Saiedur Rahaman, Jonathan Liono, Johanne R. Trippas, Damiano Spina, Nicholas Belkin, Peter Bailey, Paul N. Bennett, Yongli Ren, Mark Sanderson, Falk Scholer, Ryen W. White, Flora D. Salim

    Abstract: Digital Assistants (DAs) can support workers in the workplace and beyond. However, target user needs are not fully understood, and the functions that workers would ideally want a DA to support require further study. A richer understanding of worker needs could help inform the design of future DAs. We investigate user needs of future workplace DAs using data from a user study of 40 workers over a f… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 59 pages

  11. Understanding Questions that Arise When Working with Business Documents

    Authors: Farnaz Jahanbakhsh, Elnaz Nouri, Robert Sim, Ryen W. White, Adam Fourney

    Abstract: While digital assistants are increasingly used to help with various productivity tasks, less attention has been paid to employing them in the domain of business documents. To build an agent that can handle users' information needs in this domain, we must first understand the types of assistance that users desire when working on their documents. In this work, we present results from two user studie… ▽ More

    Submitted 7 September, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    ACM Class: H.5.3

  12. arXiv:2111.06902  [pdf, other

    cs.CL cs.AI

    MS-LaTTE: A Dataset of Where and When To-do Tasks are Completed

    Authors: Sujay Kumar Jauhar, Nirupama Chandrasekaran, Michael Gamon, Ryen W. White

    Abstract: Tasks are a fundamental unit of work in the daily lives of people, who are increasingly using digital means to keep track of, organize, triage and act on them. These digital tools -- such as task management applications -- provide a unique opportunity to study and understand tasks and their connection to the real world, and through intelligent assistance, help people be more productive. By logging… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

  13. Population-scale dietary interests during the COVID-19 pandemic

    Authors: Kristina Gligoric, Arnaud Chiolero, Emre Kıcıman, Ryen W. White, Robert West

    Abstract: The SARS-CoV-2 virus has altered people's lives around the world. Here we document population-wide shifts in dietary interests in 18 countries in 2020, as revealed through time series of Google search volumes. We find that during the first wave of the COVID-19 pandemic there was an overall surge in food interest, larger and longer-lasting than the surge during typical end-of-year holidays in Weste… ▽ More

    Submitted 25 February, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: Nature Communications (2022)

  14. CoSEM: Contextual and Semantic Embedding for App Usage Prediction

    Authors: Yonchanok Khaokaew, Mohammad Saiedur Rahaman, Ryen W. White, Flora D. Salim

    Abstract: App usage prediction is important for smartphone system optimization to enhance user experience. Existing modeling approaches utilize historical app usage logs along with a wide range of semantic information to predict the app usage; however, they are only effective in certain scenarios and cannot be generalized across different situations. This paper address this problem by developing a model cal… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: 5 pages, short paper in Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM '21)

  15. Formation of Social Ties Influences Food Choice: A Campus-Wide Longitudinal Study

    Authors: Kristina Gligorić, Ryen W. White, Emre Kıcıman, Eric Horvitz, Arnaud Chiolero, Robert West

    Abstract: Nutrition is a key determinant of long-term health, and social influence has long been theorized to be a key determinant of nutrition. It has been difficult to quantify the postulated role of social influence on nutrition using traditional methods such as surveys, due to the typically small scale and short duration of studies. To overcome these limitations, we leverage a novel source of data: logs… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Journal ref: Proc. ACM Hum.-Comput. Interact.5, CSCW1, Article 184 (April 2021)

  16. arXiv:2008.07045  [pdf, other

    cs.CY cs.IR

    Population-Scale Study of Human Needs During the COVID-19 Pandemic: Analysis and Implications

    Authors: Jina Suh, Eric Horvitz, Ryen W. White, Tim Althoff

    Abstract: Most work to date on mitigating the COVID-19 pandemic is focused urgently on biomedicine and epidemiology. Yet, pandemic-related policy decisions cannot be made on health information alone. Decisions need to consider the broader impacts on people and their needs. Quantifying human needs across the population is challenging as it requires high geo-temporal granularity, high coverage across the popu… ▽ More

    Submitted 14 January, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

  17. arXiv:2006.12999  [pdf, other

    cs.AI cs.IR

    Optimizing Interactive Systems via Data-Driven Objectives

    Authors: Ziming Li, Julia Kiseleva, Alekh Agarwal, Maarten de Rijke, Ryen W. White

    Abstract: Effective optimization is essential for real-world interactive systems to provide a satisfactory user experience in response to changing user behavior. However, it is often challenging to find an objective to optimize for interactive systems (e.g., policy learning in task-oriented dialog systems). Generally, such objectives are manually crafted and rarely capture complex user needs in an accurate… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: 30 pages, 12 figures. arXiv admin note: text overlap with arXiv:1802.06306

  18. arXiv:2002.00747  [pdf, other

    cs.CL cs.AI cs.HC cs.IR

    Conversations with Documents. An Exploration of Document-Centered Assistance

    Authors: Maartje ter Hoeve, Robert Sim, Elnaz Nouri, Adam Fourney, Maarten de Rijke, Ryen W. White

    Abstract: The role of conversational assistants has become more prevalent in helping people increase their productivity. Document-centered assistance, for example to help an individual quickly review a document, has seen less significant progress, even though it has the potential to tremendously increase a user's productivity. This type of document-centered assistance is the focus of this paper. Our contrib… ▽ More

    Submitted 27 January, 2020; originally announced February 2020.

    Comments: Accepted as full paper at CHIIR 2020; 9 pages + Appendix

  19. arXiv:1712.03622  [pdf, other

    cs.IR cs.HC

    Interactions between Health Searchers and Search Engines

    Authors: George Philipp, Ryen W. White

    Abstract: The Web is an important resource for understanding and diagnosing medical conditions. Based on exposure to online content, people may develop undue health concerns, believing that common and benign symptoms are explained by serious illnesses. In this paper, we investigate potential strategies to mine queries and searcher histories for clues that could help search engines choose the most appropriat… ▽ More

    Submitted 10 December, 2017; originally announced December 2017.

    Comments: SIGIR 2014

  20. arXiv:1701.07083  [pdf, other

    cs.HC cs.CY cs.IR q-bio.NC

    Harnessing the Web for Population-Scale Physiological Sensing: A Case Study of Sleep and Performance

    Authors: Tim Althoff, Eric Horvitz, Ryen W. White, Jamie Zeitzer

    Abstract: Human cognitive performance is critical to productivity, learning, and accident avoidance. Cognitive performance varies throughout each day and is in part driven by intrinsic, near 24-hour circadian rhythms. Prior research on the impact of sleep and circadian rhythms on cognitive performance has typically been restricted to small-scale laboratory-based studies that do not capture the variability o… ▽ More

    Submitted 24 February, 2017; v1 submitted 21 January, 2017; originally announced January 2017.

    Comments: Published in Proceedings of WWW 2017

  21. arXiv:1612.07896  [pdf, other

    cs.AI cs.LG

    A Base Camp for Scaling AI

    Authors: C. J. C. Burges, T. Hart, Z. Yang, S. Cucerzan, R. W. White, A. Pastusiak, J. Lewis

    Abstract: Modern statistical machine learning (SML) methods share a major limitation with the early approaches to AI: there is no scalable way to adapt them to new domains. Human learning solves this in part by leveraging a rich, shared, updateable world model. Such scalability requires modularity: updating part of the world model should not impact unrelated parts. We have argued that such modularity will r… ▽ More

    Submitted 23 December, 2016; originally announced December 2016.

  22. arXiv:1610.02085  [pdf, other

    cs.CY cs.HC cs.IR

    Influence of Pokémon Go on Physical Activity: Study and Implications

    Authors: Tim Althoff, Ryen W. White, Eric Horvitz

    Abstract: Physical activity helps people maintain a healthy weight and reduces the risk for several chronic diseases. Although this knowledge is widely recognized, adults and children in many countries around the world do not get recommended amounts of physical activity. While many interventions are found to be ineffective at increasing physical activity or reaching inactive populations, there have been ane… ▽ More

    Submitted 27 October, 2016; v1 submitted 6 October, 2016; originally announced October 2016.

  23. arXiv:1304.3742  [pdf, other

    cs.CY cs.IR physics.soc-ph

    From Cookies to Cooks: Insights on Dietary Patterns via Analysis of Web Usage Logs

    Authors: Robert West, Ryen W. White, Eric Horvitz

    Abstract: Nutrition is a key factor in people's overall health. Hence, understanding the nature and dynamics of population-wide dietary preferences over time and space can be valuable in public health. To date, studies have leveraged small samples of participants via food intake logs or treatment data. We propose a complementary source of population data on nutrition obtained via Web logs. Our main contribu… ▽ More

    Submitted 12 April, 2013; originally announced April 2013.

    Comments: WWW 2013, 11 pages, 11 figures

    ACM Class: H.2.8