research-article

Open access

GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants

Authors:

Jeffrey DaltonAuthors Info & Claims

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Pages 4951 - 4961

https://doi.org/10.1145/3637528.3671622

Published: 24 August 2024 Publication History

PDF eReader

Abstract

We tackle the challenge of building real-world multimodal assistants for complex real-world tasks. We describe the practicalities and challenges of developing and deploying GRILLBot, a leading (first and second prize winning in 2022 and 2023) system deployed in the Alexa Prize TaskBot Challenge. Building on our Open Assistant Toolkit (OAT) framework, we propose a hybrid architecture that leverages Large Language Models (LLMs) and specialised models tuned for specific subtasks requiring very low latency. OAT allows us to define when, how and which LLMs should be used in a structured and deployable manner. For knowledge-grounded question answering and live task adaptations, we show that LLM reasoning abilities over task context and world knowledge outweigh latency concerns. For dialogue state management, we implement a code generation approach and show that specialised smaller models have 84% effectiveness with 100x lower latency. Overall, we provide insights and discuss tradeoffs for deploying both traditional models and LLMs to users in complex real-world multimodal environments in the Alexa TaskBot challenge. These experiences will continue to evolve as LLMs become more capable and efficient -- fundamentally reshaping OAT and future assistant architectures.

Supplemental Material

MP4 File - Promotional Video for ADS 666

We present lessons learned, including tradeoffs for using LLMs, during the development of our conversational task assistant, GRILLBot. We created GRILLBot for the Alexa Prize TaskBot Challenge 1 & 2, during which users across the US could use GRILLBot as a multimodal assistant for cooking and home tasks. Research challenges include decision-making, grounded conversational question answering and task adaptation. We conclude that LLMs might not always be the answer - especially not for time-critical and high-accuracy-dependent components of a modular conversational agent.

Download
103.91 MB

References

[1]

Eugene Agichtein, Michael Johnston, Anna Gottardi, Cris Flagg, Lavina Vaz, Hangjie Shi, Desheng Zhang, Leslie Ball, Shaohua Liu, Luke Dai, et al. 2023. Alexa, let's work together: Introducing the second alexa prize taskbot challenge. 2nd Proceedings of the Alexa Prize Taskbot Challenge, Vol. 2 (2023).

Abstract

Supplemental Material

References

Cited By

Index Terms

Recommendations

Rating Prediction in Conversational Task Assistants with Behavioral and Conversational-Flow Features

Do Large Language Models Understand Conversational Implicature – A Case Study with a Chinese Sitcom

Enabling Conversational Interaction with Mobile UI using Large Language Models

Comments

Information

Published In

Sponsors

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations