Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3173574.3173580acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
research-article

Patterns for How Users Overcome Obstacles in Voice User Interfaces

Published: 19 April 2018 Publication History

Abstract

Voice User Interfaces (VUIs) are growing in popularity. However, even the most current VUIs regularly cause frustration for their users. Very few studies exist on what people do to overcome VUI problems they encounter, or how VUIs can be designed to aid people when these problems occur. In this paper, we analyze empirical data on how users (n=12) interact with our VUI calendar system, DiscoverCal, over three sessions. In particular, we identify the main obstacle categories and types of tactics our participants employ to overcome them. We analyzed the patterns of how different tactics are used in each obstacle category. We found that while NLP Error obstacles occurred the most, other obstacles are more likely to frustrate or confuse the user. We also found patterns that suggest participants were more likely to employ a "guessing" approach rather than rely on visual aids or knowledge recall.

References

[1]
Matthew P Aylett, Per Ola Kristensson, Steve Whittaker, and Yolanda Vazquez-Alvarez. 2014. None of a CHInd: Relationship Counselling for HCI and Speech Technology. CHI '14 Extended Abstracts on Human Factors in Computing Systems (2014), 749--760.
[2]
Eric Corbett and Astrid Weber. 2016. What can I say? Proceedings of the 18th International Conference on Human-Computer Interaction with Mobile Devices and Services - MobileHCI '16 (2016), 72--82.
[3]
Anushay Furqan, Chelsea Myers, and Jichen Zhu. 2017. Learnability through Adaptive Discovery Tools in Voice User Interfaces. Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems - CHI EA '17 (2017), 1617--1623.
[4]
Jonathan Huyghe, Jan Derboven, and Dirk De Grooff. 2014. ALADIN: Demo of a Multimodal Adaptive Voice Interface. Proceedings of the 8th Nordic Conference on Human-Computer Interaction: Fun, Fast, Foundational (2014), 1035--1038.
[5]
Clare M Karat, Christine Halverson, Daniel Horn, and John Karat. 1999. Patterns of entry and correction in large vocabulary continuous speech recognition systems. Proceedings of the SIGCHI conference on Human factors in computing systems: the CHI is the limit (1999), 568--575.
[6]
Lewis R. Karl, Michael Pettey, and Ben Shneiderman. 1993. Speech versus mouse commands for word processing: an empirical evaluation. International Journal of Man-Machine Studies 39, 4 (oct 1993), 667--687.
[7]
Ranjitha Gurunath Kulkarni, Ahmed El Kholy, Ziad Al Bawab, Noha Alon, Imed Zitouni, Umut Ozertem, and Shuangyu Chang. 2017. Hyperarticulation Detection in Repetitive Voice Queries Using Pairwise Comparison for Improved Speech Recognition. (2017), 4985--4989.
[8]
Ewa Luger and Abigail Sellen. 2016. "Like Having a Really Bad PA": The Gulf between User Expectation and Experience of Conversational Agents. Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems - CHI '16 (2016), 5286--5297.
[9]
Gabriel Lyons, Vinh Tran, Carsten Binnig, Ugur Cetintemel, and Tim Kraska. 2016. Making the case for Query-by-Voice with EchoQuery. Sigmod (2016), 2129--2132.
[10]
Amanda Purington, Jessie G. Taft, Shruti Sannon, Natalya N. Bazarova, and Samuel Hardman Taylor. 2017. "Alexa is my new BFF". Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems - CHI EA '17 (2017), 2853--2859.
[11]
Pernilla Qvarfordt, Arne Jönsson, and Nils Dahlbäck. 2003. The Role of Spoken Feedback in Experiencing Multimodal Interfaces as Human-like. Proceedings of the 5th international conference on Multimodal interfaces ICMI 03 (2003), 250--257.
[12]
Xin Rong, Adam Fourney, Robin N Brewer, Meredith Ringel Morris, and Paul N Bennett. 2017. Managing Uncertainty in Time Expressions for Virtual Assistants. Acm (2017), 568--579.
[13]
Nicole Shechtman and Leonard M Horowitz. 2003. Media Inequality in Conversation. Proceedings of the conference on Human factors in computing systems - CHI '03 5 (2003), 281--288.
[14]
Ben Shneiderman. 2000. The limits of speech recognition. Commun. ACM 43, 9 (2000), 63--65.
[15]
Amanda J. Stent, Marie K. Huffman, and Susan E. Brennan. 2008. Adapting speaking after evidence of misrecognition: Local and global hyperarticulation. Speech Communication 50, 3 (2008), 163--178.
[16]
A. Strauss and J. Corbin. 1998. Basics of Qualitative Research: Techniques and Procedures for developing grounded theory. (second ed.). SAGE Publications.
[17]
Yu Zhong, T V Raman, Casey Burkhardt, Fadi Biadsy, and Jeffrey P Bigham. 2014. JustSpeak: Enabling Universal Voice Control on Android. Proceedings of the 11th Web for All Conference on - W4A '14 (2014), 1--4.

Cited By

View all
  • (2025)User Interaction Patterns and Breakdowns in Conversing with LLM-Powered Voice AssistantsInternational Journal of Human-Computer Studies10.1016/j.ijhcs.2024.103406195(103406)Online publication date: Jan-2025
  • (2024)User Experience and Usability of Voice User Interfaces: A Systematic Literature ReviewInformation10.3390/info1509057915:9(579)Online publication date: 19-Sep-2024
  • (2024)The Effects of Virtual Character's Intelligence and Task's Complexity during an Immersive Jigsaw Puzzle Co-solving TaskProceedings of the 17th ACM SIGGRAPH Conference on Motion, Interaction, and Games10.1145/3677388.3696324(1-12)Online publication date: 21-Nov-2024
  • Show More Cited By

Index Terms

  1. Patterns for How Users Overcome Obstacles in Voice User Interfaces

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems
    April 2018
    8489 pages
    ISBN:9781450356206
    DOI:10.1145/3173574
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 April 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. user experience
    2. voice control
    3. voice user interfaces

    Qualifiers

    • Research-article

    Conference

    CHI '18
    Sponsor:

    Acceptance Rates

    CHI '18 Paper Acceptance Rate 666 of 2,590 submissions, 26%;
    Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

    Upcoming Conference

    CHI 2025
    ACM CHI Conference on Human Factors in Computing Systems
    April 26 - May 1, 2025
    Yokohama , Japan

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)281
    • Downloads (Last 6 weeks)60
    Reflects downloads up to 25 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)User Interaction Patterns and Breakdowns in Conversing with LLM-Powered Voice AssistantsInternational Journal of Human-Computer Studies10.1016/j.ijhcs.2024.103406195(103406)Online publication date: Jan-2025
    • (2024)User Experience and Usability of Voice User Interfaces: A Systematic Literature ReviewInformation10.3390/info1509057915:9(579)Online publication date: 19-Sep-2024
    • (2024)The Effects of Virtual Character's Intelligence and Task's Complexity during an Immersive Jigsaw Puzzle Co-solving TaskProceedings of the 17th ACM SIGGRAPH Conference on Motion, Interaction, and Games10.1145/3677388.3696324(1-12)Online publication date: 21-Nov-2024
    • (2024)Troubleshooting Conversations: Exploring Chatbot Repair StrategiesProceedings of Mensch und Computer 202410.1145/3670653.3677496(386-391)Online publication date: 1-Sep-2024
    • (2024)Understanding How Blind Users Handle Object Recognition Errors: Strategies and ChallengesProceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility10.1145/3663548.3675635(1-15)Online publication date: 27-Oct-2024
    • (2024)Qualitative Approaches to Voice UXACM Computing Surveys10.1145/365866656:12(1-34)Online publication date: 20-Apr-2024
    • (2024)GPTVoiceTasker: Advancing Multi-step Mobile Task Efficiency Through Dynamic Interface Exploration and LearningProceedings of the 37th Annual ACM Symposium on User Interface Software and Technology10.1145/3654777.3676356(1-17)Online publication date: 13-Oct-2024
    • (2024)VOICON: Geometric Motion-Based Visual Feedback in Voice User InterfaceProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3660741(102-115)Online publication date: 1-Jul-2024
    • (2024)GlassMail: Towards Personalised Wearable Assistant for On-the-Go Email Creation on Smart GlassesProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3660683(372-390)Online publication date: 1-Jul-2024
    • (2024)Understanding is a Two-Way Street: User-Initiated Repair on Agent Responses and Hearing in Conversational InterfacesProceedings of the ACM on Human-Computer Interaction10.1145/36410268:CSCW1(1-26)Online publication date: 26-Apr-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media