“Your Word is my Command”: Google Search by Voice: A Case Study

Schalkwyk, Johan; Beeferman, Doug; Beaufays, Françoise; Byrne, Bill; Chelba, Ciprian; Cohen, Mike; Kamvar, Maryam; Strope, Brian

doi:10.1007/978-1-4419-5951-5_4

Johan Schalkwyk²,
Doug Beeferman,
Françoise Beaufays,
Bill Byrne,
Ciprian Chelba,
Mike Cohen,
Maryam Kamvar &
…
Brian Strope

2951 Accesses
121 Citations

Abstract

An important goal at Google is to make spoken access ubiquitously available. Achieving ubiquity requires two things: availability (i.e., built into every possible interaction where speech input or output can make sense) and performance (i.e., works so well that the modality adds no friction to the interaction).

This chapter is a case study of the development of Google Search by Voice – a step toward our long-term vision of ubiquitous access. While the integration of speech input into Google search is a significant step toward more ubiquitous access, it has posed many problems in terms of the performance of core speech technologies and the design of effective user interfaces. Work is ongoing and no doubt the problems are far from solved. Nonetheless, we have at the minimum achieved a level of performance showing that usage of voice search is growing rapidly, and that many users do indeed become repeat users.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Understanding Voice Search Behavior: Review and Synthesis of Research

An expandable voice user interface as lab assistant based on an improved version of Google’s speech recognition

Article Open access 09 November 2023

Alexa vs. Siri vs. Cortana vs. Google Assistant: A Comparison of Speech-Based Natural User Interfaces

References

C. Allauzen, M. Riley, J. Schalkwyk, W. Skut, and M. Mohri. OpenFst: A general and efficient weighted finite-state transducer library. Lecture Notes in Computer Science, 4783:11, 2007.
Article Google Scholar
M. Bacchiani, F. Beaufays, J. Schalkwyk, M. Schuster, and B. Strope. Deploying GOOG-411: Early lessons in data, measurement, and testing. In Proceedings of ICASSP, pp 5260–5263, April 2008.
Google Scholar
MJF Gales. Semi-tied full-covariance matrices for hidden Markov models. 1997.
Google Scholar
B. Harb, C. Chelba, J. Dean, and G. Ghemawhat. Back-off language model compression. 2009.
Google Scholar
H. Hermansky. Perceptual linear predictive (PLP) analysis of speech. Journal of the Acoustical Society of America, 87(4):1738–1752, 1990.
Article Google Scholar
M. Kamvar and S. Baluja. A large scale study of wireless search behavior: Google mobile search. In CHI, pp 701–709, 22–27 April 2006.
Google Scholar
S. Katz. Estimation of probabilities from sparse data for the language model component of a speech recognizer. In IEEE Transactions on Acoustics, Speech and Signal Processing, volume 35, pp 400–401, March 1987.
Google Scholar
D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah. Boosted MMI for model and feature-space discriminative training. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
Google Scholar
R. Sproat, C. Shih, W. Gale, and N. Chang. A stochastic finite-state word-segmentation algorithm for Chinese. Computational Linguistics, 22(3):377–404, 1996.
Google Scholar
C. Van Heerden, J. Schalkwyk, and B. Strope. Language modeling for what-with-where on GOOG-411. 2009.
Google Scholar

Download references

Author information

Authors and Affiliations

Google, 1600 Amphiteatre Parkway, Mountain View, CA, 94043, USA
Johan Schalkwyk (Senior Staff Engineer)

Authors

Johan Schalkwyk
View author publications
You can also search for this author in PubMed Google Scholar
Doug Beeferman
View author publications
You can also search for this author in PubMed Google Scholar
Françoise Beaufays
View author publications
You can also search for this author in PubMed Google Scholar
Bill Byrne
View author publications
You can also search for this author in PubMed Google Scholar
Ciprian Chelba
View author publications
You can also search for this author in PubMed Google Scholar
Mike Cohen
View author publications
You can also search for this author in PubMed Google Scholar
Maryam Kamvar
View author publications
You can also search for this author in PubMed Google Scholar
Brian Strope
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Johan Schalkwyk .

Editor information

Editors and Affiliations

Linguistic Technology Systems, Palisade Ave. 800, Fort Lee, 07024, New Jersey, USA
Amy Neustein

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Schalkwyk, J. et al. (2010). “Your Word is my Command”: Google Search by Voice: A Case Study. In: Neustein, A. (eds) Advances in Speech Recognition. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-5951-5_4

Download citation

DOI: https://doi.org/10.1007/978-1-4419-5951-5_4
Published: 13 August 2010
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5950-8
Online ISBN: 978-1-4419-5951-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

“Your Word is my Command”: Google Search by Voice: A Case Study

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Understanding Voice Search Behavior: Review and Synthesis of Research

An expandable voice user interface as lab assistant based on an improved version of Google’s speech recognition

Alexa vs. Siri vs. Cortana vs. Google Assistant: A Comparison of Speech-Based Natural User Interfaces

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

“Your Word is my Command”: Google Search by Voice: A Case Study

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Understanding Voice Search Behavior: Review and Synthesis of Research

An expandable voice user interface as lab assistant based on an improved version of Google’s speech recognition

Alexa vs. Siri vs. Cortana vs. Google Assistant: A Comparison of Speech-Based Natural User Interfaces

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation