Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface

Misu, Teruhisa; Sugiura, Komei; Kawahara, Tatsuya; Ohtake, Kiyonori; Hori, Chiori; Kashioka, Hideki; Nakamura, Satoshi

doi:10.1007/978-1-4419-7934-6_2

Teruhisa Misu⁵,
Komei Sugiura⁵,
Tatsuya Kawahara⁶,
Kiyonori Ohtake⁵,
Chiori Hori⁵,
Hideki Kashioka⁵ &
…
Satoshi Nakamura⁵

481 Accesses

Abstract

We propose an efficient online learning method of dialogue management based on Bayes risk criterion for document retrieval systems with a speech interface. The system has several choices in generating responses. So far, we have optimized the selection as minimization of Bayes risk based on reward for correct information presentation and penalty for redundant turns. In this chapter, this framework is extended to be trainable via online learning by maximum likelihood estimation of success probability of a response generation. Effectiveness of the proposed framework was demonstrated through an experiment with a large amount of utterances of real users. The online learning method was then compared with the method using reinforcement learning and discussed in terms of convergence speed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning

Article Open access 07 January 2023

A review of dialogue systems: current trends and future directions

Article 22 December 2023

Deep Reinforcement Learning for On-line Dialogue State Tracking

Author information

Authors and Affiliations

National Institute of Information and Communications Technology (NICT), Kyoto, Japan
Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka & Satoshi Nakamura
Kyoto University, Kyoto, Japan
Tatsuya Kawahara

Authors

Teruhisa Misu
View author publications
You can also search for this author in PubMed Google Scholar
Komei Sugiura
View author publications
You can also search for this author in PubMed Google Scholar
Tatsuya Kawahara
View author publications
You can also search for this author in PubMed Google Scholar
Kiyonori Ohtake
View author publications
You can also search for this author in PubMed Google Scholar
Chiori Hori
View author publications
You can also search for this author in PubMed Google Scholar
Hideki Kashioka
View author publications
You can also search for this author in PubMed Google Scholar
Satoshi Nakamura
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Teruhisa Misu .

Editor information

Editors and Affiliations

Fak. Ingenieurwissenschaften und, Elektrotechnik, Universität Ulm, Albert-Einstein-Allee 43, Ulm, 89081, Germany
Wolfgang Minker
Technology (POSTECH), Dept. Computer Science & Engineering, Pohang University of Science &, San 31, Hyoja-dong, Pohang, Kyungbuk, 790-784, Korea, Republic of (South Korea)
Gary Geunbae Lee
Communications Technology, National Institute of Information and, Kyoto, 69121, Japan
Satoshi Nakamura
Multilingual and Multimedia Information, CNRS, Orsay, 91403, France
Joseph Mariani

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Misu, T. et al. (2011). Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface. In: Minker, W., Lee, G., Nakamura, S., Mariani, J. (eds) Spoken Dialogue Systems Technology and Design. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-7934-6_2

Download citation

DOI: https://doi.org/10.1007/978-1-4419-7934-6_2
Published: 01 November 2010
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-7933-9
Online ISBN: 978-1-4419-7934-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning

A review of dialogue systems: current trends and future directions

Deep Reinforcement Learning for On-line Dialogue State Tracking

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning

A review of dialogue systems: current trends and future directions

Deep Reinforcement Learning for On-line Dialogue State Tracking

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation