Abstract
We propose an efficient online learning method of dialogue management based on Bayes risk criterion for document retrieval systems with a speech interface. The system has several choices in generating responses. So far, we have optimized the selection as minimization of Bayes risk based on reward for correct information presentation and penalty for redundant turns. In this chapter, this framework is extended to be trainable via online learning by maximum likelihood estimation of success probability of a response generation. Effectiveness of the proposed framework was demonstrated through an experiment with a large amount of utterances of real users. The online learning method was then compared with the method using reinforcement learning and discussed in terms of convergence speed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this paper
Cite this paper
Misu, T. et al. (2011). Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface. In: Minker, W., Lee, G., Nakamura, S., Mariani, J. (eds) Spoken Dialogue Systems Technology and Design. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-7934-6_2
Download citation
DOI: https://doi.org/10.1007/978-1-4419-7934-6_2
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-7933-9
Online ISBN: 978-1-4419-7934-6
eBook Packages: EngineeringEngineering (R0)