Abstract
This paper describes work on dialogue data collection and dialogue system design for personal assistant humanoid robots undertaken at eNTERFACE 2016. The emphasis has been on the system’s speech capabilities and dialogue modeling of what we call LifeLine Dialogues, i.e. dialogues that help people tell stories about their lives. The main goal behind this type of application is to help elderly people exercise their speech and memory capabilities. The system further aims at acquiring a good level of knowledge about the person’s interests and thus is expected to feature open-domain conversations, presenting useful and interesting information to the user. The novel contributions of this work are: (1) a flexible spoken dialogue system that extends the Ravenclaw-type agent-based dialogue management model with topic management and multi-modal capabilities, especially with face recognition technologies, (2) a collection of WOZ-data related to initial encounters and presentation of information to the user, and (3) the establishment of a closer conversational relationship with the user by utilizing additional data (e.g. context, dialogue history, emotions, user goals, etc.).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
References
Bohus, D., Rudnicky, A.I.: The RavenClaw dialog management framework: architecture and systems. Comput. Speech Lang. 23(3), 332–361 (2009)
Clark, H.H., Schaefer, E.F.: Contributing to discourse. Cogn. Sci. 13(2), 259–294 (1989)
Dahlbäck, N., Jönsson, A., Ahrenberg, L.: Wizard of Oz studies: why and how. In: Proceedings of the 1st International Conference on Intelligent User Interfaces, pp. 193–200. ACM (1993)
Eskenazi, M., Black, A.W., Raux, A., Langner, B.: Lets Go Lab: a platform for evaluation of spoken dialog systems with real world users. In: InterSpeech (2008)
Ferguson, G., Allen, J.F.: TRIPs: an integrated intelligent problem-solving assistant. In: Proceedings of the AAAI/IAAI Conference on Artificial Intelligence/Innovative Applications of Artificial Intelligence, pp. 567–572 (1998)
Flandorfer, P.: Population ageing and socially assistive robots for elderly persons: the importance of sociodemographic factors for user acceptance. Int. J. Popul. Res. 2012, Article ID 829835, 13 (2012). doi:10.1155/2012/829835
Ghigi, F., Eskenazi, M., Torres, M.I., Lee, S.: Incremental dialog processing in a task-oriented dialog. In: InterSpeech, pp. 308–312 (2014)
Henderson, J., Merlo, P., Titov, I., Musillo, G.: Multilingual joint parsing of syntactic and semantic dependencies with a latent variable model. Comput. Linguist. 39(4), 949–998 (2013)
Jokinen, K., McTear, M.: Spoken Dialogue Systems, vol. 2. Morgan & Claypool Publishers, Princeton (2009)
McCool, C., Marcel, S., Hadid, A., Pietikäinen, M., Matejka, P., Cernockỳ, J., Poh, N., Kittler, J., Larcher, A., Levy, C., et al.: Bi-modal person recognition on a mobile phone: using mobile phone data. In: 2012 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), pp. 635–640. IEEE (2012)
Olaso, J.M., Milhorat, P., Himmelsbach, J., Boudy, J., Chollet, G., Schlögl, S., Torres, M.I.: A Multi-lingual evaluation of the vAssist spoken dialog system. Comparing Disco and RavenClaw. In: International Workshop on Spoken Dialogue Systems (2016)
Olaso, J.M., Torres, M.I.: Dialogue system based on EDECÁN architecture. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2010. LNCS, vol. 6231, pp. 547–551. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15760-8_69
Petrovska-Delacrétaz, D., Chollet, G., Dorizzi, B. (eds.): Guide to Biometric Reference Systems and Performance Evaluation. Springer, London (2009). doi:10.1007/978-1-84800-292-0
Phillips, P.J., Flynn, P.J., Scruggs, T., Bowyer, K.W., Chang, J., Hoffman, K., Marques, J., Min, J., Worek, W.: Overview of the face recognition grand challenge. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 1, pp. 947–954. IEEE (2005)
Sansen, H., Torres, M.I., Chollet, G., Glackin, C., Petrovska-Delacretaz, D., Boudy, J., Badii, A., Schlögl, S.: The Roberta IRONSIDE project: a dialog capable humanoid personal assistant in a wheelchair for dependent persons. In: 2016 2nd International Conference on Advanced Technologies for Signal and Image Proceedings (ATSIP), pp. 381–386 (2016)
Schlögl, S., Doherty, G., Luz, S.: Wizard of Oz experimentation for language technology applications: challenges and tools. Interact. Comput. 27(6), 592–615 (2015)
Schlögl, S., Milhorat, P., Chollet, G., Boudy, J.: Designing language technology applications: a Wizard of Oz driven prototyping framework. In: Proceedings of the EACL Conference of the European Chapter of the Association for Computer Linguistics, pp. 85–88 (2014)
Serrras, M., Pére, N., Torres, M.I., Del Pozo, A.: Entropy-driven dialog for topic classification: detecting and tackling uncertainty. In: International Workshop on Spoken Dialogue Systems (2016)
ter Maat, M., Heylen, D.: Flipper: an information state component for spoken dialogue systems. In: Vilhjálmsson, H.H., Kopp, S., Marsella, S., Thórisson, K.R. (eds.) IVA 2011. LNCS, vol. 6895, pp. 470–472. Springer, Heidelberg (2011). doi:10.1007/978-3-642-23974-8_67
Traum, D.R.: A computational theory of grounding in natural language conversation. Technical report, University of Rochester, Rochester, NY, USA (1994)
Traum, D.R., Larsson, S.: The information state approach to dialogue management. In: van Kuppevelt, J., Smith, R.W. (eds.) Current and New Directions in Discourse and Dialogue. Text, Speech and Language Technology, vol. 22, pp. 325–353. Springer, Dordrecht (2003)
Turunen, M., Hakulinen, J.: Jaspis-a framework for multilingual adaptive speech applications. In: InterSpeech, pp. 719–722 (2000)
Usoltsev, A., Petrovska-Delacrétaz, D., Houssemeddine, K.: Full video processing for mobile audio-visual identity verification. In: International Conference on Pattern Recognition Applications and Methods ICPRAM 2016 (2016)
Ward, W., et al.: The CMU air travel information service: understanding spontaneous speech. In: Proceedings of the DARPA Speech and Natural Language Workshop, vol. 1, pp. 127–129 (1990)
Acknowledgments
The authors want to acknowledge the organizers of eNTERFACE 2016 and the University of Twente for providing the opportunity to develop this project. We also want to acknowledge the institutions supporting some of the authors e.g. the Spanish Science Minister under grant TIN2014-54288-C4, ‘ADAPT 13/RC/2106’, and the Academy of Finland project Digital Citizens grant number 270082.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
López, A. et al. (2017). LifeLine Dialogues with Roberta. In: Quesada, J., Martín Mateos , FJ., López Soto, T. (eds) Future and Emerging Trends in Language Technology. Machine Learning and Big Data. FETLT 2016. Lecture Notes in Computer Science(), vol 10341. Springer, Cham. https://doi.org/10.1007/978-3-319-69365-1_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-69365-1_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69364-4
Online ISBN: 978-3-319-69365-1
eBook Packages: Computer ScienceComputer Science (R0)