Abstract
This paper describes an international effort to unify a multimodal behavior generation framework for Embodied Conversational Agents (ECAs). We propose a three-stage model, called SAIBA, whose stages represent intent planning, behavior planning, and behavior realization. A Function Markup Language (FML), which describes intent without referring to physical behavior, mediates between the first two stages, and a Behavior Markup Language (BML), which describes the desired physical realization, mediates between the last two stages. In this paper we focus on BML. The hope is that this abstraction and modularization will help ECA researchers pool their resources to build more sophisticated virtual humans.
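To make the idea concrete, the following is a minimal sketch of what a BML block might look like: an XML fragment in which physical behaviors (a gesture, a head nod) are cross-referenced to synchronization points in speech. The element and attribute names used here are illustrative assumptions in the spirit of the language, not the normative specification.

```xml
<!-- Illustrative sketch only: element and attribute names are
     assumptions in the spirit of BML, not the normative spec. -->
<bml id="bml1">
  <!-- A spoken utterance containing a named synchronization point -->
  <speech id="s1">
    <text>This point is <sync id="tm1"/> important.</text>
  </speech>
  <!-- A beat gesture whose stroke phase aligns with the sync point -->
  <gesture id="g1" type="beat" stroke="s1:tm1"/>
  <!-- A head nod that starts when the utterance starts -->
  <head id="h1" type="nod" start="s1:start"/>
</bml>
```

Note that the block says nothing about *why* the agent nods or gestures; that intent would be expressed upstream in FML, leaving the realizer free to choose how the behaviors are physically rendered.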
© 2006 Springer-Verlag Berlin Heidelberg
Kopp, S. et al. (2006). Towards a Common Framework for Multimodal Generation: The Behavior Markup Language. In: Gratch, J., Young, M., Aylett, R., Ballin, D., Olivier, P. (eds) Intelligent Virtual Agents. IVA 2006. Lecture Notes in Computer Science(), vol 4133. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11821830_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37593-7
Online ISBN: 978-3-540-37594-4
eBook Packages: Computer Science (R0)