Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/566570.566597acmconferencesArticle/Chapter ViewAbstractPublication PagessiggraphConference Proceedingsconference-collections
Article

Integrated learning for interactive synthetic characters

Published: 01 July 2002 Publication History

Abstract

The ability to learn is a potentially compelling and important quality for interactive synthetic characters. To that end, we describe a practical approach to real-time learning for synthetic characters. Our implementation is grounded in the techniques of reinforcement learning and informed by insights from animal training. It simplifies the learning task for characters by (a) enabling them to take advantage of predictable regularities in their world, (b) allowing them to make maximal use of any supervisory signals, and (c) making them easy to train by humans.We built an autonomous animated dog that can be trained with a technique used to train real dogs called "clicker training". Capabilities demonstrated include being trained to recognize and use acoustic patterns as cues for actions, as well as to synthesize new actions from novel paths through its motion space.A key contribution of this paper is to demonstrate that by addressing the three problems of state, action, and state-action space discovery at the same time, the solution for each becomes easier. Finally, we articulate heuristics and design principles that make learning practical for synthetic characters.

References

[1]
BALLARD, D. 1997. An Introduction to Natural Computation. MIT Press, Cambridge, MA.
[2]
BLUMBERG, B., AND GAYLEAN, T. 1995. Multi-level direction of autonomous creatures for real-time virtual environments. In Proceedings of SIGGRAPH 1995, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[3]
BURKE, R., ISLA, D., DOWNIE, M., IVANOV, Y., AND BLUMBERG, B. 2001. Creature smarts: The art and architecture of a virtual brain. In Proceedings of the Computer Game Developers Conference.
[4]
BURKE, R. 2001. Its about Time:Temporal Representation for Synthetic Characters. Master's thesis, The Media Lab, MIT.
[5]
COPPINGER, R., AND COPPINGER, L. 2001. Dogs: A Startling New Understanding of Canine Origin, Behavior, and Evolution. Scribner, New York, NY.
[6]
DOWNIE, M. 2000. behavior, animation, music: the music and movement of synthetic characters. Master's thesis, The Media Lab, MIT.
[7]
DRESCHER, G. 1991. Made-Up Minds:A Constructivist Approach to Artificial Intelligence. MIT Press, Cambridge MA.
[8]
EVANS, R. 2002. Varieties of learning. In AI Game Programming Wisdom, E. Rabin, Ed. Charles River Media, Hingham MA.
[9]
FALOUTSOS, P., VAN DE PANNE, M., AND TERZOPOLOUS, D. 2001. Composible controllers for physics-based character animation. In Proceedings of SIGGRAPH 2001, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[10]
FUNGE, J., TU, X., AND TERZOPOLOUS, D. 1999. Cognitive modeling: Knowledge, reasoning and planning for intelligent characters. In Proceedings of SIGGRAPH 1999, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[11]
GALLISTEL, C. R., AND GIBBON, J. 2000. Time, rate and conditioning. Psychological Review 107.
[12]
GLEICHER, M. 1998. Retargetting motion to new characters. In Proceedings of SIGGRAPH 1998, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[13]
GOULD, J., AND GOULD, C. 1999. The Animal Mind. W.H. Freeman, New York, NY.
[14]
GRAND, S., CLIFF, D., AND MALHOTRA, A. 1996. Creatures: Artificial life autonomous agents for home entertainment. In Proceedings of the Autonomous Agents '97 Conference.
[15]
GRZESZCZUK, R., AND TERZOPOULOS, D. 1995. Automated learning of muscle-actuated locomotion through control abstraction. In Proceedings of SIGGRAPH 1995, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[16]
GRZESZCZUK, R., TERZOPOULOS, D., AND HINTON, G. 1998. Neuroanimator: Fast neural network emulation and control of physics-based models. In Proceedings of SIGGRAPH 1998, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[17]
HODGINS, J., AND POLLARD, N. 1997. Adapting simulated behaviors for new characters. In Proceedings of SIGGRAPH 1997, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[18]
ISLA, D., BURKE, R., DOWNIE, M., AND BLUMBERG, B. 2001. A layered brain architecture for synthetic creatures. In Proceedings of The International Joint Conference on Artificial Intelligence.
[19]
ISLA, D. 2001. The Virtual Hippocampus: Spatial Common Sense for Synthetic Creatures. Master's thesis, The Media Lab, MIT.
[20]
IVANOV, Y., BLUMBERG, B., AND PENTLAND, A. 2001. Expectation maximization for weakly labeled data. In Proceedings of the 18th International Conference on Machine Learning.
[21]
IVANOV, Y. 2001. State Discovery for Autonomous Creatures. PhD thesis, The Media Lab, MIT.
[22]
KAELBLING, L. 1990. Learning in embedded systems. PhD thesis, Stanford University.
[23]
KAPLAN, F., OUDEYER, P.-Y., KUBINYI, E., AND MIKLOSI, A. 2001. Taming robots with clicker training : a solution for teaching complex behaviors. In Proceedings of the 9th European workshop on learning robots, LNAI, Springer, M. Quoy, P. Gaussier, and J. L. Wyatt, Eds.
[24]
LINDSAY, S. 2000. Applied Dog Behavior and Training. Iowa State University Press, Ames, IA.
[25]
LORENZ, K., AND LEYAHUSEN, P. 1973. Motivation of Human and Animal Behavior: An Ethological View. Van Nostrand Reinhold Co., New York, NY.
[26]
LORENZ, K. 1981. The Foundations of Ethology. Springer-Verlag, New York, NY.
[27]
MITCHELL, K. 1997. Machine Learning. McGraw Hill, New York, NY.
[28]
PERLIN, K., AND GOLDBERG, A. 1996. Improv: A system for scripting interactive actors in virtual worlds. In Proceedings of SIGGRAPH 1996, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[29]
PRYOR, K. 1999. Clicker Training for Dogs. Sunshine Books, Inc., Waltham, MA.
[30]
RABINER, L., AND JUANG, B.-H. 1993. Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs, NJ.
[31]
RAMIREZ, K. 1999. Animal Training:Successful Animal Management Through Positive Reinforcement. Shedd Aquarium, Chicago, IL.
[32]
RESNER, B., STERN, A., AND FRANK, A. 1997. The truth about catz and dogz. In The Computer Games Developer Conference.
[33]
REYNOLDS, C. 1987. Flocks, herds and schools: A distributed behavioral model. In Proceedings of SIGGRAPH 1987, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[34]
ROSE, C., COHEN, M., AND BODENHEIMER, B. 1999. Verbs and adverbs: Multidimensional motion interpolation. IEEE Computer Graphics And Applications 18, 5.
[35]
SHETTLEWORTH, S. J. 1998. Cognition, Evolution and Behavior. Oxford University Press, New York, NY.
[36]
SUTTON, R., AND BARTO, A. 1998. Reinforcement Learning: An Introduction. MIT Press, Cambridge MA.
[37]
SUTTON, R. 1991. Reinforcement learning architectures for animats. In The First International Conference on Simulation of Adaptive Behavior, MIT Press, Paris, Fr.
[38]
THERRIEN, C. 1989. Decision Estimation and Classification: An Introduction to Pattern Recognition and Related Topics. John Wiley and Sons, New York, NY.
[39]
TOMLINSON, B., AND BLUMBERG, B. 2002. Alphawolf: Social learning, emotion and development in autonomous virtual agents. In First GSFC/JPL Workshop on Radical Agent Concepts.
[40]
TU, X., AND TERZOPOULOS, D. 1994. Artificial fishes: Physics, locomotion, perception, behavior. In Proceedings of SIGGRAPH 1994, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[41]
VAN DE PANNE, M., AND FIUME., E. 1993. Sensor-actuator networks. In Proceedings of SIGGRAPH 1993, ACM Press / ACM SIGGRAPH, Computer Graphics Proceedings, Annual Conference Series, ACM.
[42]
VAN DE PANNE, M., KIM, R., AND FIUME., E. 1994. Synthesizing parameterized motions. In 5th Eurographics Workshop on Simulation and Animation.
[43]
WATKINS, C. J., AND DAYAN, P. 1992. Q-learning. Machine Learning 8.
[44]
WILKES, G. 1995. Click and Treat Training Kit. Click and Treat Inc., Mesa, AZ.
[45]
YOON, S., BLUMBERG, B., AND SCHNEIDER, G. 2000. Motivation-driven learning for interactive synthetic characters. In Proceedings of the Fourth International Conference on Autonomous Agents.
[46]
YOON, S., BURKE, R., AND BLUMBERG, B. 2000. Interactive training for synthetic characters. In Proceedings of AAAI 2000.

Cited By

View all
  • (2022)Interactive Reinforcement Learning With Bayesian Fusion of Multimodal AdviceIEEE Robotics and Automation Letters10.1109/LRA.2022.31821007:3(7558-7565)Online publication date: Jul-2022
  • (2022)Correct Me If I am Wrong: Interactive Learning for Robotic ManipulationIEEE Robotics and Automation Letters10.1109/LRA.2022.31455167:2(3695-3702)Online publication date: Apr-2022
  • (2020)A Review on Interactive Reinforcement Learning From Human Social FeedbackIEEE Access10.1109/ACCESS.2020.30062548(120757-120765)Online publication date: 2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGGRAPH '02: Proceedings of the 29th annual conference on Computer graphics and interactive techniques
July 2002
574 pages
ISBN:1581135211
DOI:10.1145/566570
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 July 2002

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. animation
  2. behavioral animation
  3. computer games

Qualifiers

  • Article

Conference

SIGGRAPH02
Sponsor:

Acceptance Rates

SIGGRAPH '02 Paper Acceptance Rate 67 of 358 submissions, 19%;
Overall Acceptance Rate 1,822 of 8,601 submissions, 21%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)1
Reflects downloads up to 02 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Interactive Reinforcement Learning With Bayesian Fusion of Multimodal AdviceIEEE Robotics and Automation Letters10.1109/LRA.2022.31821007:3(7558-7565)Online publication date: Jul-2022
  • (2022)Correct Me If I am Wrong: Interactive Learning for Robotic ManipulationIEEE Robotics and Automation Letters10.1109/LRA.2022.31455167:2(3695-3702)Online publication date: Apr-2022
  • (2020)A Review on Interactive Reinforcement Learning From Human Social FeedbackIEEE Access10.1109/ACCESS.2020.30062548(120757-120765)Online publication date: 2020
  • (2018)Social interaction for efficient agent learning from human rewardAutonomous Agents and Multi-Agent Systems10.1007/s10458-017-9374-832:1(1-25)Online publication date: 1-Jan-2018
  • (2016)Human-animal teams as an analog for future human-robot teamsJournal of Human-Robot Interaction10.5898/JHRI.5.1.Phillips5:1(100-125)Online publication date: 23-Mar-2016
  • (2016)Timed Petri nets for fluent turn-taking over multimodal interaction resources in human-robot collaborationInternational Journal of Robotics Research10.1177/027836491562729135:11(1330-1353)Online publication date: 1-Sep-2016
  • (2016)Using informative behavior to increase engagement while learning from human rewardAutonomous Agents and Multi-Agent Systems10.1007/s10458-015-9308-230:5(826-848)Online publication date: 1-Sep-2016
  • (2015)Modeling believable agents using a descriptive approachBiologically Inspired Cognitive Architectures10.1016/j.bica.2015.09.00414(10-21)Online publication date: Oct-2015
  • (2014)A Gesture Learning Interface for Simulated Robot Path Shaping With a Human TeacherIEEE Transactions on Human-Machine Systems10.1109/TSMC.2013.229171444:1(41-54)Online publication date: Feb-2014
  • (2014)Manipulating Mental States Through Physical ActionInternational Journal of Social Robotics10.1007/s12369-014-0234-26:3(315-327)Online publication date: 13-Apr-2014
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media