Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Teachable robots: Understanding human teaching behavior to build more effective robot learners

Published: 01 April 2008 Publication History
  • Get Citation Alerts
  • Abstract

    While Reinforcement Learning (RL) is not traditionally designed for interactive supervisory input from a human teacher, several works in both robot and software agents have adapted it for human input by letting a human trainer control the reward signal. In this work, we experimentally examine the assumption underlying these works, namely that the human-given reward is compatible with the traditional RL reward signal. We describe an experimental platform with a simulated RL robot and present an analysis of real-time human teaching behavior found in a study in which untrained subjects taught the robot to perform a new task. We report three main observations on how people administer feedback when teaching a Reinforcement Learning agent: (a) they use the reward channel not only for feedback, but also for future-directed guidance; (b) they have a positive bias to their feedback, possibly using the signal as a motivational channel; and (c) they change their behavior as they develop a mental model of the robotic learner. Given this, we made specific modifications to the simulated RL robot, and analyzed and evaluated its learning behavior in four follow-up experiments with human trainers. We report significant improvements on several learning measures. This work demonstrates the importance of understanding the human-teacher/robot-learner partnership in order to design algorithms that support how people want to teach and simultaneously improve the robot's learning behavior.

    References

    [1]
    Argyle, M., Ingham, R. and McCallin, M., The different functions of gaze. Semiotica. 19-32.
    [2]
    R. Arkin, M. Fujita, T. Takagi, R. Hasegawa, An ethological and emotional basis for human--robot interaction, in: Proceedings of the Conference on Robotics and Autonomous Systems, 2003
    [3]
    Bates, J., The role of emotion in believable agents. Communications of the ACM. v37 i7. 122-125.
    [4]
    B. Blumberg, Old tricks, new dogs: Ethology and interactive creatures, Ph.D. thesis, Massachusetts Institute of Technology, 1997
    [5]
    B. Blumberg, M. Downie, Y. Ivanov, M. Berlin, M. Johnson, B. Tomlinson, Integrated learning for interactive synthetic characters, in: Proceedings of the ACM SIGGRAPH, 2002
    [6]
    Breazeal, C., Designing Sociable Robots. 2002. MIT Press, Cambridge, MA.
    [7]
    Breazeal, C., Brooks, A., Gray, J., Hoffman, G., Lieberman, J., Lee, H., Lockerd, A. and Mulanda, D., Tutelage and collaboration for humanoid robots. International Journal of Humanoid Robotics. v1 i2.
    [8]
    J. Clouse, P. Utgoff, A teaching method for reinforcement learning, in: Proc. of the Ninth International Conf. on Machine Learning (ICML), 1992, pp. 92--101
    [9]
    Cohn, D., Ghahramani, Z. and Jordan, M., Active learning with statistical models. In: Tesauro, G., Touretzky, D., Alspector, J. (Eds.), Advances in Neural Information Processing, vol. 7. Morgan Kaufmann.
    [10]
    Evans, R., Varieties of learning. In: Rabin, S. (Ed.), AI Game Programming Wisdom, Charles River Media, Hingham, MA. pp. 567-578.
    [11]
    Greenfield, P.M., Theory of the teacher in learning activities of everyday life. In: Rogoff, B., Lave, J. (Eds.), Everyday Cognition: Its Development in Social Context, Harvard University Press, Cambridge, MA.
    [12]
    E. Horvitz, J. Breese, D. Heckerman, D. Hovel, K. Rommelse, The lumiere project: Bayesian user modeling for inferring the goals and needs of software users, in: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, Madison, WI, July 1998, pp. 256--265
    [13]
    C. Isbell, C. Shelton, M. Kearns, S. Singh, P. Stone, Cobot: A social reinforcement learning agent, in: 5th Intern. Conf. on Autonomous Agents, 2001
    [14]
    Kaplan, F., Oudeyer, P.-Y., Kubinyi, E. and Miklosi, A., Robotic clicker training. Robotics and Autonomous Systems. v38 i3--4. 197-206.
    [15]
    Krauss, R.M., Chen, Y. and Chawla, P., Nonverbal behavior and nonverbal communication: What do conversational hand gestures tell us?. In: Zanna, M. (Ed.), Advances in Experimental Social Psychology, Academic Press, Tampa. pp. 389-450.
    [16]
    Vygotsky, L.S., . In: Cole, M. (Ed.), Mind in Society: The Development of Higher Psychological Processes, Harvard University Press, Cambridge, MA.
    [17]
    Lashkari, Y., Metral, M. and Maes, P., Collaborative interface agents. In: Proceedings of the Twelfth National Conference on Artificial Intelligence, vol. 1. AAAI Press, Seattle, WA.
    [18]
    Lauria, S., Bugmann, G., Kyriacou, T. and Klein, E., Mobile robot programming using natural language. Robotics and Autonomous Systems. v38 i3--4. 171-181.
    [19]
    In: Lieberman, H. (Ed.), Your Wish is My Command: Programming by Example, Morgan Kaufmann, San Francisco, CA.
    [20]
    A. Lockerd, C. Breazeal, Tutelage and socially guided robot learning, in: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2004
    [21]
    R. Maclin, J. Shavlik, L. Torrey, T. Walker, E. Wild, Giving advice about preferred actions to reinforcement learners via knowledge-based kernel regression, in: Proceedings of the Twentieth National Conference on Artificial Intelligence (AAAI), Pittsburgh, PA, July 2005
    [22]
    Mataric, M., Reinforcement learning in the multi-robot domain. Autonomous Robots. v4 i1. 73-83.
    [23]
    T.M. Mitchell, S. Wang, Y. Huang, Extracting knowledge about users' activities from raw workstation contents, in: Proceedings of the 21st National Conference on Artificial Intelligence (AAAI), 2006
    [24]
    M.N. Nicolescu, M.J. Matarić, Natural methods for robot task learning: Instructive demonstrations, generalization and practice, in: Proceedings of the 2nd Intl. Conf. AAMAS. Melbourne, Australia, July 2003
    [25]
    Saksida, L.M., Raymond, S.M. and Touretzky, D.S., Shaping robot behavior using principles from instrumental conditioning. Robotics and Autonomous Systems. v22 i3/4. 231
    [26]
    Schaal, S., Is imitation learning the route to humanoid robots?. Trends in Cognitive Sciences. v3. 233-242.
    [27]
    Schohn, G. and Cohn, D., Less is more: Active learning with support vector machines. In: Proc. 17th ICML, Morgan Kaufmann, San Francisco, CA. pp. 839-846.
    [28]
    W.D. Smart, L.P. Kaelbling, Effective reinforcement learning for mobile robots, in: Proceedings of the IEEE International Conference on Robotics and Automation, 2002, pp. 3404--3410
    [29]
    K.O. Stanley, B.D. Bryant, R. Miikkulainen, Evolving neural network agents in the nero video game, in: Proceedings of IEEE 2005 Symposium on Computational Intelligence and Games (CIG'05), 2005
    [30]
    Steels, L. and Kaplan, F., Aibo's first words: The social learning of language and meaning. Evolution of Communication. v4 i1. 3-32.
    [31]
    Stern, A., Frank, A. and Resner, B., Virtual petz (video session): A hybrid approach to creating autonomous, lifelike dogz and catz. In: AGENTS '98: Proceedings of the Second International Conference on Autonomous Agents, ACM Press, New York. pp. 334-335.
    [32]
    Thomas, F. and Johnson, O., Disney Animation: The Illusion of Life. 1981. Abbeville Press, New York.
    [33]
    Thrun, S., Robotics. In: Russell, S., Norvig, P. (Eds.), Artificial Intelligence: A Modern Approach, Prentice Hall.
    [34]
    S.B. Thrun, T.M. Mitchell, Lifelong robot learning, Tech. Rep. IAI-TR-93-7, 1993
    [35]
    Tomlinson, B. and Blumberg, B., Social synthetic characters. Computer Graphics. v26 i2.
    [36]
    R. Voyles, P. Khosla, A multi-agent system for programming robotic agents by human demonstration, in: Proceedings of AI and Manufacturing Research Planning Workshop, 1998
    [37]
    Watkins, C. and Dayan, P., Q-learning. Machine Learning. v8 i3. 279-292.
    [38]
    Wertsch, J.V., Minick, N. and Arns, F.J., Creation of context in joint problem solving. In: Rogoff, B., Lave, J. (Eds.), Everyday Cognition: Its Development in Social Context, Harvard University Press, Cambridge, MA.

    Cited By

    View all
    • (2024)Cieran: Designing Sequential Colormaps via In-Situ Active Preference LearningProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642903(1-15)Online publication date: 11-May-2024
    • (2024)Interactive Robot Programming Inspired by Dog Training: An Exploratory StudyCompanion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction10.1145/3610978.3640655(965-969)Online publication date: 11-Mar-2024
    • (2024)Towards interactive reinforcement learning with intrinsic feedbackNeurocomputing10.1016/j.neucom.2024.127628587:COnline publication date: 28-Jun-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Artificial Intelligence
    Artificial Intelligence  Volume 172, Issue 6-7
    April, 2008
    267 pages

    Publisher

    Elsevier Science Publishers Ltd.

    United Kingdom

    Publication History

    Published: 01 April 2008

    Author Tags

    1. Human--robot interaction
    2. Reinforcement learning
    3. User studies

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Cieran: Designing Sequential Colormaps via In-Situ Active Preference LearningProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642903(1-15)Online publication date: 11-May-2024
    • (2024)Interactive Robot Programming Inspired by Dog Training: An Exploratory StudyCompanion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction10.1145/3610978.3640655(965-969)Online publication date: 11-Mar-2024
    • (2024)Towards interactive reinforcement learning with intrinsic feedbackNeurocomputing10.1016/j.neucom.2024.127628587:COnline publication date: 28-Jun-2024
    • (2023)User Interface Interventions for Improving Robot Learning from DemonstrationProceedings of the 11th International Conference on Human-Agent Interaction10.1145/3623809.3623848(152-161)Online publication date: 4-Dec-2023
    • (2023)Designing Fiduciary Artificial IntelligenceProceedings of the 3rd ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization10.1145/3617694.3623230(1-15)Online publication date: 30-Oct-2023
    • (2023)How Do We Perceive Our Trainee Robots? Exploring the Impact of Robot Errors and Appearance When Performing Domestic Physical Tasks on Teachers’ Trust and EvaluationsACM Transactions on Human-Robot Interaction10.1145/358251612:3(1-41)Online publication date: 5-May-2023
    • (2023)Investigating Learning from Demonstration in Imperfect and Real World ScenariosCompanion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction10.1145/3568294.3579980(769-771)Online publication date: 13-Mar-2023
    • (2023)Self-Annotation Methods for Aligning Implicit and Explicit Human Feedback in Human-Robot InteractionProceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction10.1145/3568162.3576986(398-407)Online publication date: 13-Mar-2023
    • (2023)SME-in-the-loop: Interaction Preferences when Supervising Bots in Human-AI CommunitiesProceedings of the 2023 ACM Designing Interactive Systems Conference10.1145/3563657.3596100(2281-2303)Online publication date: 10-Jul-2023
    • (2023)An integrated model for predicting pupils’ acceptance of artificially intelligent robots as teachersEducation and Information Technologies10.1007/s10639-023-11601-228:9(11631-11654)Online publication date: 22-Feb-2023
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media