Yoking-Based Identification of Learning Behavior in Artificial and Biological Agents

Baum, Manuel; Schattenhofer, Lukas; Rössler, Theresa; Osuna-Mascaró, Antonio; Auersperg, Alice; Kacelnik, Alex; Brock, Oliver

doi:10.1007/978-3-031-16770-6_6

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13499)

Included in the following conference series:

International Conference on Simulation of Adaptive Behavior

Abstract

We want to understand how animals learn to solve complex tasks. A natural approach is to hypothesize learning models and then compare them to real biological learning data, but how to perform such a comparison remains unclear. We propose that yoking is an important component of such an analysis. In yoking, two agents are made to experience the same inputs, receive the same rewards, or perform the same actions, possibly in combination. We use yoking as an analytical tool to identify the algorithm that drives learning in a target agent. We first evaluate this approach on a synthetic task for which the ground-truth learning algorithm is known. We then apply it to biological data from a physical puzzle task to identify the learning algorithm behind physical problem solving in Goffin’s cockatoos. Our results show that yoking works: it identifies the target algorithm more reliably, with lower variance and fewer assumptions, than a less constrained approach to identifying learning algorithms.
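
The idea of yoking described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. The bandit task, the softmax Q-learner, the parameter values, and all names below (QLearner, run_target, yoked_log_likelihood) are assumptions made purely for illustration. A target agent learns freely, and each candidate model is then yoked to the target's recorded actions and rewards and scored by how well it predicts the target's choices.

```python
import math
import random


class QLearner:
    """Hypothetical candidate model: bandit Q-learner with a softmax policy."""

    def __init__(self, n_actions, alpha, beta):
        self.q = [0.0] * n_actions  # action-value estimates
        self.alpha = alpha          # learning rate (0 means no learning)
        self.beta = beta            # softmax inverse temperature

    def policy(self):
        # Numerically stable softmax over the current value estimates.
        m = max(self.q)
        exps = [math.exp(self.beta * (v - m)) for v in self.q]
        z = sum(exps)
        return [e / z for e in exps]

    def act(self, rng):
        return rng.choices(range(len(self.q)), weights=self.policy())[0]

    def update(self, action, reward):
        self.q[action] += self.alpha * (reward - self.q[action])


def run_target(agent, reward_probs, n_trials, rng):
    """Let the target agent learn freely and record its (action, reward) pairs."""
    history = []
    for _ in range(n_trials):
        a = agent.act(rng)
        r = 1.0 if rng.random() < reward_probs[a] else 0.0
        agent.update(a, r)
        history.append((a, r))
    return history


def yoked_log_likelihood(candidate, history):
    """Yoke a candidate to the target's actions and rewards.

    The candidate never chooses for itself: at each step we score how likely
    it would have been to pick the target's action, then update it on the
    target's experience.
    """
    ll = 0.0
    for a, r in history:
        ll += math.log(candidate.policy()[a])
        candidate.update(a, r)
    return ll


rng = random.Random(0)
n_trials = 200
reward_probs = [0.2, 0.5, 0.8]  # assumed toy task, not from the paper

# Ground-truth target whose learning rule we pretend not to know.
target = QLearner(3, alpha=0.3, beta=5.0)
history = run_target(target, reward_probs, n_trials, rng)

# Two assumed hypotheses about the target: it learns, or it does not.
candidates = {
    "learner": QLearner(3, alpha=0.3, beta=5.0),
    "non_learner": QLearner(3, alpha=0.0, beta=5.0),
}
scores = {name: yoked_log_likelihood(c, history) for name, c in candidates.items()}
best = max(scores, key=scores.get)
print(scores, best)
```

Because every candidate is driven by the same recorded experience, differences in score reflect differences between the candidate learning rules rather than differences in what each agent happened to encounter.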

Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC 2002/1 “Science of Intelligence” – project number 390523135.

Author information

Authors and Affiliations

Science of Intelligence, Research Cluster of Excellence, Marchstr. 23, 10587, Berlin, Germany
Manuel Baum, Lukas Schattenhofer, Alice Auersperg, Alex Kacelnik & Oliver Brock
Robotics and Biology Laboratory, Technische Universität Berlin, Berlin, Germany
Manuel Baum & Oliver Brock
Comparative Cognition Group, University of Veterinary Medicine Vienna, Vienna, Austria
Theresa Rössler, Antonio Osuna-Mascaró & Alice Auersperg
Behavioural Ecology Group, University of Oxford, Oxford, UK
Alex Kacelnik

Authors

Manuel Baum
Lukas Schattenhofer
Theresa Rössler
Antonio Osuna-Mascaró
Alice Auersperg
Alex Kacelnik
Oliver Brock

Corresponding authors

Correspondence to Manuel Baum or Oliver Brock.

Editor information

Editors and Affiliations

ETIS, CY Cergy Paris Université, Cergy-Pontoise, France
Lola Cañamero
ETIS, CY Cergy Paris Université, Cergy-Pontoise, France
Philippe Gaussier
Aberystwyth University, Aberystwyth, UK
Myra Wilson
ETIS, CY Cergy Paris Université, Cergy-Pontoise, France
Sofiane Boucenna
ETIS, CY Cergy Paris Université, Cergy-Pontoise, France
Nicolas Cuperlier

About this paper

Cite this paper

Baum, M. et al. (2022). Yoking-Based Identification of Learning Behavior in Artificial and Biological Agents. In: Cañamero, L., Gaussier, P., Wilson, M., Boucenna, S., Cuperlier, N. (eds) From Animals to Animats 16. SAB 2022. Lecture Notes in Computer Science, vol 13499. Springer, Cham. https://doi.org/10.1007/978-3-031-16770-6_6

DOI: https://doi.org/10.1007/978-3-031-16770-6_6
Published: 09 September 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16769-0
Online ISBN: 978-3-031-16770-6
eBook Packages: Computer Science, Computer Science (R0)
