Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 84 results for author: Oudeyer, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.04671  [pdf, other

    cs.CL cs.LG q-bio.PE

    PhyloLM : Inferring the Phylogeny of Large Language Models and Predicting their Performances in Benchmarks

    Authors: Nicolas Yax, Pierre-Yves Oudeyer, Stefano Palminteri

    Abstract: This paper introduces PhyloLM, a method adapting phylogenetic algorithms to Large Language Models (LLMs) to explore whether and how they relate to each other and to predict their performance characteristics. Our method calculates a phylogenetic distance metrics based on the similarity of LLMs' output. The resulting metric is then used to construct dendrograms, which satisfactorily capture known re… ▽ More

    Submitted 16 June, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

  2. arXiv:2403.08882  [pdf, other

    cs.MA cs.AI q-bio.PE

    Cultural evolution in populations of Large Language Models

    Authors: Jérémy Perez, Corentin Léger, Marcela Ovando-Tellez, Chris Foulon, Joan Dussauld, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: Research in cultural evolution aims at providing causal explanations for the change of culture over time. Over the past decades, this field has generated an important body of knowledge, using experimental, historical, and computational methods. While computational models have been very successful at generating testable hypotheses about the effects of several factors, such as population structure o… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 17 pages, 20 figures. Open-source code available at https://github.com/jeremyperez2/LLM-Culture

    MSC Class: 68T50 ACM Class: I.2.7

  3. Interactive environments for training children's curiosity through the practice of metacognitive skills: a pilot study

    Authors: Rania Abdelghani, Edith Law, Chloé Desvaux, Pierre-Yves Oudeyer, Hélène Sauzéon

    Abstract: Curiosity-driven learning has shown significant positive effects on students' learning experiences and outcomes. But despite this importance, reports show that children lack this skill, especially in formal educational settings. To address this challenge, we propose an 8-session workshop that aims to enhance children's curiosity through training a set of specific metacognitive skills we hypothesiz… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  4. arXiv:2402.14846  [pdf, other

    cs.CL cs.AI cs.LG

    Stick to Your Role! Context-dependence and Stability of Personal Value Expression in Large Language Models

    Authors: Grgur Kovač, Rémy Portelas, Masataka Sawayama, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: The standard way to study Large Language Models (LLMs) with benchmarks or psychology questionnaires is to provide many different queries from similar minimal contexts (e.g. multiple choice questions). However, due to LLMs' highly context-dependent nature, conclusions from such minimal-context evaluations may be little informative about the model's behavior in deployment (where it will be exposed t… ▽ More

    Submitted 30 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: The project website and code are available at https://sites.google.com/view/llmvaluestability

    MSC Class: 68T07 ACM Class: I.2.7

  5. arXiv:2402.10236  [pdf, other

    cs.MA cs.AI cs.LG

    Discovering Sensorimotor Agency in Cellular Automata using Diversity Search

    Authors: Gautier Hamon, Mayalen Etcheverry, Bert Wang-Chak Chan, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: The research field of Artificial Life studies how life-like phenomena such as autopoiesis, agency, or self-regulation can self-organize in computer simulations. In cellular automata (CA), a key open-question has been whether it it is possible to find environment rules that self-organize robust "individuals" from an initial state with no prior existence of things like "bodies", "brain", "perception… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  6. arXiv:2402.01669  [pdf, other

    cs.CY cs.AI cs.LG

    Improved Performances and Motivation in Intelligent Tutoring Systems: Combining Machine Learning and Learner Choice

    Authors: Benjamin Clément, Hélène Sauzéon, Didier Roy, Pierre-Yves Oudeyer

    Abstract: Large class sizes pose challenges to personalized learning in schools, which educational technologies, especially intelligent tutoring systems (ITS), aim to address. In this context, the ZPDES algorithm, based on the Learning Progress Hypothesis (LPH) and multi-armed bandit machine learning techniques, sequences exercises that maximize learning progress (LP). This algorithm was previously shown in… ▽ More

    Submitted 16 January, 2024; originally announced February 2024.

    Comments: 29 pages, 37 figures

    ACM Class: I.2.1; I.2.6

  7. arXiv:2312.00455  [pdf

    cs.AI cs.LG nlin.CG

    Meta-Diversity Search in Complex Systems, A Recipe for Artificial Open-Endedness ?

    Authors: Mayalen Etcheverry, Bert Wang-Chak Chan, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Can we build an artificial system that would be able to generate endless surprises if ran "forever" in Minecraft? While there is not a single path toward solving that grand challenge, this article presents what we believe to be some working ingredients for the endless generation of novel increasingly complex artifacts in Minecraft. Our framework for an open-ended system includes two components: a… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  8. Machine Culture

    Authors: Levin Brinkmann, Fabian Baumann, Jean-François Bonnefon, Maxime Derex, Thomas F. Müller, Anne-Marie Nussberger, Agnieszka Czaplicka, Alberto Acerbi, Thomas L. Griffiths, Joseph Henrich, Joel Z. Leibo, Richard McElreath, Pierre-Yves Oudeyer, Jonathan Stray, Iyad Rahwan

    Abstract: The ability of humans to create and disseminate culture is often credited as the single most important factor of our success as a species. In this Perspective, we explore the notion of machine culture, culture mediated or generated by machines. We argue that intelligent machines simultaneously transform the cultural evolutionary processes of variation, transmission, and selection. Recommender algo… ▽ More

    Submitted 22 November, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

    Journal ref: Nat Hum Behav 7, 1855-1868 (2023)

  9. arXiv:2311.00344  [pdf, other

    cs.AI

    A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

    Authors: Olivier Sigaud, Gianluca Baldassarre, Cedric Colas, Stephane Doncieux, Richard Duro, Pierre-Yves Oudeyer, Nicolas Perrin-Gilbert, Vieri Giuliano Santucci

    Abstract: A lot of recent machine learning research papers have ``open-ended learning'' in their title. But very few of them attempt to define what they mean when using the term. Even worse, when looking more closely there seems to be no consensus on what distinguishes open-ended learning from related concepts such as continual learning, lifelong learning or autotelic learning. In this paper, we contribute… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  10. arXiv:2310.10692  [pdf, other

    cs.LG cs.AI

    ACES: Generating Diverse Programming Puzzles with with Autotelic Generative Models

    Authors: Julien Pourcel, Cédric Colas, Gaia Molinaro, Pierre-Yves Oudeyer, Laetitia Teodorescu

    Abstract: The ability to invent novel and interesting problems is a remarkable feature of human intelligence that drives innovation, art, and science. We propose a method that aims to automate this process by harnessing the power of state-of-the-art generative models to produce a diversity of challenging yet solvable problems, here in the context of Python programming puzzles. Inspired by the intrinsically… ▽ More

    Submitted 29 May, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

  11. arXiv:2310.03192  [pdf, other

    cs.CY

    Generative AI in the Classroom: Can Students Remain Active Learners?

    Authors: Rania Abdelghani, Hélène Sauzéon, Pierre-Yves Oudeyer

    Abstract: Generative Artificial Intelligence (GAI) can be seen as a double-edged weapon in education. Indeed, it may provide personalized, interactive and empowering pedagogical sequences that could favor students' intrinsic motivation, active engagement and help them have more control over their learning. But at the same time, other GAI properties such as the lack of uncertainty signalling even in cases of… ▽ More

    Submitted 10 November, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

  12. arXiv:2307.08452  [pdf, other

    q-bio.BM cs.LG cs.PL

    SBMLtoODEjax: Efficient Simulation and Optimization of Biological Network Models in JAX

    Authors: Mayalen Etcheverry, Michael Levin, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Advances in bioengineering and biomedicine demand a deep understanding of the dynamic behavior of biological systems, ranging from protein pathways to complex cellular processes. Biological networks like gene regulatory networks and protein pathways are key drivers of embryogenesis and physiological processes. Comprehending their diverse behaviors is essential for tackling diseases, including canc… ▽ More

    Submitted 29 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  13. arXiv:2307.07871  [pdf, other

    cs.AI cs.LG

    The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents

    Authors: Grgur Kovač, Rémy Portelas, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: Developmental psychologists have long-established the importance of socio-cognitive abilities in human intelligence. These abilities enable us to enter, participate and benefit from human culture. AI research on social interactive agents mostly concerns the emergence of culture in a multi-agent setting (often without a strong grounding in developmental psychology). We argue that AI research should… ▽ More

    Submitted 23 November, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Preprint, see v1 for a shorter version (accepted at the "Workshop on Theory-of-Mind" at ICML 2023) See project website for demo and code: https://sites.google.com/view/socialai-school

    MSC Class: 68T07 ACM Class: I.2.0

  14. arXiv:2307.07870  [pdf, other

    cs.CL cs.AI cs.LG

    Large Language Models as Superpositions of Cultural Perspectives

    Authors: Grgur Kovač, Masataka Sawayama, Rémy Portelas, Cédric Colas, Peter Ford Dominey, Pierre-Yves Oudeyer

    Abstract: Large Language Models (LLMs) are often misleadingly recognized as having a personality or a set of values. We argue that an LLM can be seen as a superposition of perspectives with different values and personality traits. LLMs exhibit context-dependent values and personality traits that change based on the induced perspective (as opposed to humans, who tend to have more coherent values and personal… ▽ More

    Submitted 7 November, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Preprint

    MSC Class: 68T07 ACM Class: I.2.7

  15. arXiv:2305.12487  [pdf, other

    cs.AI cs.CL cs.LG

    Augmenting Autotelic Agents with Large Language Models

    Authors: Cédric Colas, Laetitia Teodorescu, Pierre-Yves Oudeyer, Xingdi Yuan, Marc-Alexandre Côté

    Abstract: Humans learn to master open-ended repertoires of skills by imagining and practicing their own goals. This autotelic learning process, literally the pursuit of self-generated (auto) goals (telos), becomes more and more open-ended as the goals become more diverse, abstract and creative. The resulting exploration of the space of possible skills is supported by an inter-individual exploration: goal re… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

  16. arXiv:2304.10548  [pdf, other

    cs.CL cs.AI cs.HC

    Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding

    Authors: Ziang Xiao, Xingdi Yuan, Q. Vera Liao, Rania Abdelghani, Pierre-Yves Oudeyer

    Abstract: Qualitative analysis of textual contents unpacks rich and valuable information by assigning labels to the data. However, this process is often labor-intensive, particularly when working with large datasets. While recent AI-based tools demonstrate utility, researchers may not have readily available AI resources and expertise, let alone be challenged by the limited generalizability of those task-spe… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 28th International Conference on Intelligent User Interfaces (IUI '23 Companion), March 27--31, 2023, Sydney, NSW, Australia

  17. arXiv:2302.05244  [pdf, other

    cs.AI cs.CL cs.LG

    A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld

    Authors: Laetitia Teodorescu, Xingdi Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer

    Abstract: Building open-ended agents that can autonomously discover a diversity of behaviours is one of the long-standing goals of artificial intelligence. This challenge can be studied in the framework of autotelic RL agents, i.e. agents that learn by selecting and pursuing their own goals, self-organizing a learning curriculum. Recent work identified language as a key dimension of autotelic learning, in p… ▽ More

    Submitted 24 February, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: In review at ICML 2023

  18. arXiv:2302.02662  [pdf, other

    cs.LG

    Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

    Authors: Thomas Carta, Clément Romac, Thomas Wolf, Sylvain Lamprier, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: Recent works successfully leveraged Large Language Models' (LLM) abilities to capture abstract knowledge about world's physics to solve decision-making problems. Yet, the alignment between LLMs' knowledge and the environment can be wrong and limit functional competence due to lack of grounding. In this paper, we study an approach (named GLAM) to achieve this alignment through functional grounding:… ▽ More

    Submitted 6 September, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Journal ref: PMLR 202 (2023):3676-3713

  19. arXiv:2212.07906  [pdf, other

    cs.NE cs.AI nlin.CG

    Flow-Lenia: Towards open-ended evolution in cellular automata through mass conservation and parameter localization

    Authors: Erwan Plantec, Gautier Hamon, Mayalen Etcheverry, Pierre-Yves Oudeyer, Clément Moulin-Frier, Bert Wang-Chak Chan

    Abstract: The design of complex self-organising systems producing life-like phenomena, such as the open-ended evolution of virtual creatures, is one of the main goals of artificial life. Lenia, a family of cellular automata (CA) generalizing Conway's Game of Life to continuous space, time and states, has attracted a lot of attention because of the wide diversity of self-organizing patterns it can generate.… ▽ More

    Submitted 24 March, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

  20. GPT-3-driven pedagogical agents for training children's curious question-asking skills

    Authors: Rania Abdelghani, Yen-Hsiang Wang, Xingdi Yuan, Tong Wang, Pauline Lucas, Hélène Sauzéon, Pierre-Yves Oudeyer

    Abstract: In order to train children's ability to ask curiosity-driven questions, previous research has explored designing specific exercises relying on providing semantic and linguistic cues to help formulate such questions. But despite showing pedagogical efficiency, this method is still limited as it relies on generating the said cues by hand, which can be a very costly process. In this context, we propo… ▽ More

    Submitted 30 May, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

  21. arXiv:2210.06468  [pdf, other

    cs.AI cs.CL cs.LG

    Contrastive Multimodal Learning for Emergence of Graphical Sensory-Motor Communication

    Authors: Tristan Karch, Yoann Lemesle, Romain Laroche, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: In this paper, we investigate whether artificial agents can develop a shared language in an ecological setting where communication relies on a sensory-motor channel. To this end, we introduce the Graphical Referential Game (GREG) where a speaker must produce a graphical utterance to name a visual referent object while a listener has to select the corresponding object among distractor referents, gi… ▽ More

    Submitted 14 February, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  22. arXiv:2209.11000  [pdf, other

    cs.CL

    Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation

    Authors: Xingdi Yuan, Tong Wang, Yen-Hsiang Wang, Emery Fine, Rania Abdelghani, Pauline Lucas, Hélène Sauzéon, Pierre-Yves Oudeyer

    Abstract: Large Language Models (LLMs) have in recent years demonstrated impressive prowess in natural language generation. A common practice to improve generation diversity is to sample multiple outputs from the model. However, there lacks a simple and robust way of selecting the best output from these stochastic samples. As a case study framed in the context of question generation, we propose two prompt-b… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  23. arXiv:2207.04118  [pdf, other

    cs.AI

    Automatic Exploration of Textual Environments with Language-Conditioned Autotelic Agents

    Authors: Laetitia Teodorescu, Eric Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer

    Abstract: In this extended abstract we discuss the opportunities and challenges of studying intrinsically-motivated agents for exploration in textual environments. We argue that there is important synergy between text environments and autonomous agents. We identify key properties of text worlds that make them suitable for exploration by autonmous agents, namely, depth, breadth, progress niches and the ease… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  24. arXiv:2206.09674  [pdf, other

    cs.CL cs.AI cs.LG

    EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL

    Authors: Thomas Carta, Pierre-Yves Oudeyer, Olivier Sigaud, Sylvain Lamprier

    Abstract: Reinforcement learning (RL) in long horizon and sparse reward tasks is notoriously difficult and requires a lot of training steps. A standard solution to speed up the process is to leverage additional reward signals, shaping it to better guide the learning process. In the context of language-conditioned RL, the abstraction and generalisation properties of the language input provide opportunities f… ▽ More

    Submitted 13 October, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: 24 pages, 16 figures, 5 tables

  25. arXiv:2206.05060  [pdf, other

    cs.AI cs.MA cs.SI

    Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS

    Authors: Eleni Nisioti, Mateo Mahaut, Pierre-Yves Oudeyer, Ida Momennejad, Clément Moulin-Frier

    Abstract: Human culture relies on innovation: our ability to continuously explore how existing elements can be combined to create new ones. Innovation is not solitary, it relies on collective search and accumulation. Reinforcement learning (RL) approaches commonly assume that fully-connected groups are best suited for innovation. However, human laboratory and field studies have shown that hierarchical innov… ▽ More

    Submitted 18 November, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

  26. Language and Culture Internalisation for Human-Like Autotelic AI

    Authors: Cédric Colas, Tristan Karch, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Building autonomous agents able to grow open-ended repertoires of skills across their lives is a fundamental goal of artificial intelligence (AI). A promising developmental approach recommends the design of intrinsically motivated agents that learn new skills by generating and pursuing their own goals - autotelic agents. But despite recent progress, existing algorithms still show serious limitatio… ▽ More

    Submitted 16 November, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Journal ref: Nature Machine Intelligence 4, 1068-1076 (2022)

  27. arXiv:2205.06111  [pdf, other

    cs.AI cs.CL

    Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language

    Authors: Iou-Jen Liu, Xingdi Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer, Alexander G. Schwing

    Abstract: To solve difficult tasks, humans ask questions to acquire knowledge from external sources. In contrast, classical reinforcement learning agents lack such an ability and often resort to exploratory behavior. This is exacerbated as few present-day environments support querying for knowledge. In order to study how agents can be taught to query external knowledge via language, we first introduce two n… ▽ More

    Submitted 3 July, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: ICML 2022; Project page: https://ioujenliu.github.io/AFK/

  28. Conversational agents for fostering curiosity-driven learning in children

    Authors: Rania Abdelghani, Pierre-Yves Oudeyer, Edith Law, Catherine de Vulpillières, Hélène Sauzéon

    Abstract: Curiosity is an important factor that favors independent and individualized learning in children. Research suggests that it is also a competence that can be fostered by training specific metacognitive skills and information-searching behaviors. In this light, we develop a conversational agent that helps children generate curiosity-driven questions, and encourages their use to lead autonomous explo… ▽ More

    Submitted 12 April, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  29. arXiv:2201.11014  [pdf, other

    cs.CV cs.CL cs.LG

    Language-biased image classification: evaluation based on semantic representations

    Authors: Yoann Lemesle, Masataka Sawayama, Guillermo Valle-Perez, Maxime Adolphe, Hélène Sauzéon, Pierre-Yves Oudeyer

    Abstract: Humans show language-biased image recognition for a word-embedded image, known as picture-word interference. Such interference depends on hierarchical semantic categories and reflects that human language processing highly interacts with visual processing. Similar to humans, recent artificial models jointly trained on texts and images, e.g., OpenAI CLIP, show language-biased image classification. E… ▽ More

    Submitted 12 March, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: Accepted at ICLR 2022

  30. arXiv:2112.07342  [pdf, other

    cs.LG cs.AI cs.MA

    Learning to Guide and to Be Guided in the Architect-Builder Problem

    Authors: Paul Barde, Tristan Karch, Derek Nowrouzezahrai, Clément Moulin-Frier, Christopher Pal, Pierre-Yves Oudeyer

    Abstract: We are interested in interactive agents that learn to coordinate, namely, a $builder$ -- which performs actions but ignores the goal of the task, i.e. has no access to rewards -- and an $architect$ which guides the builder towards the goal of the task. We define and explore a formal setting where artificial agents are equipped with mechanisms that allow them to simultaneously learn a task while at… ▽ More

    Submitted 11 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: International Conference on Learning Representations (2022)

  31. Identifying Functions and Behaviours of Social Robots during Learning Activities: Teachers' Perspective

    Authors: Jessy Ceha, Edith Law, Dana Kulić, Pierre-Yves Oudeyer, Didier Roy

    Abstract: With advances in artificial intelligence, research is increasingly exploring the potential functions that social robots can play in education. As teachers are a critical stakeholder in the use and application of educational technologies, we conducted a study to understand teachers' perspectives on how a social robot could support a variety of learning activities in the classroom. Through interview… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: This is a preprint of an article published in The International Journal of Social Robotics. The final authenticated version is available online at: https://doi.org/10.1007/s12369-021-00820-7

  32. arXiv:2107.00956  [pdf, other

    cs.LG cs.AI cs.CL

    SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents

    Authors: Grgur Kovač, Rémy Portelas, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: Building embodied autonomous agents capable of participating in social interactions with humans is one of the main challenges in AI. Within the Deep Reinforcement Learning (DRL) field, this objective motivated multiple works on embodied language use. However, current approaches focus on language as a communication tool in very simplified and non-diverse social situations: the "naturalness" of lang… ▽ More

    Submitted 1 September, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

    Comments: under review. This paper extends and generalizes work in arXiv:2104.13207

  33. arXiv:2106.14421  [pdf, other

    cs.LG

    Causal Reinforcement Learning using Observational and Interventional Data

    Authors: Maxime Gasse, Damien Grasset, Guillaume Gaudron, Pierre-Yves Oudeyer

    Abstract: Learning efficiently a causal model of the environment is a key challenge of model-based RL agents operating in POMDPs. We consider here a scenario where the learning agent has the ability to collect online experiences through direct interactions with the environment (interventional data), but has also access to a large collection of offline experiences, obtained by observing another agent interac… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

  34. arXiv:2106.13871  [pdf, other

    cs.SD cs.GR cs.LG eess.AS

    Transflower: probabilistic autoregressive dance generation with multimodal attention

    Authors: Guillermo Valle-Pérez, Gustav Eje Henter, Jonas Beskow, André Holzapfel, Pierre-Yves Oudeyer, Simon Alexanderson

    Abstract: Dance requires skillful composition of complex movements that follow rhythmic, tonal and timbral features of music. Formally, generating dance conditioned on a piece of music can be expressed as a problem of modelling a high-dimensional continuous motion signal, conditioned on an audio signal. In this work we make two contributions to tackle this problem. First, we present a novel probabilistic au… ▽ More

    Submitted 11 June, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: Article presented at SIGGRAPH Asia 2021, and published in ACM Transactions on Graphics

  35. arXiv:2106.08858  [pdf, other

    cs.AI cs.CL cs.LG

    Grounding Spatio-Temporal Language with Transformers

    Authors: Tristan Karch, Laetitia Teodorescu, Katja Hofmann, Clément Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Language is an interface to the outside world. In order for embodied agents to use it, language must be grounded in other, sensorimotor modalities. While there is an extended literature studying how machines can learn grounded language, the topic of how to learn spatio-temporal linguistic concepts is still largely uncharted. To make progress in this direction, we here introduce a novel spatio-temp… ▽ More

    Submitted 11 October, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Contains main article and supplementaries

    Journal ref: Neurips 2021

  36. Towards Teachable Autotelic Agents

    Authors: Olivier Sigaud, Ahmed Akakzia, Hugo Caselles-Dupré, Cédric Colas, Pierre-Yves Oudeyer, Mohamed Chetouani

    Abstract: Autonomous discovery and direct instruction are two distinct sources of learning in children but education sciences demonstrate that mixed approaches such as assisted discovery or guided play result in improved skill acquisition. In the field of Artificial Intelligence, these extremes respectively map to autonomous agents learning from their own signals and interactive learning agents fully taught… ▽ More

    Submitted 20 March, 2023; v1 submitted 25 May, 2021; originally announced May 2021.

    Journal ref: Sigaud, O., Akakzia, A., Caselles-Dupré, H., Colas, C., Oudeyer, P. Y., & Chetouani, M. (2022). Towards Teachable Autotelic Agents. IEEE Transactions on Cognitive and Developmental Systems

  37. arXiv:2104.13207  [pdf, other

    cs.LG cs.AI

    SocialAI 0.1: Towards a Benchmark to Stimulate Research on Socio-Cognitive Abilities in Deep Reinforcement Learning Agents

    Authors: Grgur Kovač, Rémy Portelas, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: Building embodied autonomous agents capable of participating in social interactions with humans is one of the main challenges in AI. This problem motivated many research directions on embodied language use. Current approaches focus on language as a communication tool in very simplified and non diverse social situations: the "naturalness" of language is reduced to the concept of high vocabulary siz… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted at NAACL ViGIL Workshop 2021

  38. arXiv:2103.09815  [pdf, other

    cs.LG

    TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL

    Authors: Clément Romac, Rémy Portelas, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: Training autonomous agents able to generalize to multiple tasks is a key target of Deep Reinforcement Learning (DRL) research. In parallel to improving DRL algorithms themselves, Automatic Curriculum Learning (ACL) study how teacher algorithms can train DRL agents more efficiently by adapting task selection to their evolving abilities. While multiple standard benchmarks exist to compare DRL agents… ▽ More

    Submitted 9 June, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

  39. Intelligent behavior depends on the ecological niche: Scaling up AI to human-like intelligence in socio-cultural environments

    Authors: Manfred Eppe, Pierre-Yves Oudeyer

    Abstract: This paper outlines a perspective on the future of AI, discussing directions for machines models of human-like intelligence. We explain how developmental and evolutionary theories of human cognition should further inform artificial intelligence. We emphasize the role of ecological niches in sculpting intelligent behavior, and in particular that human intelligence was fundamentally shaped to adapt… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: Keywords: developmental AI, general artificial intelligence, human-like AI, embodiment, cultural evolution, language, socio-cultural skills

    Journal ref: KI - Künstliche Intelligenz KI - Künstliche Intelligenz (German Journal of Artificial Intelligence), 2021

  40. arXiv:2012.09830  [pdf, other

    cs.LG cs.AI

    Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey

    Authors: Cédric Colas, Tristan Karch, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: Building autonomous machines that can explore open-ended environments, discover possible interactions and build repertoires of skills is a general objective of artificial intelligence. Developmental approaches argue that this can only be achieved by $autotelic$ $agents$: intrinsically motivated learning agents that can learn to represent, generate, select and solve their own problems. In recent ye… ▽ More

    Submitted 12 July, 2022; v1 submitted 17 December, 2020; originally announced December 2020.

  41. arXiv:2011.08463  [pdf, other

    cs.LG cs.AI

    Meta Automatic Curriculum Learning

    Authors: Rémy Portelas, Clément Romac, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: A major challenge in the Deep RL (DRL) community is to train agents able to generalize their control policy over situations never seen in training. Training on diverse tasks has been identified as a key ingredient for good generalization, which pushed researchers towards using rich procedural task generation systems controlled through complex continuous parameter spaces. In such complex task space… ▽ More

    Submitted 1 September, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: This paper extends and generalizes work in arXiv:2004.03168

  42. arXiv:2010.07208  [pdf, other

    cs.SD eess.AS q-bio.NC

    Emergent Jaw Predominance in Vocal Development through Stochastic Optimization

    Authors: Clément Moulin-Frier, Jules Brochard, Freek Stulp, Pierre-Yves Oudeyer

    Abstract: Infant vocal babbling strongly relies on jaw oscillations, especially at the stage of canonical babbling, which underlies the syllabic structure of world languages. In this paper, we propose, model and analyze an hypothesis to explain this predominance of the jaw in early babbling. This hypothesis states that general stochastic optimization principles, when applied to learning sensorimotor control… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Journal ref: IEEE Transactions on Cognitive and Developmental Systems (Volume: 12 , Issue: 3 , Sept. 2020)

  43. arXiv:2010.04452  [pdf, other

    cs.LG math.OC q-bio.PE

    EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models

    Authors: Cédric Colas, Boris Hejblum, Sébastien Rouillon, Rodolphe Thiébaut, Pierre-Yves Oudeyer, Clément Moulin-Frier, Mélanie Prague

    Abstract: Epidemiologists model the dynamics of epidemics in order to propose control strategies based on pharmaceutical and non-pharmaceutical interventions (contact limitation, lock down, vaccination, etc). Hand-designing such strategies is not trivial because of the number of possible interventions and the difficulty to predict long-term effects. This task can be cast as an optimization problem where sta… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Journal ref: Journal of Artificial Intelligence Research-2021

  44. arXiv:2008.04388  [pdf

    cs.LG cs.AI cs.RO stat.ML

    GRIMGEP: Learning Progress for Robust Goal Sampling in Visual Deep Reinforcement Learning

    Authors: Grgur Kovač, Adrien Laversanne-Finot, Pierre-Yves Oudeyer

    Abstract: Designing agents, capable of learning autonomously a wide range of skills is critical in order to increase the scope of reinforcement learning. It will both increase the diversity of learned skills and reduce the burden of manually designing reward functions for each skill. Self-supervised agents, setting their own goals, and trying to maximize the diversity of those goals have shown great promise… ▽ More

    Submitted 7 November, 2022; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: Published in IEEE Transactions on Cognitive and Developmental Systems

    ACM Class: I.2.6

  45. arXiv:2007.01195  [pdf, other

    cs.LG cs.AI nlin.CG stat.ML

    Hierarchically Organized Latent Modules for Exploratory Search in Morphogenetic Systems

    Authors: Mayalen Etcheverry, Clement Moulin-Frier, Pierre-Yves Oudeyer

    Abstract: Self-organization of complex morphological patterns from local interactions is a fascinating phenomenon in many natural and artificial systems. In the artificial world, typical examples of such morphogenetic systems are cellular automata. Yet, their mechanisms are often very hard to grasp and so far scientific discoveries of novel patterns have primarily been relying on manual tuning and ad hoc ex… ▽ More

    Submitted 2 September, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (NeurIPS 2020 - oral)

  46. arXiv:2006.07185  [pdf, other

    cs.AI cs.LG stat.ML

    Grounding Language to Autonomously-Acquired Skills via Goal Generation

    Authors: Ahmed Akakzia, Cédric Colas, Pierre-Yves Oudeyer, Mohamed Chetouani, Olivier Sigaud

    Abstract: We are interested in the autonomous acquisition of repertoires of skills. Language-conditioned reinforcement learning (LC-RL) approaches are great tools in this quest, as they allow to express abstract goals as sets of constraints on the states. However, most LC-RL agents are not autonomous and cannot learn without external instructions and feedback. Besides, their direct language condition cannot… ▽ More

    Submitted 25 January, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2021

  47. arXiv:2006.07043  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Language-Conditioned Goal Generation: a New Approach to Language Grounding for RL

    Authors: Cédric Colas, Ahmed Akakzia, Pierre-Yves Oudeyer, Mohamed Chetouani, Olivier Sigaud

    Abstract: In the real world, linguistic agents are also embodied agents: they perceive and act in the physical world. The notion of Language Grounding questions the interactions between language and embodiment: how do learning agents connect or ground linguistic representations to the physical world ? This question has recently been approached by the Reinforcement Learning community under the framework of i… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  48. arXiv:2005.06369  [pdf, other

    cs.LG cs.AI stat.ML

    Progressive growing of self-organized hierarchical representations for exploration

    Authors: Mayalen Etcheverry, Pierre-Yves Oudeyer, Chris Reinke

    Abstract: Designing agent that can autonomously discover and learn a diversity of structures and skills in unknown changing environments is key for lifelong machine learning. A central challenge is how to learn incrementally representations in order to progressively build a map of the discovered structures and re-use it to further explore. To address this challenge, we identify and target several key functi… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  49. arXiv:2004.04546  [pdf, other

    cs.LG cs.CV stat.ML

    SpatialSim: Recognizing Spatial Configurations of Objects with Graph Neural Networks

    Authors: Laetitia Teodorescu, Katja Hofmann, Pierre-Yves Oudeyer

    Abstract: Recognizing precise geometrical configurations of groups of objects is a key capability of human spatial cognition, yet little studied in the deep learning literature so far. In particular, a fundamental problem is how a machine can learn and compare classes of geometric spatial configurations that are invariant to the point of view of an external observer. In this paper we make two key contributi… ▽ More

    Submitted 16 July, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

  50. Pedagogical Agents for Fostering Question-Asking Skills in Children

    Authors: Mehdi Alaimi, Edith Law, Kevin Daniel Pantasdo, Pierre-Yves Oudeyer, Helene Sauzeon

    Abstract: Question asking is an important tool for constructing academic knowledge, and a self-reinforcing driver of curiosity. However, research has found that question asking is infrequent in the classroom and children's questions are often superficial, lacking deep reasoning. In this work, we developed a pedagogical agent that encourages children to ask divergent-thinking questions, a more complex form o… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted at CHI 2020

    ACM Class: K.3.0; K.3.1; K.4.2; J.0; J.4