Mechanisms of imitation and social matching play a fundamental role in development, communication, interaction, learning and culture. Their investigation in different agents (animals, humans and robots) has significantly influenced our understanding of the nature and origins of social intelligence. Whilst such issues have traditionally been studied in areas such as psychology, biology and ethology, it has become increasingly recognized that a ‘constructive approach’ towards imitation and social learning via the synthesis of artificial agents can provide important insights into mechanisms and create artifacts that can be instructed and taught by imitation, demonstration and social interaction rather than by explicit programming. This book studies increasingly sophisticated models and mechanisms of social matching behaviour and marks an important step towards the development of an interdisciplinary research field, consolidating and providing a valuable reference for the increasing number of researchers in the field of imitation and social learning in robots, humans and animals.
CHRYSTOPHER L. NEHANIV is Research Professor of Mathematical and Evolutionary Computer Sciences in the School of Computer Science at the University of Hertfordshire, where he works with the Adaptive Systems, Algorithms and BioComputation Research Groups. He is the Director of the UK EPSRC Network on Evolvability in Biological and Software Systems and an Associate Editor of BioSystems: Journal of Biological and Information Processing Sciences and Interaction Studies: Social Behaviour and Communication in Biological and Artificial Systems.
KERSTIN DAUTENHAHN is Research Professor of Artificial Intelligence in the School of Computer Science at the University of Hertfordshire, where she is a coordinator of the Adaptive Systems Research Group. Her research interests include social learning, human–robot interaction, social robotics, narrative and robotic-assisted therapy for children with autism. She is the Editor-in-Chief of Interaction Studies: Social Behaviour and Communication in Biological and Artificial Systems and the general chair of the IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN 2006).
Edited by
Chrystopher L. Nehaniv and Kerstin Dautenhahn
CAMBRIDGE UNIVERSITY PRESS
Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo
Cambridge University Press
The Edinburgh Building, Cambridge CB2 8RU, UK
Published in the United States of America by Cambridge University Press, New York
www.cambridge.org
Information on this title: www.cambridge.org/9780521845113
© Cambridge University Press 2007
This publication is in copyright. Subject to statutory exception
and to the provisions of relevant collective licensing agreements,
no reproduction of any part may take place without
the written permission of Cambridge University Press.
First published 2007
Printed in the United Kingdom at the University Press, Cambridge
A catalogue record for this publication is available from the British Library
Library of Congress cataloguing in publication data
Imitation and social learning in robots, humans and animals: behavioural, social and
communicative dimensions / edited by Chrystopher L. Nehaniv and Kerstin Dautenhahn.
p. cm.
Includes bibliographical references and index.
ISBN-13: 978-0-521-84511-3 (hardback)
1. Robotics. 2. Robots – Control systems. I. Nehaniv, Chrystopher L., 1963–.
II. Dautenhahn, Kerstin.
TJ211.I38 2006
629.8′92 – dc22
2206023587
ISBN 978-0-521-84511-3 hardback
Cambridge University Press has no responsibility for the persistence or accuracy of URLs
for external or third-party internet websites referred to in this publication, and does not
guarantee that any content on such websites is, or will remain, accurate or appropriate.
To our parents,
Chrystyna, Bohdan, Annelie and Claus-Peter
| List of plates | page x | ||
| List of figures | xii | ||
| List of tables | xvii | ||
| List of contributors | xiii | ||
| Introduction: the constructive interdisciplinary viewpoint for understanding mechanisms and models of imitation and social learning |
1 | ||
| CHRYSTOPHER L. NEHANIV AND KERSTIN DAUTENHAHN |
|||
| Part I | Correspondence problems and mechanisms | 19 | |
| 1 | Imitation: thoughts about theories | 23 | |
| GEOFFREY BIRD AND CECILIA HEYES | |||
| 2 | Nine billion correspondence problems | 35 | |
| CHRYSTOPHER L. NEHANIV | |||
| 3 | Challenges and issues faced in building a framework for conducting research in learning from observation |
47 | |
| DARRIN BENTIVEGNA, CHRISTOPHER ATKESON AND GORDON CHENG |
|||
| Part II | Mirroring and ‘mind-reading’ | 67 | |
| 4 | A neural architecture for imitation and intentional relations |
71 | |
| MARCO IACOBONI, JONAS KAPLAN AND STEPHEN WILSON |
|||
| 5 | Simulation theory of understanding others: a robotics perspective |
89 | |
| YIANNIS DEMIRIS AND MATTHEW JOHNSON | |||
| 6 | Mirrors and matchings: imitation from the perspective of mirror-self-recognition, and the parietal region’s involvement in both |
103 | |
| ROBERT W. MITCHELL | |||
| Part III | What to imitate? | 131 | |
| 7 | The question of ‘what to imitate’: inferring goals and intentions from demonstrations |
135 | |
| MALINDA CARPENTER AND JOSEP CALL | |||
| 8 | Learning of gestures by imitation in a humanoid robot | 153 | |
| SYLVAIN CALINON AND AUDE BILLARD | |||
| 9 | The dynamic emergence of categories through imitation | 179 | |
| TONY BELPAEME, BART DE BOER AND BART JANSEN | |||
| Part IV | Development and embodiment | 195 | |
| 10 | Copying strategies by people with autistic spectrum disorder: why only imitation leads to social cognitive development |
199 | |
| JUSTIN H. G. WILLIAMS | |||
| 11 | A Bayesian model of imitation in infants and robots | 217 | |
| RAJESH P. N. RAO, AARON P. SHON AND ANDREW N. MELTZOFF |
|||
| 12 | Solving the correspondence problem in robotic imitation across embodiments: synchrony, perception and culture in artifacts |
249 | |
| ARIS ALISSANDRAKIS, CHRYSTOPHER L. NEHANIV AND KERSTIN DAUTENHAHN |
|||
| Part V | Synchrony and turn-taking as communicative mechanisms |
275 | |
| 13 | How to build an imitator | 279 | |
| ARNAUD REVEL AND JACQUELINE NADEL | |||
| 14 | Simulated turn-taking and development of styles of motion |
301 | |
| TAKASHI IKEGAMI AND HIROKI IIZUKA | |||
| 15 | Bullying behaviour, empathy and imitation: an attempted synthesis |
323 | |
| KERSTIN DAUTENHAHN, SARAH N. WOODS AND CHRISTINA KAOURI |
|||
| Part VI | Why imitate? – Motivations | 341 | |
| 16 | Multiple motivations for imitation in infancy | 343 | |
| MARK NIELSEN AND VIRGINIA SLAUGHTER | |||
| 17 | The progress drive hypothesis: an interpretation of early imitation |
361 | |
| FRéDéRIC KAPLAN AND PIERRE-YVES OUDEYER | |||
| Part VII | Social feedback | 379 | |
| 18 | Training behavior by imitation: from parrots to people . . . to robots? |
383 | |
| IRENE M. PEPPERBERG AND DIANE V. SHERMAN | |||
| 19 | Task learning through imitation and human–robot interaction |
407 | |
| MONICA N. NICOLESCU AND MAJA J. MATARI | |||
| Part VIII | The ecological context | 425 | |
| 20 | Emulation learning: the integration of technical and social cognition |
427 | |
| LUDWIG HUBER | |||
| 21 | Mimicry as deceptive resemblance: beyond the one-trick ponies |
441 | |
| MARK D. NORMAN AND TOM TREGENZA | |||
| Index | 455 |
1. Figure 20.1 Curious keas (Nestor notabilis) in Mt Cook
National Park, New Zealand.
(Photos by D. Werdenich and B. Voelkl)
2. Figure 21.1 Leaf mimicry.
a. Australian Leaf-wing butterfly, Dolleschallia bisaltide;
b. Cockatoo waspfish, Ablabys taenianotus;
c. Juvenile round batfish, Platax orbicularis;
d. Broadclub cuttlefish, Sepia latimanus, ‘impersonating’
dead mangrove leaf;
e. Same individual in resting colour pattern.
(Photos by M. Norman)3. Figure 21.2 Camouflage and deceptive resemblance in
cephalopods.
a. Octopus sp. 5 (Norman, 2000) active in rock pools;
b. Same individual moments later in camouflage;
c. Hairy octopus (Octopus sp. 16 (Norman, 2000)) showing branching skin sculpture;
d. Snail mimic octopus (Octopus sp. 6 (Gosliner et al., 1996));
e. Giant cuttlefish, Sepia apama – sneaker male (centre) impersonating female amongst breeding pair (male on right)
(Photos a, b & e by M. Norman; c by Becca Saunders and d by M. Severns)4. Figure 21.3 Mimic octopus (Octopus sp. 19 (Norman, 2000)) and models.
a. Sentinel state in mouth of burrow;
b. Normal foraging colour pattern;
c. Flatfish mimicry;
d. Flatfish model, banded sole (Zebrias sp.);
e. Lionfish mimicry;
f. Lionfish model (Pterois sp.);
g. Sea snake mimicry;
h. Sea snake model, banded sea snake (Laticauda sp.).
(Photos a, b, d, e & f by M. Norman and c, g & h by R. Steene)5. Figure 3.3 The software air hockey game on the left and air hockey playing with a humanoid robot on the right.
6. Figure 3.5 Six manually defined configurations are used to interpolate the body configuration needed to place the paddle at a specific location.
7. Figure 7.1a and b The demonstrator performing the action (a) in a happy, satisfied manner and (b) in a frustrated, dissatisfied manner in Behne and colleagues’ (2006b) study.
8. Figure 7.2a and b The demonstrator (a) attending and (b) not attending to her action in Behne and colleagues’ (2006a) study.
| 1.1 | Schematic representation of the associative sequence learning theory of imitation learning. |
page 26 | |
| 3.1 | Three part framework for learning using primitives. | 50 | |
| 3.2 | Software and hardware marble maze environments. | 51 | |
| 3.3 | Software air hockey game and playing air hockey with humanoid robot. |
51 | |
| 3.4 | Primitives being explored in the marble maze. | 52 | |
| 3.5 | Manually defined configurations of robot used to interpolate configuration for paddle placement. |
53 | |
| 3.6 | Raw observed data, wall contact and recognized primitives. |
55 | |
| 3.7 | Performance on marble maze. | 56 | |
| 3.8 | Paddle trajectories and desired paths in shot maneuvers. | 58 | |
| 4.1 | Time series in a human premotor area for imitation and action observation. |
73 | |
| 4.2 | Anatomical locations of brain regions implicated in the neural architecture for imitation. |
76 | |
| 4.3 | The neural architecture for imitation mapped onto the functional elements of intentional relations theory. |
78 | |
| 5.1 | Inverse and forward models paired for action recognition. |
93 | |
| 5.2 | The robot imitating picking up an object. | 95 | |
| 5.3 | Arrangement of primitives in graphed inverse models: “Pick up object” and “Place object”. |
95 | |
| 5.4 | Confidences of inverse models while the robot learns to pick up an object. |
98 | |
| 7.1 | The demonstrator performing the action in a happy, satisfied manner and in a frustrated, dissatisfied manner in Behne and colleagues’ study. |
141 | |
| 7.2 | The demonstrator attending and not attending to her action in Behne and colleagues’ study. |
141 | |
| 7.3 | Components and processes of a flexible imitation system. |
145 | |
| 8.1 | Demonstration and reproduction of different tasks. | 155 | |
| 8.2 | Demonstration and reproduction of drawing the three stylized alphabet letters A, B and C. |
156 | |
| 8.3 | Schematic of the sensory-motor flow. | 158 | |
| 8.4 | Encoding of the hand path in Cartesian space and joint angles trajectories in a HMM. |
160 | |
| 8.5 | BIC criterion is used to determine the optimal number of states of the HMM required to encode the data. |
161 | |
| 8.6 | Example of the retrieval process. | 163 | |
| 8.7 | Function used to transform a standard deviation to a weight factor w in [0, 1]. |
164 | |
| 8.8 | Four demonstrations of the drinking gesture and the knocking gesture. |
165 | |
| 8.9 | Comparison of two metrics to evaluate the quality of a reproduced trajectory. |
166 | |
| 8.10 | Demonstration, transformation by PCA, encoding in HMM and retrieval of the two different motions waving goodbye and drawing letter B. |
168 | |
| 8.11 | Cost function when testing the gestures of the test set with the different HMMs. |
169 | |
| 8.12 | Byrne’s string parsing imitation model. |
172 | |
| 8.13 | Heyes and Ray’s associative sequence learning (ASL) model. | 173 | |
| 8.14 | Nehaniv and Dautenhahn’s algebraic framework to map states, effects and/or actions of the demonstrator and the imitator. |
174 | |
| 9.1 | An agent has categories made up of action and observation. |
183 | |
| 9.2 | Schematic outline of the imitation game. | 184 | |
| 9.3 | Agents use a robot arm to execute actions and a stereo camera to observe actions. |
185 | |
| 9.4 | Results for 50 000 imitation games between two robots. | 189 | |
| 9.5 | Imitative success and number of actions learnt in a simulation of the imitation game. |
190 | |
| 10.1 | Glass brains showing areas of brain activation during fMRI that were greater during the imitation condition than the spatial cue-execution condition. |
209 | |
| 11.1 | Imitative responses in 2- to 3-week-old infants. | 220 | |
| 11.2 | A 14-month-old infant imitating the novel action of touching a panel with the forehead. |
222 | |
| 11.3 | Infants as young as 14 months old can imitate actions on objects as seen on TV; infants can also perform deferred imitation based on actions observed on TV the previous day. |
224 | |
| 11.4 | Infants attributed goals and intentions to human but not to inanimate device performing unsuccessful act. |
225 | |
| 11.5 | Simulated maze environment and learned forward model. |
233 | |
| 11.6 | Learned priors and example of successful imitation. | 235 | |
| 11.7 | Inferring the intent of the teacher. | 236 | |
| 11.8 | Robotic platforms for testing Bayesian imitation models. | 238 | |
| 11.9 | Meltzoff and Moore’s AIM model of facial imitation. | 240 | |
| 12.1 | The Action Learning via Imitation between Corresponding Embodiments (ALICE) framework. |
251 | |
| 12.2 | Two CHESSWORLD examples. | 254 | |
| 12.3 | The ALICE framework as realized in the CHESSWORLD testbed. |
255 | |
| 12.4 | Example embodiment of a robotic arm agent. | 256 | |
| Different examples of behaviours. | 257 | ||
| 12.6 | An example of using the metrics to compare actions, states (before and after) and effects between two agents. |
258 | |
| 12.7 | The ALICE framework as realized in the RABIT robotic-arm testbed. |
261 | |
| 12.8 | An example of social transmission. | 262 | |
| 12.9 | Examples of emerging ‘proto-culture’. | 263 | |
| 12.10 | More examples of emerging ‘proto-culture’. | 263 | |
| 12.11 | Experiments comparing the use of synchronization. | 265 | |
| 12.12 | Experiments comparing using and not using proprioception. |
267 | |
| 12.13 | Experiments comparing the use of loose perceptual matching. |
268 | |
| 12.14 | Example of an agent imitating with a changing embodiment. |
269 | |
| 12.15 | Another example of an agent imitating with a changing embodiment. |
270 | |
| 12.16 | Yet another example of an agent imitating with a changing embodiment. |
270 | |
| 12.17 | Qualitative effect of the metric used by an imitating agent that changes embodiment. |
271 | |
| 13.1 | Simple sensorimotor connections allow the building of ‘reflex’ behaviours. |
281 | |
| 13.2 | Any low-level processing allows the building of ‘perceptive’ information. |
281 | |
| 13.3 | Complete PerAc architecture | 283 | |
| 13.4 | Mother–infant interaction. | 289 | |
| 13.5 | Temporal delay for holds and discards. | 290 | |
| 13.6 | Two PerAc systems S1 and S2 in interaction. | 292 | |
| 13.7 | An imitating behaviour emerges. | 293 | |
| 13.8 | Child A’s hat has slipped behind his back. Child B seeks for his hat behind his back although his hat is still on his head. |
296 | |
| 13.9 | Each system has its own motivations which are only accessible to the other system via the interaction. |
296 | |
| 14.1 | Schematic view of the mobile agent with two wheels, and two mobile agents interact to perform turn-taking behaviour. |
304 | |
| 14.2 | An agent has a three-layered recurrent network. | 305 | |
| 14.3 | The fitness value of the best agent as a function of GA generations. |
307 | |
| 14.4 | Spatial patterns of turn-taking behavior observed in the simulations. |
309 | |
| 14.5 | Spatial trails of the agents with sequential random inputs for each 100 steps. |
310 | |
| 14.6 | Prediction performance and turns for an agent after 10 000 GA generations. |
311 | |
| 14.7 | Spatial trails of agents from 3000th and 9000th generation in interactions with a controlled agent to constant output. |
312 | |
| 14.8 | Spatial patterns of the original pairs and newly coupled agents at 7000, 8000 and 10 000 GA generations. |
313 | |
| 14.9 | Adaptability is computed for each agent from populations A and B, and gradually increases with later GA generations. |
315 | |
| 15.1 | Sketch of possible connections between imitation, empathy and bullying behaviour. |
332 | |
| 16.1 | Cumulative percentage of infants showing evidence of immediate imitation, deferred imitation and synchronic imitation. |
350 | |
| 17.1 | The critic uses a predictor P and a metapredictor metaP. | 365 | |
| 17.2 | The screen problem. | 367 | |
| 17.3 | Evolution of the average values of the Noise and Reset buttons. |
370 | |
| 17.4 | Structure of local experts built during the experiments. | 370 | |
| 17.5 | Tree-like structure showing how the child could progressively organize its sensorimotor space. |
373 | |
| 19.1 | Structure of the inputs/outputs of an abstract and primitive behaviour. |
409 | |
| 19.2 | Example of a behaviour network. | 409 | |
| 19.3 | The human–robot interaction experiments setup. | 411 | |
| 19.4 | Structure of the environment and course of demonstration. |
415 | |
| 19.5 | Evolution of the task representation over three successive demonstrations. |
416 | |
| 19.6 | Structure of the environment and course of task execution during practice and feedback. |
417 | |
| 19.7 | Topologies obtained after practice and feedback. | 418 | |
| 20.1 | Curious keas (Nestor notabilis). | 431 | |
| 21.1 | Leaf mimicry. | 444 | |
| 21.2 | Camouflage and deceptive resemblance in cephalopods. | 447 | |
| 21.3 | Mimic octopus (Octopus sp. 19 (Norman, 2000)) and models. |
450 |
| 2.1 Classes of correspondence problems arising from particular combinations of granularities and metrics. |
page 38 | ||
| 10.1 Brain area activation clusters (controls and ASD group) | 215 | ||
| 10.2 Brain area activation clusters (group differences – control >ASD) |
216 | ||
| 10.3 Brain area activation clusters (group differences – ASD >control) |
216 | ||
| 18.1 Effects of training method | 389 |
ARIS ALISSANDRAKIS University of Hertfordshire, Adaptive Systems Research Group, UK
CHRISTOPHER ATKESON ATR Computational Neuroscience Lab- oratories, Department of Humanoid Robotics and Computational Neuroscience, Kyoto, Japan and Carnegie Mellon University, Robotics Institute, Pittsburgh, USA
TONY BELPAEME University of Plymouth, School of Computing, Communications and Electronics, UK
DARRIN BENTIVEGNA ATR Computational Neuroscience Lab- oratories, Department of Humanoid Robotics and Computational Neuroscience, Kyoto, Japan and Computational Brain Project, ICORP, Japan Science and Technology Agency, Kyoto, Japan
AUDE BILLARD Swiss Federal Institute of Technology Lausanne (EPFL), Autonomous Systems Lab, Switzerland
GEOFFREY BIRD Department of Psychology and Institute of Cognitive Neuroscience, University College London, UK
JOSEP CALL Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
SYLVAIN CALINON Swiss Federal Institute of Technology Lausanne (EPFL), Autonomous Systems Lab, Switzerland
MALINDA CARPENTER Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
GORDON CHENG ATR Computational Neuroscience Laboratories, Department of Humanoid Robotics and Computational Neuroscience, Kyoto, Japan and Computational Brain Project, ICORP, Japan Science and Technology Agency, Kyoto, Japan
KERSTIN DAUTENHAHN University of Hertfordshire, Adaptive Systems Research Group, UK
BART DE BOER Rijksuniversiteit Groningen, Kunstmatige Intelligentie, the Netherlands
YIANNIS DEMIRIS Imperial College London, Department of Electrical and Electronic Engineering, UK
CECILIA HEYES Department of Psychology and Institute of Cognitive Neuroscience, University College London, UK
LUDWIG HUBER University of Vienna, Institute of Zoology, Austria
MARCO IACOBONI University of California, Department of Psychiatry and Biobehavioral Sciences, Neuropsychiatric Institute and Brain Research Institute, UK
HIROYUKI IIZUKA University of Tokyo, Department of General Systems Sciences, Japan
TAKASHI IKEGAMI University of Tokyo, Department of General Systems Sciences, Japan
BART JANSEN Vrije Universiteit Brussel (VUB), Artificial Intelligence Lab, Belgiam
MATTHEW JOHNSON Imperial College London, Department of Electrical and Electronic Engineering, UK
CHRISTINA KAOURI University of Hertfordshire, Adaptive Systems Research Group, UK
FRéDéRIC KAPLAN Sony Computer Science Laboratory, Paris, France
JONAS KAPLAN University of California, FPR-UCLA Center for Culture, Brain and Development, Department of Psychology, USA
MAJA J. MATARIć University of Southern California, Computer Science Department, USA
ANDREW N. MELTZOFF University of Washington, Institute for Learning and Brain Sciences, Seattle, USA
ROBERT W. MITCHELL Eastern Kentucky University, Department of Psychology, USA
JACQUELINE NADEL CNRS, Group Development and Psychopathology, France
CHRYSTOPHER L. NEHANIV University of Hertfordshire, Adaptive Systems and Algorithms Research Groups, UK
MONICA N. NICOLESCU University of Nevada, Department of Computer Science and Engineering, USA
MARK NIELSEN University of Queensland, School of Psychology, Early Cognitive Development Unit, Australia
MARK D. NORMAN Museum Victoria, Department of Marine Invertebrates, Australia
PIERRE-YVES OUDEYER Sony Computer Science Laboratory, Paris
IRENE M. PEPPERBERG MIT School of Architecture and Planning and Brandeis University, Department of Psychology, USA
RAJESH P. N. RAO University of Washington, Department of Computer Science and Engineering, USA
ARNAUD REVEL CNRS, Group ETIS, France
DIANE V. SHERMAN New-Found Therapies, Inc., USA
AARON P. SHON University of Washington, Department of Computer Science and Engineering, USA
VIRGINIA SLAUGHTER University of Queensland, School of Psychology, Early Cognitive Development Unit, Australia
TOM TREGENZA University of Exeter, School of Biosciences, Centre for Ecology & Conservation, UK
JUSTIN H. G. WILLIAMS University of Aberdeen Medical School, Department of Child Health, UK
STEPHEN WILSON University of California, Department of Psychiatry and Biobehavioral Sciences, Brain Research Institute, USA
SARAH N. WOODS University of Hertfordshire, Adaptive Systems Research Group, UK