This work is aimed at the larger question of how to build a more open ended learning system. Today, much of the learning-based research in robotics is targeted at training a robot to learn a specific task, model, representation, et cetera. Often the researcher decides a priori what task the robot is to learn (such as navigating around an office environment) and then sets out to engineer the learning task accordingly. The learning task is completed once the robot can perform the task to a desired measure of success. However, because the learning algorithm is carefully tailored to a specific task, a new learning algorithm must be painstakingly designed for the robot to learn a different task. The design of learning algorithms for robots is a labor intensive process, and it is proving difficult to scale current techniques to more complex tasks in more complex environments.
In contrast, this work explores how to design a more open ended learning system. To this end, it is heavily inspired by the theories, observations, and experimental results of child developmental psychology. The heart of this research is to figure out how to design an integrated learning system such that the learner can bootstrap from previously acquired skills and cognitive structures to learn new, more diverse, and more sophisticated skills. Human infants are the prime exhibitors of the kinds of learning we want our system to emulate, often characterized as having a developmental profile where earlier skills and competencies are progressively modified, adapted, and built upon to produce more sophisticated, diverse, or new abilities.
The interaction between learner and caretaker forms a mutually regulating process. Using emotional feedback from the infant, the caregiver orchestrates the learning episode to suit the learner's current level of sophistication. For instance, if the learner is over-stimulated (too overwhelmed by environmental complexity), the caretaker must simplify or even pause the learning episode. Alternatively, if the learner appears bored, the caretaker introduces either a bit more variety or a little more difficulty to the learning episode. Over the course of learning, the learner constructs internal structures to implement more sophisticated skills and competencies. As a result, it is capable of handling a slightly more complex environment. Hence, a balance is maintained where the learner is always sufficiently challenged to learn beyond what it already has, but is never completely overwhelmed so that is has little chance of learning anything. As development proceeds, new goals are learned as interesting outcomes are discovered as well as different means to achieve them.
From a broader perspective, this research not only aims at trying to build an open ended learning system, but also to build a system that humans can interact with and train in a natural, instinctive manner. Humans are highly social creatures and use a variety of cues and modalities to communicate with each other. Building systems that can exploit and understand similar social cues could make machines easier for people to use, and enable humans to communicate with machines in richer ways.