Abstract:
An apparatus and methods for training and/or operating a robotic device to perform a composite task comprising a plurality of subtasks. Subtasks may be arranged in a hierarchy. Individual tasks of the hierarchy may be operated by a respective learning controller. Individual learning controllers may interface to appropriate components of feature extractor configured to detect features in sensory input. Individual learning controllers may be trained to produce activation output based on occurrence of one or more relevant features and using training input. Output of a higher level controller may be provided as activation indication to one or more lower level controllers. Inactive activation indication may be utilized to deactivate one or more components thereby improving operational efficiency. Output of a given feature extractor may be shared between two or more learning controllers.
Abstract:
Robotic devices may be trained by a trainer guiding the robot along a target trajectory using physical contact with the robot. The robot may comprise an adaptive controller configured to generate control commands based on one or more of the trainer input, sensory input, and/or performance measure. The trainer may observe task execution by the robot. Responsive to observing a discrepancy between the target behavior and the actual behavior, the trainer may provide a teaching input via a haptic action. The robot may execute the action based on a combination of the internal control signal produced by a learning process of the robot and the training input. The robot may infer the teaching input based on a comparison of a predicted state and actual state of the robot. The robot's learning process may be adjusted in accordance with the teaching input so as to reduce the discrepancy during a subsequent trial.
Abstract:
Robotic devices may be trained by a user guiding the robot along target action trajectory using an input signal. A robotic device may comprise an adaptive controller configured to generate control signal based on one or more of the user guidance, sensory input, performance measure, and/or other information. Training may comprise a plurality of trials, wherein for a given context the user and the robot's controller may collaborate to develop an association between the context and the target action. Upon developing the association, the adaptive controller may be capable of generating the control signal and/or an action indication prior and/or in lieu of user input. The predictive control functionality attained by the controller may enable autonomous operation of robotic devices obviating a need for continuing user guidance.
Abstract:
Apparatus and methods for a modular robotic device with artificial intelligence that is receptive to training controls. In one implementation, modular robotic device architecture may be used to provide all or most high cost components in an autonomy module that is separate from the robotic body. The autonomy module may comprise controller, power, actuators that may be connected to controllable elements of the robotic body. The controller may position limbs of the toy in a target position. A user may utilize haptic training approach in order to enable the robotic toy to perform target action(s). Modular configuration of the disclosure enables users to replace one toy body (e.g., the bear) with another (e.g., a giraffe) while using hardware provided by the autonomy module. Modular architecture may enable users to purchase a single AM for use with multiple robotic bodies, thereby reducing the overall cost of ownership.
Abstract:
Robots have the capacity to perform a broad range of useful tasks, such as factory automation, cleaning, delivery, assistive care, environmental monitoring and entertainment. Enabling a robot to perform a new task in a new environment typically requires a large amount of new software to be written, often by a team of experts. It would be valuable if future technology could empower people, who may have limited or no understanding of software coding, to train robots to perform custom tasks. Some implementations of the present invention provide methods and systems that respond to users' corrective commands to generate and refine a policy for determining appropriate actions based on sensor-data input. Upon completion of learning, the system can generate control commands by deriving them from the sensory data. Using the learned control policy, the robot can behave autonomously.
Abstract:
Apparatus and methods for training and controlling of e.g., robotic devices. In one implementation, a robot may be utilized to perform a target task characterized by a target trajectory. The robot may be trained by a user using supervised learning. The user may interface to the robot, such as via a control apparatus configured to provide a teaching signal to the robot. The robot may comprise an adaptive controller comprising a neuron network, which may be configured to generate actuator control commands based on the user input and output of the learning process. During one or more learning trials, the controller may be trained to navigate a portion of the target trajectory. Individual trajectory portions may be trained during separate training trials. Some portions may be associated with robot executing complex actions and may require additional training trials and/or more dense training input compared to simpler trajectory actions.
Abstract:
Sensory encoder may be implemented. Visual encoder apparatus may comprise spiking neuron network configured to receive photodetector input. Excitability of neurons may be adjusted and output spike may be generated based on the input. When neurons generate spiking response, spiking threshold may be dynamically adapted to produce desired output rate. The encoder may dynamically adapt its input range to match statistics of the input and to produce output spikes at an appropriate rate and/or latency. Adaptive input range adjustment and/or spiking threshold adjustment collaborate to enable recognition of features in sensory input of varying dynamic range.
Abstract:
Apparatus and methods for activity based plasticity in a spiking neuron network adapted to process sensory input. In one embodiment, the plasticity mechanism may be configured for example based on activity of one or more neurons providing feed-forward stimulus and activity of one or more neurons providing inhibitory feedback. When an inhibitory neuron generates an output, inhibitory connections may be potentiated. When an inhibitory neuron receives inhibitory input, the inhibitory connection may be depressed. When the inhibitory input arrives subsequent to the neuron response, the inhibitory connection may be depressed. When input features are unevenly distributed in occurrence, the plasticity mechanism is capable of reducing response rate of neurons that develop receptive fields to more prevalent features. Such functionality may provide network output such that rarely occurring features are not drowned out by more widespread stimulus.
Abstract:
A robotic device may comprise an adaptive controller configured to learn to predict consequences of robotic device's actions. During training, the controller may receive a copy of the planned and/or executed motor command and sensory information obtained based on the robot's response to the command. The controller may predict sensory outcome based on the command and one or more prior sensory inputs. The predicted sensory outcome may be compared to the actual outcome. Based on a determination that the prediction matches the actual outcome, the training may stop. Upon detecting a discrepancy between the prediction and the actual outcome, the controller may provide a continuation signal configured to indicate that additional training may be utilized. In some classification implementations, the discrepancy signal may be used to indicate occurrence of novel (not yet learned) objects in the sensory input and/or indicate continuation of training to recognize said objects.
Abstract:
Robotic devices may be trained by a user guiding the robot along a target trajectory using a correction signal. A robotic device may comprise an adaptive controller configured to generate control commands based on one or more of the trainer input, sensory input, and/or performance measure. Training may comprise a plurality of trials. During an initial portion of a trial, the trainer may observe robot's operation and refrain from providing the training input to the robot. Upon observing a discrepancy between the target behavior and the actual behavior during the initial trial portion, the trainer may provide a teaching input (e.g., a correction signal) configured to affect robot's trajectory during subsequent trials. Upon completing a sufficient number of trials, the robot may be capable of navigating the trajectory in absence of the training input.