Patent
1990-06-21
1992-05-12
MacDonald, Allen R.
395/906, G06F 15/18
active
051134829
ABSTRACT:
An object, such as a robot, is located at an initial state in a finite state space area and moves under the control of the unsupervised neural network model of the invention. The network instructs the object to move in one of several directions from the initial state. Upon reaching another state, the model again instructs the object to move in one of several directions. These instructions continue until either: a) the object has completed a cycle by ending up back at a state it has been to previously during this cycle, or b) the object has completed a cycle by reaching the goal state. If the object ends up back at a state it has been to previously during this cycle, the neural network model ends the cycle and immediately begins a new cycle from the present location. When the object reaches the goal state, the neural network model learns that this path is productive towards reaching the goal state, and is given delayed reinforcement in the form of a "reward". Upon reaching a state, the neural network model calculates a level of satisfaction with its progress towards reaching the goal state. If the level of satisfaction is low, the neural network model is more likely to override what has been learned thus far and deviate from a path known to lead to the goal state to experiment with new and possibly better paths. If the level of satisfaction is high, the neural network model is much less likely to experiment with new paths. The object is guaranteed to eventually find the best path to the goal state from any starting location, assuming that the level of satisfaction does not exceed a threshold point where learning ceases.
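The cycle structure described in the abstract can be illustrated with a small sketch. This is not the patented network: it substitutes a plain table of per-state move weights for the neural network, and the grid size, goal position, reward decay, satisfaction measure, and exploration rule are all assumptions chosen for illustration. Every name in it (`run_cycle`, `choose`, `train`, `greedy_path`, `DECAY`, `MAX_SATISFACTION`) is hypothetical. The abstract does not say where a new cycle starts after the goal is reached, so the sketch simply returns to the starting state.

```python
import random

# Hypothetical 4x4 grid; states are (row, col). All constants are assumptions.
SIZE = 4
GOAL = (3, 3)
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
DECAY = 0.5             # assumed: credit halves for each step further from the goal
MAX_SATISFACTION = 0.9  # assumed cap so that some experimentation always remains


def step(state, move):
    """Apply a move, clamping to the grid so the object stays in the state space."""
    r = min(max(state[0] + move[0], 0), SIZE - 1)
    c = min(max(state[1] + move[1], 0), SIZE - 1)
    return (r, c)


def choose(weights, state, satisfaction, rng):
    """Pick a move index; low satisfaction makes a random experiment more likely."""
    if rng.random() < 1.0 - satisfaction:
        return rng.randrange(len(MOVES))
    return max(range(len(MOVES)), key=lambda a: weights[state][a])


def run_cycle(weights, start, rng):
    """Run one cycle from start; return (end_state, reached_goal)."""
    state, visited, path = start, {start}, []
    while True:
        # Assumed satisfaction measure: the strongest learned weight at this state.
        satisfaction = min(max(weights[state]), MAX_SATISFACTION)
        action = choose(weights, state, satisfaction, rng)
        path.append((state, action))
        state = step(state, MOVES[action])
        if state == GOAL:
            # Delayed reinforcement: reward every move on the successful path,
            # keeping the best credit seen so far for each (state, move) pair.
            for i, (s, a) in enumerate(reversed(path)):
                weights[s][a] = max(weights[s][a], DECAY ** i)
            return state, True
        if state in visited:
            return state, False  # back at a previously visited state: cycle ends
        visited.add(state)


def train(start, cycles, seed=0):
    """Run many cycles and return the learned weight table."""
    rng = random.Random(seed)
    weights = {(r, c): [0.0] * len(MOVES)
               for r in range(SIZE) for c in range(SIZE)}
    state = start
    for _ in range(cycles):
        end, reached = run_cycle(weights, state, rng)
        # A failed cycle restarts from the present location; after reaching
        # the goal the object is returned to the start (an assumption).
        state = start if reached else end
    return weights


def greedy_path(weights, start, limit=SIZE * SIZE):
    """Follow the highest-weight move from each state, without experimenting."""
    state, path = start, [start]
    while state != GOAL and len(path) <= limit:
        action = max(range(len(MOVES)), key=lambda a: weights[state][a])
        state = step(state, MOVES[action])
        path.append(state)
    return path


if __name__ == "__main__":
    learned = train(start=(0, 0), cycles=5000)
    print(greedy_path(learned, (0, 0)))
```

In this sketch a move's weight records how close to the goal it was on the best successful path seen so far, so following the highest weight from any rewarded state leads to the goal. The sketch makes no claim to reproduce the convergence guarantee stated in the abstract.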
REFERENCES:
patent: 4884216 (1989-11-01), Kuperstein
patent: 4933871 (1990-06-01), Desieno
A Linguistic Self-Organizing Process Controller; T. J. Procyk et al.; Automatica; vol. 15; pp. 15-30; 1979.
Learning and Sequential Decision Making; A. G. Barto, R. S. Sutton, C. J. C. H. Watkins; COINS Technical Report 89-95; Sep. 1989.
International Business Machines Corporation
MacDonald, Allen R.
Rose, Curtis G.