Method and apparatus for improved reward-based learning...

Data processing: artificial intelligence – Machine learning

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

08060454

ABSTRACT:
The present invention is a method and an apparatus for reward-based learning of management policies. In one embodiment, a method for reward-based learning includes receiving a set of one or more exemplars, where at least two of the exemplars comprise a (state, action) pair for a system, and at least one of the exemplars includes an immediate reward responsive to a (state, action) pair. A distance measure between pairs of exemplars is used to compute a Non-Linear Dimensionality Reduction (NLDR) mapping of (state, action) pairs into a lower-dimensional representation, thereby producing embedded exemplars, wherein one or more parameters of the NLDR are tuned to minimize a cross-validation Bellman error on a holdout set taken from the set of one or more exemplars. The mapping is then applied to the set of exemplars, and reward-based learning is applied to the embedded exemplars to obtain a learned management policy.

REFERENCES:
patent: 2007/0203862 (2007-08-01), Sekiai et al.
Belkin, Mkihail and Partha Niyogi. “Laplacian Eigenmaps for Dimensionality Reduction and Data Representation” [Online] Downloaded Oct. 21, 2010. Neural Computation 15, 1373-1396 2003.
Bengio, Yoshua, Jean Francois Paiment, and Pascal Vincent “Out-ofSample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering” [Online] Downloaded Oct. 26, 2010. Jul. 25, 2003.
Si, Jennie and Yu-Tsung Wang, “Neuro-Dynamic Programming Based on Self-Organized Patterns”, Proceedings of the 1999 IEEE International Symposium on Intelligent Control/Intelligent Systems and Seniotics, 1999, [Online] Downloaded Apr. 20, 2011, http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=796641.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for improved reward-based learning... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for improved reward-based learning..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for improved reward-based learning... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4301685

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.