Data processing: database and file management or data structures – Database and file access
Reexamination Certificate
2007-10-31
2010-10-26
Lewis, Cheryl (Department: 2167)
Data processing: database and file management or data structures
Database and file access
C707S802000
Reexamination Certificate
active
07822730
ABSTRACT:
Most recent research of scalable inductive learning on very large streaming dataset focuses on eliminating memory constraints and reducing the number of sequential data scans. However, state-of-the-art algorithms still require multiple scans over the data set and use sophisticated control mechanisms and data structures. There is discussed herein a general inductive learning framework that scans the dataset exactly once. Then, there is proposed an extension based on Hoeffding's inequality that scans the dataset less than once. The proposed frameworks are applicable to a wide range of inductive learners.
REFERENCES:
patent: 6772179 (2004-08-01), Chen et al.
patent: 7337161 (2008-02-01), Fan et al.
patent: 7373473 (2008-05-01), Bukowski et al.
patent: 2005/0203930 (2005-09-01), Bukowski et al.
Shafer, John, et al, “Sprint: A Scalable Parallel Classifier for Data Mining,” Proceedings of the 22nd VLDB Conference Mumbai (Bombay), India, 1996, pp. 1-12.
Hellerstein, Joseph M., et al., “Online Aggregation,” ACM SIGMOD International Conference on Management of Data, May 1997, Tucson, Arizona, pp. 1-12.
Domingos, Pedro, et al., “Learning from Infinite Data in Finite Time,” Department of Computer Science and Engineering University of Washington, Seattle, WA 98185-2350, USA.
Gehrke, Johannes, et al., “Boat—Optimistic Decision Tree Construction,” SIGMOD '99, Philadelphia, PA, Copyright ACM 1999 1-58113-084-8/99/05, pp. 169-180.
Gehrke, Johannes, et al., “RainForest—A Framework for Fast Decision Tree Construction of Large Datasets,” Proceedings of the 24th VLDB Conference, New York, USA 1998.
Freund, Yoav, et al., “A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting,” Proceedings of the Second European Conference on Computational Learning Theory, Barcelona, Mar. 1995, pp. 1-34.
Fan, Wei, et al., “A Framework for Scalable Cost-sensitive Learning Based on Combining Probabilities and Benefits,” IBM T.J. Watson Research; Department of Computer Science, Columbia University.
Breiman, Leo, “Bagging Predictors,” Department of Statistics, University of California, Berkeley, California 94720, pp. 1-19.
Fan Wei
Wang Haixun
Yu Philip S.
Ference & Associates LLC
International Business Machines - Corporation
Lewis Cheryl
LandOfFree
Systems and methods for sequential modeling in less than one... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Systems and methods for sequential modeling in less than one..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Systems and methods for sequential modeling in less than one... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4152637