Patient rule induction method on large disk resident data...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000, C705S002000, C705S003000

active

09470444

ABSTRACT:
The present invention relates to analysis of large, disk resident data sets using a Patient Rule Induction Method (PRIM) in a computer system wherein a relational data table is initially received. The relational data table includes continuous attributes, discrete attributes, a meta parameter and a cost attribute. The cost attribute represents cost output values based on continuous attribute values and discrete attribute values as inputs. A hyper-rectangle is then formed which encloses a multi-dimensional space defined by the continuous attribute values and the discrete attribute values. The continuous attribute values and the discrete attribute values are represented as points within the multi-dimensional space. A plurality of points along edges of the hyper-rectangle are then removed based on an average of the cost output value from the plurality of points until a count of the points enclosed within the hyper-rectangle equals the meta parameter. Discrete attribute values and continuous attribute values which were removed from the hyper-rectangle are next added along edges of the hyper-rectangle until a sum of the cost output value over the multi-dimensional space enclosed by the hyper-rectangle changes. In a further embodiment, a parallel architecture computer system calculates the cost attribute average values over the plurality of points enclosed by the hyper-rectangle in parallel. The invention analyzes large disk resident data sets without having to load the data set into main memory and can be practiced on a parallel computer architecture or a symmetric multi-processor architecture to improve performance.
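
The removal ("peeling") step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the data fits in memory, handles continuous attributes only, omits the later step of adding points back along the edges, and does not attempt the disk-resident or parallel processing the invention claims. The function name `peel`, the peeling fraction `alpha`, and the parameter `min_points` (standing in for the meta parameter) are hypothetical names chosen for illustration.

```python
from typing import List, Sequence, Tuple

Box = List[Tuple[float, float]]  # one (low, high) interval per attribute


def peel(points: Sequence[Sequence[float]], costs: Sequence[float],
         min_points: int, alpha: float = 0.1) -> Tuple[Box, float]:
    """Greedily shrink a hyper-rectangle to raise the mean cost inside it.

    Assumption: all data is in memory and every attribute is continuous.
    Returns the final box and the mean cost of the points it encloses.
    """
    dims = len(points[0])
    # Start with the box that encloses every point.
    box = [(min(p[d] for p in points), max(p[d] for p in points))
           for d in range(dims)]
    idx = list(range(len(points)))  # rows currently inside the box

    def mean(rows: List[int]) -> float:
        return sum(costs[i] for i in rows) / len(rows)

    while len(idx) > min_points:
        k = max(1, int(alpha * len(idx)))  # target number of rows to peel
        best = None  # (mean cost, dimension, surviving rows)
        for d in range(dims):
            order = sorted(idx, key=lambda i: points[i][d])
            low_cut = points[order[k - 1]][d]   # k-th smallest value
            high_cut = points[order[-k]][d]     # k-th largest value
            candidates = (
                [i for i in idx if points[i][d] > low_cut],   # peel low edge
                [i for i in idx if points[i][d] < high_cut],  # peel high edge
            )
            for keep in candidates:
                # Never shrink below the requested point count.
                if len(keep) < max(1, min_points):
                    continue
                m = mean(keep)
                if best is None or m > best[0]:
                    best = (m, d, keep)
        if best is None:  # no admissible peel remains
            break
        _, d, idx = best
        vals = [points[i][d] for i in idx]
        box[d] = (min(vals), max(vals))  # tighten the peeled dimension
    return box, mean(idx)
```

Each pass tries peeling a thin slice from the low and high edge of every dimension and keeps the single peel that leaves the highest average cost, which is the "patient" aspect of the method. Tied values are removed together, so a peel may drop more than `k` points, and the loop stops at the first box holding no more than `min_points` points or when no peel can stay at or above that count.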

REFERENCES:
patent: 5787274 (1998-07-01), Agrawal et al.
patent: 5813019 (1998-09-01), Van de Vanter
patent: 5960435 (1999-09-01), Rathmann et al.
patent: 5991728 (1999-11-01), DeBusk et al.
patent: 6229918 (2001-05-01), Toyama
patent: 6307965 (2001-10-01), Aggarwal et al.
patent: 6563952 (2003-05-01), Srivastava et al.
patent: WO9406088 (1994-03-01), None
patent: WO9726609 (1997-07-01), None
Xindong Wu, “Induction by Attribute Elimination,” IEEE Transactions on Knowledge and Data Engineering, vol. 11, issue 5, Sep./Oct. 1999.
Jorme et al., “Bump Hunting in High-Dimensional Data,” Kluwer Academic Publishers, Hingham, MA, USA, pp. 123-143, 1999.
Friedman et al., “Bump Hunting in High-Dimensional Data,” Dept. of Statistics and Stanford Linear Accelerator Center, Stanford University, Stanford, CA 94305, Oct. 28, 1998, pp. 1-31.
Mehta et al., “SLIQ: A Fast Scalable Classifier for Data Mining,” IBM Almaden Research Center, 650 Harry Road, San Jose, CA 95120.
Srivastava et al., “An Efficient, Scalable, Parallel Classifier for Data Mining,” Dept. of Computer Science, University of Minnesota, 4-192 EE/CS Bldg., 200 Union St. SE, Minneapolis, MN 55455.
