Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2006-01-31
Amsbury, Wayne (Department: 2161)
C707S793000
active
06993516
ABSTRACT:
A system, method and computer readable medium for sampling data from a relational database are disclosed, where an information processing system chooses rows from a table in a relational database for sampling, wherein data values are arranged into rows, rows are arranged into pages, and pages are arranged into tables. Pages are chosen for sampling according to a probability P and rows in a selected page are chosen for sampling according to a probability R, so that the overall probability of choosing a row for sampling is Q=PR. The probabilities P and R are based on the desired precision of estimates computed from a sample, as well as processing speed. The probabilities P and R are further based on either catalog statistics of the relational database or a pilot sample of rows from the relational database.
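The two-stage selection described in the abstract can be illustrated with a minimal sketch. It assumes a table is modeled as a list of pages, each page being a list of rows, and that pages and rows are selected by independent coin flips with probabilities P and R. The function names `bilevel_bernoulli_sample` and `estimate_row_count` are hypothetical and do not come from the patent, and the sketch does not show how P and R would be derived from catalog statistics or a pilot sample.

```python
import random


def bilevel_bernoulli_sample(pages, p, r, seed=None):
    """Sample rows from a table modeled as a list of pages (lists of rows).

    Each page is kept with probability p; each row of a kept page is then
    kept with probability r, so any given row is kept with probability p * r.
    This in-memory model is an assumption made for illustration only.
    """
    if not (0.0 < p <= 1.0 and 0.0 < r <= 1.0):
        raise ValueError("p and r must lie in the interval (0, 1]")
    rng = random.Random(seed)
    sample = []
    for page in pages:
        # Page-level coin flip: a rejected page is never read row by row.
        if rng.random() < p:
            for row in page:
                # Row-level coin flip inside a selected page.
                if rng.random() < r:
                    sample.append(row)
    return sample


def estimate_row_count(sample_size, p, r):
    """Scale a sample size up by 1 / Q, where Q = p * r."""
    return sample_size / (p * r)


if __name__ == "__main__":
    # Hypothetical table: 100 pages of 10 rows each.
    table = [[(page_no, slot) for slot in range(10)] for page_no in range(100)]
    rows = bilevel_bernoulli_sample(table, p=0.5, r=0.4, seed=42)
    print(len(rows), estimate_row_count(len(rows), 0.5, 0.4))
```

For a fixed overall rate Q, a smaller P with a larger R reads fewer pages, while a larger P with a smaller R spreads the sample across more pages. This is the speed-versus-precision trade-off the abstract refers to.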
REFERENCES:
patent: 5675786 (1997-10-01), McKee et al.
patent: 5878426 (1999-03-01), Plasek et al.
patent: 5890150 (1999-03-01), Ushijima et al.
patent: 5950189 (1999-09-01), Cohen et al.
patent: 5978788 (1999-11-01), Castelli et al.
patent: 6067542 (2000-05-01), Carino, Jr.
patent: 6182061 (2001-01-01), Matsuzawa et al.
patent: 6223171 (2001-04-01), Chaudhuri et al.
patent: 6278989 (2001-08-01), Chaudhuri et al.
patent: 6301575 (2001-10-01), Chadha et al.
patent: 6363371 (2002-03-01), Chaudhuri et al.
patent: 6374251 (2002-04-01), Fayyad et al.
patent: 6493637 (2002-12-01), Steeg
patent: 6532458 (2003-03-01), Chaudhuri et al.
patent: 2001/0000536 (2001-04-01), Tarin
patent: 2002/0077968 (2002-06-01), Kaniwa et al.
patent: 2002/0087518 (2002-07-01), Ellis et al.
patent: 2002/0198863 (2002-12-01), Anjur et al.
Chaudhuri et al., “Random sampling for histogram construction: how much is enough?” (ABSTRACT), 1998 ACM SIGMOD Int'l Conference on Management of Data, Seattle, WA, USA, Jun. 1-4, 1998, vol. 27, No. 2, pp. 436-447.
Chang, Lee and Chang, Sue-An, “Utilizing Page-Level Join Index for Optimization in Parallel Join Execution,” IEEE Transactions on Knowledge and Data Engineering, vol. 7, No. 6, Dec. 1995.
Haas Peter Jay
Lohman Guy Maring
Pirahesh Mir Hamid
Simmen David Everett
Singh Ashutosh Vir Vikram
Amsbury Wayne
Fleit Kain Gibbons Gutman Bongini & Bianco P.L.
Gutman Jose
Efficient sampling of a relational database