Efficient sampling of a relational database

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000

active

06993516

ABSTRACT:
A system, method and computer readable medium for sampling data from a relational database are disclosed, where an information processing system chooses rows from a table in a relational database for sampling, wherein data values are arranged into rows, rows are arranged into pages, and pages are arranged into tables. Pages are chosen for sampling according to a probability P and rows in a selected page are chosen for sampling according to a probability R, so that the overall probability of choosing a row for sampling is Q=PR. The probabilities P and R are based on the desired precision of estimates computed from a sample, as well as processing speed. The probabilities P and R are further based on either catalog statistics of the relational database or a pilot sample of rows from the relational database.
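
The two-stage scheme described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: it assumes a table is modeled as a list of pages, each page a list of numeric row values, and the names `bilevel_bernoulli_sample` and `estimate_sum` are hypothetical. Each page is kept with probability P, each row on a kept page with probability R, so every row ends up in the sample with probability Q = PR. The scale-up estimator (dividing the sample sum by Q) is a standard way to estimate a total from such a sample and is included only to show why Q matters. The sketch does not cover how P and R would be chosen from catalog statistics or a pilot sample.

```python
import random


def bilevel_bernoulli_sample(pages, p, r, rng=None):
    """Sample rows from a table modeled as a list of pages (lists of rows).

    Assumption for illustration: each page is kept independently with
    probability p, and each row on a kept page is kept independently with
    probability r, so a row's overall inclusion probability is q = p * r.
    """
    if not (0.0 < p <= 1.0 and 0.0 < r <= 1.0):
        raise ValueError("p and r must lie in (0, 1]")
    rng = rng or random.Random()
    sample = []
    for page in pages:
        # Page-level decision: a skipped page is never read row by row.
        if rng.random() >= p:
            continue
        for row in page:
            # Row-level decision within a selected page.
            if rng.random() < r:
                sample.append(row)
    return sample


def estimate_sum(sample, p, r):
    """Scale the sample sum by 1/q to estimate the sum over the full table."""
    q = p * r
    return sum(sample) / q


if __name__ == "__main__":
    # Hypothetical table: 100 pages of 50 rows, each row holding the value 1.
    table = [[1] * 50 for _ in range(100)]
    rows = bilevel_bernoulli_sample(table, p=0.5, r=0.2, rng=random.Random(7))
    print(len(rows), estimate_sum(rows, p=0.5, r=0.2))
```

For a fixed Q, a smaller P with a larger R touches fewer pages and so reads less data, while a larger P with a smaller R spreads the sample over more pages. That is the trade-off between processing speed and precision that the abstract refers to.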

REFERENCES:
patent: 5675786 (1997-10-01), McKee et al.
patent: 5878426 (1999-03-01), Plasek et al.
patent: 5890150 (1999-03-01), Ushijima et al.
patent: 5950189 (1999-09-01), Cohen et al.
patent: 5978788 (1999-11-01), Castelli et al.
patent: 6067542 (2000-05-01), Carino, Jr.
patent: 6182061 (2001-01-01), Matsuzawa et al.
patent: 6223171 (2001-04-01), Chaudhuri et al.
patent: 6278989 (2001-08-01), Chaudhuri et al.
patent: 6301575 (2001-10-01), Chadha et al.
patent: 6363371 (2002-03-01), Chaudhuri et al.
patent: 6374251 (2002-04-01), Fayyad et al.
patent: 6493637 (2002-12-01), Steeg
patent: 6532458 (2003-03-01), Chaudhuri et al.
patent: 2001/0000536 (2001-04-01), Tarin
patent: 2002/0077968 (2002-06-01), Kaniwa et al.
patent: 2002/0087518 (2002-07-01), Ellis et al.
patent: 2002/0198863 (2002-12-01), Anjur et al.
Chaudhuri et al., “Random sampling for histogram construction: how much is enough?” (ABSTRACT), 1998 ACM SIGMOD Int'l Conference on Management of Data, Seattle, WA, USA, Jun. 1-4, 1998, vol. 27, No. 2, pp. 436-447.
Chang, Lee and Chang, Sue-An, “Utilizing Page-Level Join Index for Optimization in Parallel Join Execution,” IEEE Transactions on Knowledge and Data Engineering, vol. 7, No. 6, Dec. 1995.

