Data processing: database and file management or data structures – Database design – Data structure types
Patent
1997-08-21
1999-02-09
Amsbury, Wayne
Data processing: database and file management or data structures
Database design
Data structure types
707101, 707104, G06F 1730
Patent
active
058707522
ABSTRACT:
Techniques for maintaining an approximate histogram of a relation in a database, in the presence of updates to the relation. The histogram includes a number of subsets, or "buckets," each representing at least one possible value of an attribute of the relation. Each of the subsets has a count associated therewith indicative of the frequency of occurrence of the corresponding value of the attribute. After an update to the relation, the counts associated with the subsets are compared to a threshold. If the count associated with a given subset exceeds the threshold, the given subset is separated at its median into two separate subsets. After the separation operation, the two subsets with the lowest counts are combined such that a constant number of subsets are maintained in the histogram, if the total combined count of the subsets does not exceed the threshold. If no two subsets have a total combined count which does not exceed the threshold, the histogram is recomputed from a random sample of the relation. The invention substantially reduces the number of times the histogram must be recomputed from the random sample, and is particularly well-suited for use with approximate equi-depth and compressed histograms.
REFERENCES:
patent: 4956774 (1990-09-01), Shibamiya et al.
patent: 5542089 (1996-07-01), Lindsay et al.
patent: 5630120 (1997-05-01), Vachey
patent: 5675786 (1997-10-01), McKee et al.
patent: 5689696 (1997-11-01), Gibbons et al.
patent: 5692171 (1997-11-01), Andres
V. Poosala, Y. Ioannidis, P. Haas and E. Shekita, "Improved histograms for selectivity estimation of range predicates," Proc. of ACM Sigmod Conf., pp. 294-305, Jun. 1996.
N.Ramesh et al., Thresholding based on histogram approximation, IEE Proc.-Vis.Image Signal Process;I vol. 142, No. 5, 271-279, Oct. 1995.
Prasanna K. Sahoo et al., Threshold selection based on histogram modeling, IEEE 1992 and 351-356, Aug. 1992.
Yannis E.loannidis et al., Optimal Histograms for Limiting Worst-Case Error Propagation in the Size of Join Results, ACM Transactions on Database Systems, vol. 18, No. 4, and 709-748, Dec. 1993.
Viswanath Poosala et al., Balancing Histogram Optimality and Practicality for Query Result Size Estimation, ACM Sigmod 1995 and 233-244, Jun. 1995.
Gibbons Phillip B.
Matias Yossi
Poosala Viswanath
Witkowski Andrew
Amsbury Wayne
Channavajjala Srirama
Lucent Technologies - Inc.
NCR Corporation
LandOfFree
Incremental maintenance of an approximate histogram in a databas does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Incremental maintenance of an approximate histogram in a databas, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Incremental maintenance of an approximate histogram in a databas will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1960181