Data processing: database and file management or data structures – Database design – Data structure types
Patent
1997-02-10
1999-05-04
Black, Thomas G.
Data processing: database and file management or data structures
Database design
Data structure types
707 3, G06F 1730
Patent
active
058999868
ABSTRACT:
Methods for collecting query workload based statistics within a relational database management system (RDBMS) and for identifying columns for which statistics collection is to be performed. The novel system collects workload statistics that are dependent on multiple columns, rather than merely single columns. Multi-column statistic generation provides more accurate results for columns having correlated data, and therefore leads to better estimated cost analysis by an RDBMS optimizer. In one embodiment, a column duplicity factor is based on an analysis of distinct data rows, e.g., combinations of values within multiple columns, rather than rows of single columns. The novel system also collects separate statistics regarding the presence of null data within the rows of a column group. Separate null data statistics improve the determined result carnality used by the RDBMS optimizer because the cardinality of a relational operation's result is generally determined by the number of input rows with non-null data. The novel system includes an RDBMS optimizer that automatically identifies column groups and column groups on which workload statistics are to be generated. The parameters within a query (e.g., equi-joins, equi-selections, and projections) are analyzed by the optimizer to automatically identify the column groups. The identified columns are then registered within in a system catalog. The registered column groups are read by statistics generation procedures to identify those column groups for which workload statistics are to be collected.
REFERENCES:
patent: 5598559 (1997-01-01), Chaudhuri
"Performing Group-By before Join," Yan et al., Proceedings of the 10th International Conference on Data Engineering, Houston, TX, USA, pp. 89-100, Feb. 14-18, 1994.
Alam Hosain T.
Black Thomas G.
Oracle Corporation
LandOfFree
Methods for collecting query workload based statistics on column does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Methods for collecting query workload based statistics on column, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Methods for collecting query workload based statistics on column will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-1867168