Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2007-09-18
2007-09-18
Mofiz, Apu M. (Department: 2165)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000
Reexamination Certificate
active
10383182
ABSTRACT:
A system and method determine numerical representations for categorical data fields by taking advantage of the redundancy of the data records to allow automatic discovery of an order of the categories. A categorical data field is recoded by creating separate tables for each numerical data field occurring in the data records. The separate tables are sorted according to the numerical values of the respective data fields. The recoding of the categories is performed based on the average sort order of occurrences of the category in a specific sorted table. The standard deviation of the numerical codes provided by the categories is calculated for each of the separate recoding tables. The recoding table with the maximum standard deviation is selected as the recoding table to perform the recoding of the categories contained in the respective categorical data field of the data records. A plausibility check is performed for the selected recoding table by excluding the numerical data field that has formed the basis for the sorting of the respective table and recreating the recoding table from the data records. The resulting recoding table and the original recoding table are compared. Resulting recoding tables that are similar indicate a high level of confidence that the originally selected recoding table is optimal.
REFERENCES:
patent: 2003/0176931 (2003-09-01), Pednault et al.
H. Ralambondrainy, “A Conceptual Version of the K-Means Algorithm,” Pattern Recognition Letters 16, 1995, pp. 1147-1157.
Arning Andreas
Lingenfelder Christoph
Meyer Gregor
Roller Dieter
Wohland Swen
International Business Machines - Corporation
Kassatly Samuel A.
Mofiz Apu M.
LandOfFree
System and method for determining numerical representations... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and method for determining numerical representations..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for determining numerical representations... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3788238