Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2006-08-15
Pardo, Thuy N. (Department: 2165)
C707S793000
active
07092956
ABSTRACT:
A system to load data in a data warehouse includes reception of a plurality of records, determination, for each of the plurality of records, of values representing differences between a record and each other of the plurality of records, identification of at least two of the plurality of records as duplicates based on a determined value representing a difference between the two records, and storage of the two records in the data warehouse in association with a same identifier. Determination of the values may include determination, for each of a first plurality of data fields of the record, of a first value representing a difference between data specified in the data field and data specified in a respective one of a second plurality of data fields of one of the other of the plurality of records, determination, for each of the second plurality of data fields, of a second value representing a difference between data specified in the data field and data specified in a respective one of the first plurality of data fields, and determination of a third value representing a difference between the record and the one of the other of the plurality of records based on the determined first and second values.
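The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract names no difference metric, combination rule, or threshold, so the use of `difflib.SequenceMatcher`, the averaging of the first and second values, the `threshold` of 0.2, and all function and field names here are assumptions made purely for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def field_difference(a: str, b: str) -> float:
    """Difference between two field values in [0, 1]; 0.0 means identical.

    Assumption: 1 - difflib similarity ratio. The abstract names no metric.
    """
    return 1.0 - SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def record_difference(rec_a: dict, rec_b: dict, fields: list) -> float:
    """Record-level ("third") value built from per-field first and second values."""
    # First values: each field of rec_a against the respective field of rec_b.
    first = [field_difference(rec_a[f], rec_b[f]) for f in fields]
    # Second values: each field of rec_b against the respective field of rec_a.
    # SequenceMatcher.ratio() can depend on argument order, so these may differ.
    second = [field_difference(rec_b[f], rec_a[f]) for f in fields]
    # Third value: assumed here to be the mean of all first and second values.
    return (sum(first) + sum(second)) / (2 * len(fields))


def assign_identifiers(records: list, fields: list, threshold: float = 0.2) -> list:
    """Return one identifier per record; duplicates share the same identifier."""
    parent = list(range(len(records)))  # union-find forest over record indices

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # Compare every record with each other record.
    for i, j in combinations(range(len(records)), 2):
        if record_difference(records[i], records[j], fields) <= threshold:
            parent[find(j)] = find(i)  # merge the two duplicate groups

    return [find(i) for i in range(len(records))]


if __name__ == "__main__":
    # Hypothetical sample records, invented for this sketch.
    sample = [
        {"name": "John Smith", "city": "Boston"},
        {"name": "Jon Smith", "city": "Boston"},
        {"name": "Maria Garcia", "city": "Denver"},
    ]
    ids = assign_identifiers(sample, ["name", "city"])
    for identifier, record in zip(ids, sample):
        print(identifier, record)
```

In this sketch the first two sample records end up with the same identifier while the third keeps its own, which mirrors storing duplicates "in association with a same identifier". Comparing every pair is quadratic in the number of records; a real warehouse load would likely narrow the candidate pairs first, but the abstract does not say how.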
REFERENCES:
patent: 5675785 (1997-10-01), Hall et al.
patent: 5680611 (1997-10-01), Rail et al.
patent: 5799302 (1998-08-01), Johnson et al.
patent: 5848405 (1998-12-01), Norcott
patent: 5974441 (1999-10-01), Rogers et al.
patent: 5999936 (1999-12-01), Pattison et al.
patent: 6167405 (2000-12-01), Rosensteel et al.
patent: 6292880 (2001-09-01), Mattis et al.
patent: 6397214 (2002-05-01), Rogers
patent: 6430545 (2002-08-01), Honarvar et al.
patent: 6457006 (2002-09-01), Gruenwald
patent: 6651055 (2003-11-01), Kilmer et al.
patent: 2002/0087516 (2002-07-01), Cras et al.
Saroia et al., “Production Datawarehouse and Software Toolset to support Productivity Improvement Activities”, IEEE, 1999, pp. 183-187.
Bhowmick et al., “Cost-benefit analysis of Web bag in a Web warehouse”, IEEE, 1999, pp. 1-9.
SmartSoft Datatech, “Match IT Database Deduplication and Merge Purge Software”, http://www.smartsoftusa.com/matchit.html (download Nov. 2, 2001) 3 pgs.
TrueData International, “TrueData International specialists in Data Management and database deduplication software”, http://www.truedata.co.uk/products.htm (download Sep. 13, 2001) 3 pgs.
DataFlux, “Case Studies: Eliminating Redundance”, © 2001 DataFlux Corporation, http://www.dataflux.com/customers/redundancy.asp (download Nov. 1, 2001) 2 pgs.
Batec, “Dedupe”, http://www.batec.com/dedup/dedup.html (download Nov. 1, 2001) 2 pgs.
Buckley Maschoff & Talwalkar LLC
General Electric Capital Corporation