Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2007-11-06
2007-11-06
Mofiz, Apu (Department: 2161)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000
Reexamination Certificate
active
10293485
ABSTRACT:
A method for distributing and sorting data among a plurality of nodes is described herein. After receiving a portion of a data set (e.g., a database), each node sorts its portion and estimates a partitioning of the sorted dataset among the nodes based in part on its own sorted data portion. Each node then provides a representation of its estimated partition to a master node. The master node, using the provided estimated partitions, determines a tentative partitioning and submits the tentative partitioning to each node. Each node then determines the effect the tentative partitioning using its data portion. If the effect is acceptable for each node, the tentative partitioning plan is used to partition the data. Otherwise, the tentative partitioning plan is repeatedly revised by the master node and considered by the nodes having data portions until an acceptable or optimum partitioning is determined. Each node then distributes data from its data portion that falls outside the partition assigned to the node to the appropriate node. Upon receipt of this data, each node can perform a merge sort to add the received data to the previously sorted data portion at the node.
REFERENCES:
patent: 4543630 (1985-09-01), Neches
patent: 4860201 (1989-08-01), Stolfo et al.
patent: 4870568 (1989-09-01), Kahle et al.
patent: 4925311 (1990-05-01), Neches et al.
patent: 5006978 (1991-04-01), Neches
patent: 5146590 (1992-09-01), Lorie et al.
patent: 5276899 (1994-01-01), Neches
patent: 5303383 (1994-04-01), Neches et al.
patent: 5423037 (1995-06-01), Hvasshovd
patent: 5471622 (1995-11-01), Eadline
patent: 5495606 (1996-02-01), Borden et al.
patent: 5551027 (1996-08-01), Choy et al.
patent: 5555404 (1996-09-01), Torbjørnsen et al.
patent: 5655080 (1997-08-01), Dias et al.
patent: 5732400 (1998-03-01), Mandler et al.
patent: 5745746 (1998-04-01), Jhingran et al.
patent: 5878408 (1999-03-01), Van Huben et al.
patent: 5884299 (1999-03-01), Ramesh et al.
patent: 5890159 (1999-03-01), Sealby et al.
patent: 5897638 (1999-04-01), Lasser et al.
patent: 5970495 (1999-10-01), Baru et al.
patent: 5983228 (1999-11-01), Kobayashi et al.
patent: 6006249 (1999-12-01), Leong
patent: 6026394 (2000-02-01), Tsuchida et al.
patent: 6081801 (2000-06-01), Cochrane et al.
patent: 6266804 (2001-07-01), Isman
patent: 6311169 (2001-10-01), Duhon
patent: 6427148 (2002-07-01), Cossock
Eike Schallehn et al., “Advanced Grouping and Aggregation for Data Integration,” Department of Computer Science, Paper ID: 222, pp. 1-16, no date.
Vincent Coppola, “Killer APP,” Men's Journal, vol. 12, No. 3, Apr. 2003, pp. 86-90.
Eike Schallehn et al., “Extensible and Similarity-based Grouping for Data Integration,” Department of Computer Science, pp. 1-17, 2002.
Rohit Ananthakrishna et al., “Eliminating Fuzzy Duplicates in Data Warehouses,” 12 pages, 2002.
Peter Christen et al., “Parallel Computing Techniques for High-Performance Probabilistic Record Linkage,” Data Mining Group, Australian National University, Epidemiology and Surveillance Branch, Project web page: http://datamining.anu.edu.au/linkage.html, 2002, pp. 1-11.
Peter Christen et al., “Parallel Techniques for High-Performance Record Linkage (Data Matching),” Data Mining Group, Australian National University, Epidemiology and Surveillance Branch, Project web page: http://datamining.anu.edu.au/linkage.html, 2002, pp. 1-27.
Peter Christen et al., “High-Performance Computing Techniques for Record Linkage,” Data Mining Group, Australian National University, Epidemiology and Surveillance Branch, Project web page: http://datamining.anu.edu.au/linkage.html, 2002, pp. 1-14.
William E. Winkler, “Matching and Record Linkage,” U.S. Bureau of the Census, pp. 1-38, no date.
Peter Christen et al., “High-Performance Computing Techniques for Record Linkage,” ANU Data Mining Group, Australian National University, Epidemiology and Surveillance Branch, Project web page: http://datamining.anu.edu.au/linkage.html, pp. 1-11, no date.
William E. Winkler, “The State of Record Linkage and Current Research Problems,” U.S. Bureau of the Census, 15 pages, No Date.
William E. Winkler, “Advanced Methods for Record Linkage,” Bureau of the Census, pp. 1-21, no date.
William E. Winkler, Frequency-Based Matching in Fellegi-Sunter Model of Record Linkage, Bureau Of The Census Statistical Research Division, Oct. 4, 2000, 14 pages.
William E. Winkler, “State of Statistical Data Editing And Current Research Problems,” Bureau Of The Census Statistical Reseach Division, 10 pages, no date.
The First Open ETL/EAI Software For The Real-Time Enterprise, Sunopsis, A New Generation ETL Tool, “Sunopsis™ v4 expedites integration between heterogeneous systems for Data WAREHOUSE, Data Mining, Business Intelligence, and OLAP projects,” <www.suopsis.com>, 6 pages, no date
Alan Dumas, “The ETL Market and Sunopsis™ v3 Business Intelligence, Data Warehouse & Datamart Projects,” 2002, Sunopsis, pp. 1-7.
Teradata Warehouse Solutions, “Teradata Database Technical Overview,” 2002, pp. 1-7.
WhiteCross White Paper, May 25, 2000, “wx/des-Technical Information,” pp. 1-36.
Teradata Alliance Solutions, “Teradata and Ab Initio,” pp. 1-2, 2001.
Peter Christen et al., The Australian National University, “Febri—Freely extensible biomedical record linkage,” Oct. 2002, pp. 1-67.
William E. Winkler, “Using the EM Algorithim for Weight Computation in the Fellegi-Sunter Model of Record Linkage,” Bureau Of The Census Statistical Research Division, Oct. 4, 2000, 12 pages.
William E. Winkler et al., “An Application Of The Fellegi-Sunter Model Of Record Linkage To The 1990 U.S. Decennial Census,” U.S. Bureau of the Census, pp. 1-22.
William E. Winkler, “Improved Decision Rules In The Fellegi-Sunter Model Of Record Linkage,” Bureau of the Census, pp. 1-13, no date.
Fritz Scheuren et al., “Recursive Merging and Analysis of Administrative Lists and Data,” U.S. Bureau of the Census, 9 pages, no date.
William E. Winkler, “Record Linkage Software and Methods for Merging Administrative Lists,” U.S. Bureau of the Census, Jul. 7, 2001, 11 pages.
Enterprises, Publishing and Broadcasting Limited, Acxiom-Abilitec, pp. 44-45, no date.
TransUnion, Credit Reporting System, Oct. 9, 2002, 4 pages, <http://www.transunion.com/content/page.jsp?id=/transunion/general/data/business/BusCre...>.
TransUnion, ID Verification & Fraud Detection, Account Acquisition, Account Management, Collection & Location Services, Employment Screening, Risk Management, Automotive, Banking-Saving & Loan, Credit Card Providers, Credit Unions, Energy & Utilities, Healthcare, Insurance, Investment, Real Estate, Telecommunications, Oct. 9, 2002, 46 pages, <http://www.transunion.com>.
White Paper An Introduction to OLAP Multidimensional Terminology and Technology, 20 pages, no date.
Bayliss David
Chapman Richard
Halliday Gavin
Hicks Nigel
Poulsen Ole
Hunton & Williams LLP
Mofiz Apu
Padmanabhan Kavita
Seisint, Inc.
LandOfFree
Method for sorting and distributing data among a plurality... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method for sorting and distributing data among a plurality..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for sorting and distributing data among a plurality... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3843855