Data exploration system and method

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000, C707S793000, C706S046000, C706S059000

active

06601058

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates to database, data warehouse, and data mart technology and, more particularly, to an improved system and method for exploring information relationships in data.
2. Discussion of Related Art
Modern computing databases hold extremely large quantities of data. Businesses often desire to discover information relationships in this data to make better-informed business decisions. In this regard, “data warehousing” is used to describe computing technologies used to discover relationships within a database, and “data mart” is used to describe technologies for a subject-specific data warehouse.
To date, data warehousing and data mart tools have been undesirable because of their high cost, both in infrastructure and human capital. Modern systems are effectively customized database applications. Consequently, exploring relationships usually involves the creation of new, custom queries and typically requires a management information systems (MIS) professional, or other programming personnel, to implement the query. If a user, for example, in a marketing department, wishes to investigate a potential new information relationship, he or she is often forced to cross department boundaries and almost invariably experiences undesirable delays. As a result, much of the data is underutilized: many relationships are never explored because the delay outweighs the benefit.
Moreover, because modern data warehouse systems are effectively customized database applications, they often inherit inefficiencies from the underlying database. These inefficiencies may be information related (e.g., inherently precluding certain lines of questioning because the application is tightly coupled to the database's schema) or performance related (e.g., the system may be optimized for a certain type of transactional access that does not perform well for the accesses involved in data warehousing queries).
More specifically, concerning performance-related issues, most systems rely on the relational data model (RDM). The performance of an RDM implementation is typically limited by its “access method.” Commercially available systems, for example, have their software logic rely on an access method (e.g., “B+tree”) that requires multiple accesses to storage (e.g., memory or disk) to obtain a given record. Some of the accesses are to reference structures that are used to effectively “point to” the data of interest (e.g., indices or hierarchies of linked lists). Sometimes, these reference structures can get so large that portions of the structure must reside on disk. Thus a given request for a database record may involve multiple disk storage requests. Moreover, the database operation algorithms are tightly bound to the access method. That is, the algorithm itself has been optimized to the access method and is thus dependent on the existence of the access method. Much of the literature on database performance explicitly or implicitly assumes the existence of such access methods.
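The cost described above can be made concrete with a small sketch. It is not a real B+tree and is not taken from any commercial system; it is a two-level index over a toy page store, and the names PagedStore, build, and lookup are hypothetical. It assumes that every page read counts as one storage access, and shows that fetching a single record touches the root reference page, a leaf reference page, and only then the data page.

```python
from bisect import bisect_right


class PagedStore:
    """Toy storage (assumption): each page read counts as one storage access."""

    def __init__(self):
        self.pages = {}
        self.reads = 0

    def write(self, page_id, content):
        self.pages[page_id] = content

    def read(self, page_id):
        self.reads += 1
        return self.pages[page_id]


def build(store, records, leaf_size=2):
    """Lay out records as data pages, leaf index pages, and one root page."""
    keys = sorted(records)
    separators, children = [], []
    for n, start in enumerate(range(0, len(keys), leaf_size)):
        chunk = keys[start:start + leaf_size]
        entries = {}
        for key in chunk:
            data_id = ("data", key)
            store.write(data_id, records[key])   # the record itself
            entries[key] = data_id               # reference that "points to" it
        leaf_id = ("leaf", n)
        store.write(leaf_id, entries)
        if n > 0:
            separators.append(chunk[0])          # smallest key of this leaf
        children.append(leaf_id)
    store.write("root", (separators, children))


def lookup(store, key):
    """Fetch one record; raises KeyError if the key is absent."""
    separators, children = store.read("root")                   # access 1
    leaf = store.read(children[bisect_right(separators, key)])  # access 2
    return store.read(leaf[key])                                # access 3


store = PagedStore()
build(store, {10: "a", 20: "b", 30: "c", 40: "d"})
record = lookup(store, 30)
accesses = store.reads
```

In this sketch two of the three reads go to reference structures rather than to the record, which is the overhead the passage attributes to such access methods.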
Aside from the above limitations, most commercial systems are limited to the actual data within the database. The systems cannot query other important data elements such as the schema, the metadata, or the data dictionary without significant custom programming. Consequently, significant knowledge, e.g., useful queries, is not reported or available for use within such systems.
The above difficulties are exacerbated in the context of data residing on disparate databases.
Alternative approaches have been attempted. Childs, for example, discusses set-theoretic approaches in Feasibility of a Set-Theoretic Data Structure: a General Structure Based on Reconstituted Definition of Relation, Information Processing 68, Edinburgh, 1968; Description of a Set-Theoretic Data Structure, Fall Joint Computer Conference, San Francisco, 1968; and Extended Set Theory: a General Model for Very Large, Distributed, Backend Information Systems. He is believed to have developed a system (STDS and XTDS) in which a user may express queries directly from a small set of set operators.
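As a rough illustration of what expressing a query directly with set operators can look like, the following sketch models each relation as a Python set of tuples and answers three questions with intersection, union, and difference. It does not reproduce STDS or XTDS; the relation names and rows are invented for the example.

```python
# Each relation is modelled as a set of (name, department) tuples.
# The rows below are made-up sample data.
newsletter = {("Ada", "marketing"), ("Bo", "marketing")}
purchasers = {("Bo", "marketing"), ("Cy", "sales")}

# "Who is on the newsletter list and has also purchased?"
both = newsletter & purchasers

# "Who appears in either relation?"
either = newsletter | purchasers

# "Who is on the newsletter list but has not purchased?"
only_newsletter = newsletter - purchasers
```

Each question is a single expression over whole relations, with no per-record navigation written by the user.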
SUMMARY
Preferred embodiments of the invention provide a system for, and method of, exploring relationships in data stored in a computer readable medium. A query is received having at least one operator chosen from a set of operators that includes relational operators and having at least one input and output associated with the operator and defined as a table having at least one domain having a type associated therewith. The query is transformed into a set program having at least one operation structure, corresponding to the operator and having logic for type-independently performing an operation, corresponding to the operator, and having a data relation structure, cooperating with the operation structure, for handling all data access and storage associated with the operation.
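The division of labor described in the summary can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class names DataRelation and SelectOperation, the OPERATIONS table, and the list-of-pairs query format are all hypothetical. The operation structure holds only the logic of the operator and never inspects domain types, while the data relation structure handles all data access and storage.

```python
class DataRelation:
    """Hypothetical data relation structure: owns all row access and storage."""

    def __init__(self, domains, rows=()):
        self.domains = tuple(domains)            # (name, type) pairs
        self._rows = [tuple(row) for row in rows]

    def position(self, name):
        return [n for n, _type in self.domains].index(name)

    def scan(self):
        return iter(self._rows)

    def append(self, row):
        self._rows.append(tuple(row))

    def rows(self):
        return list(self._rows)


class SelectOperation:
    """Hypothetical operation structure for a relational restriction.

    It never looks at the type of the domain it filters on.
    """

    def __init__(self, domain, predicate):
        self.domain = domain
        self.predicate = predicate

    def run(self, source):
        sink = DataRelation(source.domains)      # output is again a table
        column = source.position(self.domain)
        for row in source.scan():
            if self.predicate(row[column]):
                sink.append(row)
        return sink


# Assumed mapping from operator names in a query to operation structures.
OPERATIONS = {"select": SelectOperation}


def to_set_program(query):
    """Transform a query, given as (operator, arguments) pairs, into a program."""
    return [OPERATIONS[op](**args) for op, args in query]


def run(program, relation):
    for operation in program:
        relation = operation.run(relation)
    return relation


sales = DataRelation(
    [("region", str), ("units", int)],
    [("east", 5), ("west", 12), ("east", 20)],
)
query = [
    ("select", {"domain": "region", "predicate": lambda v: v == "east"}),
    ("select", {"domain": "units", "predicate": lambda v: v > 10}),
]
result = run(to_set_program(query), sales)
```

The same SelectOperation class serves both the string-typed and the integer-typed domain here, which is one way to read "type-independently performing an operation."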


REFERENCES:
patent: 4989132 (1991-01-01), Mellender et al.
patent: 5019961 (1991-05-01), Addesso et al.
patent: 5121494 (1992-06-01), Dias et al.
patent: 5193185 (1993-03-01), Lanter
patent: 5226111 (1993-07-01), Black et al.
patent: 5259066 (1993-11-01), Schmidt
patent: 5276870 (1994-01-01), Shan et al.
patent: 5301317 (1994-04-01), Lohman et al.
patent: 5307445 (1994-04-01), Dalal et al.
patent: 5325525 (1994-06-01), Shan et al.
patent: 5418942 (1995-05-01), Krawchuk et al.
patent: 5418943 (1995-05-01), Borgida et al.
patent: 5469568 (1995-11-01), Shiefer et al.
patent: 5481700 (1996-01-01), Thuraisingham
patent: 5495606 (1996-02-01), Borden et al.
patent: 5515531 (1996-05-01), Fujiwara et al.
patent: 5537590 (1996-07-01), Amado
patent: 5542073 (1996-07-01), Schiefer et al.
patent: 5546571 (1996-08-01), Shan et al.
patent: 5548749 (1996-08-01), Kroenke et al.
patent: 5548755 (1996-08-01), Leung et al.
patent: 5548770 (1996-08-01), Bridges
patent: 5550971 (1996-08-01), Brunner et al.
patent: 5555404 (1996-09-01), Torbjornsen et al.
patent: 5560006 (1996-09-01), Layden et al.
patent: 5574900 (1996-11-01), Huang et al.
patent: 5590319 (1996-12-01), Cohen et al.
patent: 5590321 (1996-12-01), Lin et al.
patent: 5596682 (1997-01-01), Yamazaki
patent: 5615341 (1997-03-01), Agrawal et al.
patent: 5630124 (1997-05-01), Coyle, Jr. et al.
patent: 5655080 (1997-08-01), Dias et al.
patent: 5655116 (1997-08-01), Kirk et al.
patent: 5664171 (1997-09-01), Agrawal et al.
patent: 5664172 (1997-09-01), Antoshenkov
patent: 5666524 (1997-09-01), Kunkel et al.
patent: 5666527 (1997-09-01), Tatsumi et al.
patent: 5668986 (1997-09-01), Nilsen et al.
patent: 5687369 (1997-11-01), Li
patent: 5953716 (1999-09-01), Madnick et al.
patent: 5963932 (1999-10-01), Jakobsson et al.
patent: 6006224 (1999-12-01), McComb et al.
patent: 6327587 (2001-12-01), Forster
Chamberlin, D.D., “Using the new DB2: IBM's object-relational database system”, ISBN 1-55860-373-5, Morgan Kaufmann Series in Data Management Systems, Morgan Kaufmann Publishers, 1996, pp. 647-651.
PCT International Search Report corresponding to International Application No. PCT/US99/22148; Authorized Officer: Fournier, C.; Date of Completion: Feb. 29, 2000; Date of Mailing: Mar. 16, 2000 (3 pages).
Balkir, N.H. et al., “Visual: a graphical icon-based query language”, Proceedings of the Twelfth International Conference on Data Engineering (Cat. No. 96CB3588), New Orleans, LA, Feb. 26-Mar. 1, 1996, pp. 524-533.
Childs, D.L., “Extended set theory. A general model for very large, distributed backend information systems”, Proceedings on Very Large Data Bases, Tokyo, Japan, Oct. 6-Oct. 8, 1977, pp. 28-46.
Grahne, G. et al., 1997 Journal of Computer and System Sciences 54, “Knowledgebase Transformations”, Article No. SS971454, pp. 98-112.
Cherniack, M. et al., Jun. 1996 SIGMOD Record, vol. 25, Issue 2, “Rule Languages and Internal Algebras for Rule-Based Optimizers”, pp. 401-412.
Ayres, R. et al., Advances in Databases, 14th British National Conference on Databases BNCOD 14, Edinburgh, United Kingdom, Jul. 3-5, 1996 Proceedings, “Querying Graph Databases Using a Functional Language Extended with Se
