Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2001-04-05
2004-11-16
Channavajjala, Srirama (Department: 2177)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000, C706S015000
Reexamination Certificate
active
06820089
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Field of the Invention
The field of the invention is data processing, that is, methods and systems for financial, business practice, business management, or cost/price determinations.
2. Description of the Related Art
A data mining tool is computer software that analyzes data and discovers relationships, patterns, knowledge, or information from the data. Data mining is also referred to as knowledge discovery. Data mining tools attempt to solve the problem of users being overwhelmed by the volume of data that computers can collect. Data mining tools attempt to shield users from the unwieldy body of data by analyzing it, summarizing it, or drawing conclusions from the data that the user can understand. For example, one known computer software data mining product is IBM's “Intelligent Miner” which can be operated in several computing environments including AIX, AS/400, and OS/390. The IBM Intelligent Miner is an enterprise data mining tool, designed for client/server configurations and optimized to mine very large data sets, such as gigabyte data sets. The IBM Intelligent Miner includes a plurality of data mining techniques or tools used to analyze large databases and provides visualization tools used to view and interpret the different mining results.
An analytic application is a software application that inputs historical data collected from a production system over time, analyzes samples of this historical data and outputs the findings back to the production system to help improve its operation. For example, an e-commerce server that manages an internet shopping site is a production system, and an analytic application might use historical data collected from the e-commerce server to report on what type users are visiting the site and how many of these are actually buying products. The term “analytic application” is used throughout this specification to mean “analytic software application,” referring to that category of software typically understood to be used directly by users to solve practical problems in their work.
Data mining is an important technology to be integrated into analytic applications. Data mining data processing technology, combinations of hardware and software, that dynamically discover patterns in historical data records and apply properties associated with these records (e.g., likely to buy) to production data records that exhibit similar patterns. Use of data mining typically involves steps such as identifying a business problem to be solved, selecting a mining algorithm useful to solve the business problem, defining data schema to be used as inputs and outputs to and from the mining algorithm, populating input data schema with historical data, training the model based upon the historical data, and scoring production data by use of the model.
In prior art, however, with available data mining tools, the end user of an analytic application must be sufficiently skilled in data mining to accomplish all these tasks, some of which require substantial expertise in data mining. For applications such as e-commerce, which are being widely adopted by businesses of all sizes and in all commerce areas, it is difficult and expensive for every business using data mining to acquire substantial data mining expertise. It would be desirable and useful, therefore, for an analytic application to integrate data mining so as to reduce the need for end users to have special expertise in data mining as such.
SUMMARY OF THE INVENTION
Embodiments of the present invention include methods and systems in which elements of data mining, such as identifying a business problem to be solved, selecting a mining algorithm useful to solve the business problem, defining data schema to be used as inputs and outputs to and from the mining algorithm, and defining data mining models, are performed by an analytic application developer. An “analytic application developer” is a software developer that develops analytic software applications. Throughout this specification, the analytic application developer is described in contrast to end users. An “end user” is a person or entity that installs and uses an analytic application for purposes of scoring and analyzing actual production data. Analytic application developers create the analytic applications that end users use.
In typical embodiments of the invention, the analytic application developer identifies a set of interesting business problems capable of definition sufficient to support data mining solutions. The analytic application developer then selects data mining algorithms useful for solving the identified problems, defines data schema useful as inputs to and outputs from the selected mining algorithm, and defines data mining models. Because the mining algorithms, the data schema, and the mining models are selected, identified, and defined prior to involvement by any end user, the mining algorithms, data schema, and mining models are referred to as being “preselected” and “predefined.”
In typical embodiments of the present invention, it is end users who carry out the data mining steps of populating input data schema with historical data, training the model based upon the historical data, and scoring production data by use of the model. Because the more difficult steps of defining business problems, preselecting mining algorithms, and predefining data schema are performed by an analytic application developer before an end user acquires the analytic application, the end user need only perform straightforward steps guided by such routine graphical user interface elements as mouse-clickable buttons, pull down menus, and wizards. The overall effect of the inventive method is to greatly reduce the data mining expertise needed by the end user.
Use of data mining typically involves steps such as identifying a business problem to be solved, selecting a mining algorithm useful to solve the business problem, defining data schema to be used as inputs and outputs to and from the mining algorithm, populating input data schema with historical data, training the model based upon the historical data, and scoring production data by use of the model.
A useful key to simplifying the use of data mining in analytic applications is to make the analytic application domain-specific. “Domain” refers to a problem subject area, and “domain-specific” means that an analytic application is designed to operate on the basis of data related to a particular problem subject area, where the data has specific defined data elements with defined relations among the data elements. For example, e-commerce is a specific domain, and a domain-specific analytic application for e-commerce would accept and analyze only e-commerce data. For illustration purposes in this specification, e-commerce is chosen as the domain of interest.
For a specific domain, it is a typical use of embodiments of the present invention to identify business problems that are applicable to such a specific domain. Once the business problems that need data mining are identified, embodiments of the invention then typically are used to build an analytic application to solve these business problems so that the analytic application developer can embed in the analytic application all data mining related knowledge needed for the solution so that the end user of the application does not require data mining specific expertise.
The steps of the inventive method in an example business problem are discussed in detail in this specification. Process flow involved in steps of typical embodiments is described by the diagram given in FIG.
1
. In typical embodiments, as mentioned above, the steps of defining business problems, preselecting mining algorithms, predefining data schema, and predefining data mining models are done by the analytic application developer, whereas only the steps of populating the input data schema with historical data, production training the model, and production scoring are left to the end user.
This specification describes sets of business questions useful to the end user
Biggers John R.
Biggers & Ohanian LLP
Channavajjala Srirama
Pham Khanh
Walker Mark S.
LandOfFree
Method and system for simplifying the use of data mining in... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and system for simplifying the use of data mining in..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and system for simplifying the use of data mining in... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3360422