Data extraction from world wide web pages

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

707201, 707 2, 707 4, G06F 1730

Patent

active

059132140

ABSTRACT:
A system for querying disparate, heterogeneous data sources over a network, where at least some of the data sources are World Wide Web pages or other semi-structured data sources, includes a query converter, a command transmitter, and a data retriever. The query converter produces, from at least a portion of a query, a set of commands which can be used to interact with a semi-structured data source. The query converter may accept a request in the same form as normally used to access a relational data base, therefore increasing the number of data bases available to a user in a transparent manner. The command transmitter issues the produced commands to the semi-structured data source. The data retriever then retrieves the desired data from the data source. In this manner, structured queries may be used to access both traditional, relational data bases as well as non-traditional, semi-structured data bases such as web sites and flat files. The system may also include a request translator and a data translator for providing data context interchange. The request translator translates a request for data having a first data context into a query having a second data context which the query converter described above. The data translator translates data retrieved from the data context of the data source into the data context associated with the request. A related method for querying disparate data sources over a network is also described.

REFERENCES:
patent: 4714995 (1987-12-01), Materna et al.
patent: 5345586 (1994-09-01), Hamala et al.
patent: 5506984 (1996-04-01), Miller
patent: 5511186 (1996-04-01), Carhart et al.
patent: 5596744 (1997-01-01), Dao et al.
patent: 5600831 (1997-02-01), Levy et al.
patent: 5634053 (1997-05-01), Noble et al.
patent: 5737592 (1998-04-01), Nguyen et al.
patent: 5826258 (1998-10-01), Gupta et al.
Daruwala et al,. "The Context Interchange Network", Database Applications Semantics, Proceedings of the IFIP WG 2.6 Working Conference on Database Applications Semantics (DS-6), Stone Mountain, Atlanta, Georgia, U.S.A., May 30-Jun. 2, 1995, pp. 65-91.
Tomasic et al., "Scaling Heterogeneous Databases and the Design of Disco", Proceedings of the 16th International Conference on Distributed Computing Systems, Hong Kong, May 27-30, 1996, pp. 449-457.
Tomasic et al., "The Distibuted Information Search Component (Disco) and the World Wide Web", ACM Sigmod International Conference on Management of Data, Tucson, Arizona, U.S.A. May 13-15, 1997, pp. 546-548.
Woelk et al., "InfoSleuth: Networked Exploitation of Information Using Semantic Agents", Digest of Papers of the Computer Society Computer Conference (Spring) Compcon, Technologies for the Information Superhighway, San Francisco, California, Mar. 5-9, 1995, pp. 147-152.
Qu, Jessica F., "Data Wrapping on the World Wide Web," Masters Thesis, Sloan School of Management, Massachusetts Institute of Technology, Feb. 1996.
Jakobiasiak, Marta, "Programming the Web--Design and Implementation of a Multidatabase Browser," Masters Thesis, Sloan School of Management, Massachusetts Institute of Technology, May 1996.
Siegel, et al. "Using Semantic Values to Facilitate Interoperability Among Heterogeneous Information Systems", Working Paper, Alfred P. Sloan School of Management, Massachusetts Institute of Technology, Feb. 1993.
Kay, Roger L., "What's the meaning of this?|", Computerworld, pp. 89-93 (1994).
Daruwala, et al. "The Context Interchange Network Prototype", Sixth IFP TC-2 Working Conference on Data Semantics (DS-6), Massachusetts Institute of Technology, The Sloan School of Management, May. 1995.
Goh, C.H. et al. "Ontologies, Contexts, and Mediation: Representing and Reasoning about Semantics Conflicts in Heterogeneous and Autonomous Systems", Working Paper, MIT Sloan School of Management, Aug. 1995.
Madnick, S. et al. "Using Knowledge About Data to Integrate Disparate Sources", Intelligent Integration of Information (I.sup.3) Workshop, Sloan School of Management, Massachusetts Institute of Technology, San Diego, California, Jan. 9-12, 1996.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Data extraction from world wide web pages does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Data extraction from world wide web pages, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Data extraction from world wide web pages will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-410536

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.