Method and system for indexing and searching databases

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000

Reexamination Certificate

active

07136851

ABSTRACT:
A search system generates an index for databases by generatively sampling the databases and uses that index to identify and formulate queries for searching the databases. The generated index is referred to as a domain-attribute index and contains a domain-level index and site-level indexes. A site-level index for a database maps site attributes to distinct attribute values within the database. The domain-level index for a domain maps attribute values to database and site attribute pairs that contain those attribute values. To generate a site-level index for a database within a certain domain, the search system starts out with an initial set of the sample data for that domain. The search system generates sampling queries based on the sample data and submits the sampling queries to a database. The search system updates the site-level index based on the sampling results and uses the results to generate more sampling queries.
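The two index levels and the sampling loop described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the abstract does not specify data structures, so `sample_site`, `build_domain_index`, the in-memory record lists, and `keyword_search` (a stand-in for submitting a sampling query to a real database's search interface) are hypothetical and are not the patented implementation.

```python
from collections import defaultdict, deque


def keyword_search(records, term):
    """Hypothetical stand-in for a database's search interface:
    return every record in which some attribute equals `term`."""
    return [r for r in records if term in r.values()]


def sample_site(records, seed_values, max_queries=50):
    """Build a site-level index (site attribute -> distinct values seen)
    by repeatedly querying with values found in earlier results."""
    site_index = defaultdict(set)
    pending = deque(seed_values)   # sampling queries still to submit
    tried = set()                  # values already used as queries
    while pending and len(tried) < max_queries:
        term = pending.popleft()
        if term in tried:
            continue
        tried.add(term)
        for record in keyword_search(records, term):
            for attribute, value in record.items():
                if value not in site_index[attribute]:
                    site_index[attribute].add(value)
                    pending.append(value)  # new value becomes a new query
    return site_index


def build_domain_index(site_indexes):
    """Build a domain-level index:
    attribute value -> set of (database, site attribute) pairs."""
    domain_index = defaultdict(set)
    for database, site_index in site_indexes.items():
        for attribute, values in site_index.items():
            for value in values:
                domain_index[value].add((database, attribute))
    return domain_index


# Two toy databases in the same domain with different attribute names.
BOOKS_A = [
    {"author": "Austen", "title": "Emma"},
    {"author": "Austen", "title": "Persuasion"},
    {"author": "Eliot", "title": "Middlemarch"},
]
BOOKS_B = [
    {"writer": "Eliot", "name": "Romola"},
    {"writer": "Austen", "name": "Emma"},
]

site_indexes = {
    "books_a": sample_site(BOOKS_A, ["Austen"]),
    "books_b": sample_site(BOOKS_B, ["Austen"]),
}
domain_index = build_domain_index(site_indexes)
```

In this toy run, looking up `domain_index["Emma"]` yields the pairs `("books_a", "title")` and `("books_b", "name")`, which is the information a search system would need to decide which databases to query and which attribute to fill in. Because the index is built from samples, values that no sampling query reaches (here, the "Eliot" records) are simply absent from it.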

REFERENCES:
patent: 5548770 (1996-08-01), Bridges
patent: 5999928 (1999-12-01), Yan
patent: 7020679 (2006-03-01), Tian
patent: 2002/0077968 (2002-06-01), Kaniwa et al.
patent: 2003/0177111 (2003-09-01), Egendorf et al.
patent: 2003/0212737 (2003-11-01), Moricz et al.
Arasu, Arvind and Hector Garcia-Molina, “Extracting Structured Data from Web Pages,” SIGMOD San Diego, ACM, Jun. 9-12, 2003.
Bergman, Michael K., “The Deep Web: Surfacing Hidden Value,” Journal of Electronic Publishing, University of Michigan Press, Jul. 2001.
Callan, Jamie, Margaret Connell and Aiqun Du, “Automatic Discovery of Language Models for Text Databases,” SIGMOD, Philadelphia, ACM 1999.
Chang, Chia-Hui and Shao-Chen Lui, “IEPAD: Information Extraction Based on Pattern Discovery,” WWW Hong Kong, ACM May 1-5, 2001.
Chang, Kevin Chen-Chuan, Bin He, Chengkai Li, Mitesh Patel and Zhen Zhang, “Structured Databases on the Web: Observations and Implications,” Technical Report UIUCCDCS-R-2003-2321, CS Department, University of Illinois at Urbana-Champaign, Feb. 2003.
Crescenzi, Valter, Giansalvatore Mecca and Paolo Merialdo, “RoadRunner: Towards Automatic Data Extraction from Large Web Sites,” Proceedings of the 27th VLDB Conference, Italy, 2001.
Florescu, Daniela, Alon Levy and Alberto Mendelzon, “Database Techniques for the World-Wide Web: A Survey,” SIGMOD, 1998.
He, Bin and Kevin Chen-Chuan Chang, “Statistical Schema Matching across Web Query Interfaces,” SIGMOD, San Diego, CA, ACM Jun. 9-12, 2003.
He, Hai, Weiyi Meng, Clement Yu, and Zonghuan Wu, “WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce,” Proceedings of the 29th VLDB Conference, Germany, 2003.
Cho, Junghoo and Hector Garcia-Molina, “Synchronizing a database to Improve Freshness,” SIGMOD Conference, Oct. 25, 1999.
Wang, Jiying and Frederick H. Lochovsky, “Wrapper Induction based on nested Pattern Discovery,” Technical Report HKUST-CS-27-02, Department of Computer Science, Hong Kong University of Science & Technology, 2002.
Ipeirotis, Panagiotis G. et al., “Probe, Count, and Classify: Categorizing Hidden-Web Databases,” ACM SIGMOD May 21-24, 2001, Santa Barbara, California, Copyright 2001 (12 pages).
Wang, Jiying and Lochovsky, Fred H., “Data Extraction and Label Assignment for Web Databases,” May 20-24, 2003, Budapest, Hungary (18 pages) http://www2003.org/cdrom/papers/refereed/p470/470-wang.htm.
Gravano, Luis and Ipeirotis, Panagiotis G., “QProber: A System for Automatic Classification of Hidden-Web Databases,” ACM Transactions on Information Systems, vol. 21, No. 1, Jan. 2003 (pp. 1-41).
Meng, Weiyi et al., “Building Efficient and Effective Metasearch Engines,” ACM Computing Surveys, vol. 34, No. 1, Mar. 2002 (42 pages).
Meng, Weiyi et al., “A Highly Scalable and Effective Method for Metasearch,” ACM Transactions on Information Systems, vol. 19, No. 3, Jul. 2001 (26 pages).
Kossmann, Donald, “The State of the Art in Distributed Query Processing,” ACM Computing Surveys, vol. 32, No. 4, Dec. 2000 (48 pages).
Ipeirotis, Panagiotis G., “Distributed Search Over the Hidden Web: Hierarchical Database Sampling and Selection,” Proceedings of the 28th VLDB Conference, Hong Kong, China, 2002 (12 pages).
Raghavan, Sriram and Garcia-Molina, Hector, “Crawling the Hidden Web,” Proceedings of the 27th VLDB Conference, Rome, Italy, 2001 (10 pages).
