Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2006-11-14
2006-11-14
Bruce, David (Department: 2191)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000
Reexamination Certificate
active
07136851
ABSTRACT:
A search system generates an index for databases by generatively sampling the databases and uses that index to identify and formulate queries for searching the databases. The generated index is referred to as a domain-attribute index and contains a domain-level index and site-level indexes. A site-level index for a database maps site attributes to distinct attribute values within the database. The domain-level index for a domain maps attribute values to database and site attribute pairs that contain those attribute values. To generate a site-level index for a database within a certain domain, the search system starts out with an initial set of the sample data for that domain. The search system generates sampling queries based on the sample data and submits the sampling queries to a database. The search system updates the site-level index based on the sampling results and uses the results to generate more sampling queries.
REFERENCES:
patent: 5548770 (1996-08-01), Bridges
patent: 5999928 (1999-12-01), Yan
patent: 7020679 (2006-03-01), Tian
patent: 2002/0077968 (2002-06-01), Kaniwa et al.
patent: 2003/0177111 (2003-09-01), Egendorf et al.
patent: 2003/0212737 (2003-11-01), Moricz et al.
Arasu, Arvind and Hector Garcia-Molina, “Extracting Structured Data from Web Pages,” SIGMOD San Diego, ACM, Jun. 9-12, 2003.
Bergman, Michael K., “The Deep Web: Surfacing Hidden Value,” Journal of Electronic Publishing, University of Michigan Press, Jul. 2001.
Callan, Jamie, Margaret Connell and Aiqun Du, “Automatic Discovery of Language Models for Text Databases,” SIGMOD, Philadelphia, ACM 1999.
Chang, Chia-Hui and Shao-Chen Lui, “IEPAD: Information Extraction Based on Pattern Discovery,” WWW Hong Kong, ACM May 1-5, 2001.
Chang, Kevin Chen-Chuan, Bin He, Chengkai Li, Mitesh Patel and Zhen Zhang, “Structured Databases on the Web: Observations and Implications,” Technical Report UIUCCDCS-R-2003-2321, CS Department, University of Illinois at Urbana-Champaign, Feb. 2003.
Crescenzi, Valter, Giansalvatore Mecca and Paolo Merialdo, “RoadRunner: Towards Automatic Data Extraction from Large Web Sites,” Proceedings of the 27th VLDB Conference, Italy, 2001.
Florescu, Daniela, Alon Levy and Alberto Mendelzon, “Database Techniques for the World-Wide Web: A Survey,” SIGMOD, 1998.
He, Bin and Kevin Chen-Chuan Chang, “Statistical Schema Matching across Web Query Interfaces,” SIGMOD, San Diego, CA, ACM Jun. 9-12, 2003.
He, Hai, Weiyi Meng, Clement Yu, and Zonghuan Wu, “WISE-Integrator. An Automatic Integrator of Web Search Interfaces for E-Commerce,” Proceedings of th 29th VLDB Conference, Germany, 2003.
Cho, Junghoo and Hector Garcia-Molina, “Synchronizing a database to Improve Freshness.”
SIGMOD Conference, Oct. 25, 1999.
Wang, Jiying and Frederick H. Lochovsky, “Wrapper Induction based on nested Pattern Discovery,” Technical Report HKUST-CS-27-02, Department of Computer Science, Hong Kong University of Science & Technology, 2002.
Ipeirotis, Panagiotis G. et al., “ Probe, Count, and Classify: Categorizing Hidden-Web Databases,” ACM SIGMOD May 21-24, 2001, Santa Barbara, California, Copyright 2001 (12 pages).
Wang, Jiying and Lochovsky, Fred, H., “Data Extraction and Label Assignment for Web Databases,” May 20-24, 2003, Budapest, Hungary (18 pages) http://www2003.org/cdrom/papers/refereed/p470/470-wang.htm.
Gravano, Luis and Panagiotis, Ipeirotis, G., “Qprober: A System for Automatic Classification of Hidden-Web Databases,” ACM Transactions on Information Systems, vol. 21, No. 1, Jan. 2003 (pp. 1-41).
Meng, Weiyi et al., “Building Efficient and Effective Metasearch Engines,” ACM Computing Surveys, vol. 34, No. 1, Mar. 2002 (42 pages).
Meng, Weiyi et al., “A Highly Scalable and Effective Method for Metasearch,” ACM Transations on Information Systems, vol. 19, No. 3, Jul. 2001 (26 pages).
Kossman, Donald, “The State of the Art in Distributed Query Processing,” ACM Computing Surveys, vol. 32, No. 4, Dec. 2000 (48 pages).
Ipeirotis, Panagiotis, G., “Distributed Search Over the Hidden Web: Hierarchical Database Sampling and Selection,” Proceedings of the 28thVLDB Conference, Hong Kong, China, 2002 (12 pages).
Raghavan, Snram and Garcia-Molina, Hector, “Crawling the Hidden Web,” Proceedings of the 27thVLDB Conference, Rome, Italy, 2001 (10 pages).
Ma Wei-Ying
Wen Ji-Rong
Bruce David
Perkins Coie LLP
Sanders Aaron J.
LandOfFree
Method and system for indexing and searching databases does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and system for indexing and searching databases, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and system for indexing and searching databases will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3640140