Method for automatic wrapper repair

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000, C707S793000

Reexamination Certificate

active

07440974

ABSTRACT:
A method of information extraction from a Web page using an initial wrapper which has become partially inoperative, wherein the initial wrapper comprises an initial set of rules for extracting information and for assigning labels from a wrapper set of labels to the extracted information, includes using the initial set of rules to extract strings from the Web page parsed in forward direction; analyzing the extracted strings according to the initial set of rules for assigning labels associated with the wrapper; assigning labels to those strings which satisfy the label rules; using the initial set of rules to extract strings from the Web page in backward/(opposite) direction; analyzing the extracted strings according to the set of rules for assigning labels associated with the wrappers; and assigning labels to those unlabeled strings from which satisfy the label rules.

REFERENCES:
patent: 4996642 (1991-02-01), Hey
patent: 5704017 (1997-12-01), Heckerman et al.
patent: 5724567 (1998-03-01), Rose et al.
patent: 5794237 (1998-08-01), Gore, Jr.
patent: 5826258 (1998-10-01), Gupta et al.
patent: 5913214 (1999-06-01), Madnick et al.
patent: 6009441 (1999-12-01), Mathieu et al.
patent: 6085198 (2000-07-01), Skinner et al.
patent: 6102969 (2000-08-01), Christianson et al.
patent: 6266668 (2001-07-01), Vanderveldt et al.
patent: 6304870 (2001-10-01), Kushmerick et al.
patent: 6424980 (2002-07-01), Iizuka et al.
patent: 6442749 (2002-08-01), Hirao et al.
patent: 6473752 (2002-10-01), Fleming, III
patent: 6606625 (2003-08-01), Muslea et al.
patent: 6691264 (2004-02-01), Huang
patent: 6714941 (2004-03-01), Lerman et al.
patent: 6778979 (2004-08-01), Grefenstette et al.
patent: 6792576 (2004-09-01), Chidlovskii
patent: 6820075 (2004-11-01), Shanahan et al.
patent: 7007017 (2006-02-01), Bergholz et al.
patent: 2002/0062312 (2002-05-01), Gupta et al.
patent: 2004/0015784 (2004-01-01), Chidlovskii
patent: 2005/0022114 (2005-01-01), Shanahan et al.
Ashish, Naveen et al., “Semi-Automatic Generation for Internet Information Sources”, Proceedings of the Second IFCIS International Conference on Cooperative Information Systems, COOPIS 1997, Jun. 24-27, 1997, pp. 160-169.
Ashish, Naveen et al., “Wrapper Generation for Semi-Structured Internet Sources”, SIGMOD Record, vol. 26, No. 4, Dec. 1997, pp. 8-15.
Chidlovskii, Boris et al., “Towards Sophisticated Wrapping of Web-Based Information Repositories”, Proceedings of the 5th International RIAO Conference, Montreal, Canada, Jun. 25-27, 1997, pp. 123-135.
Crescenzi, Valter et al., “Grammers HAve Exceptions”, Information Systems, vol. 23, No. 9, Dec. 1998, pp. 539-565, (1998 Elsevier Science Ltd., pp. 1-25).
Hoding, Michael et al., “Schema Derivation for WWW Information Sources and Their Integration with Databases in Bioinformatics”, Proceedings of the Second East European Symposium on Advances in Databases and Information Systems, Lecture Notes in Computer Science; vol. 1475, 1998, pp. 296-304.
Hong, Theodore, “Visualizing Real Estate Property Information on the Web”, Proceedings of the 1999 IEEE International Conference on Information Visualization, Jul. 1999, pp. 182-187.
Huck, Gerald et al., “JEDI: Extracting and Synthesizing Information from the Web”, Proceedings of the 3rd IFCIS International Conference on Cooperative Information Systems, Aug. 1998, pp. 32-41.
Kushmerick, Nicholas et al., “Wrapper Induction for Information Extraction”, International Joint Conference on Artificial Intelligence, IJCAI-1997, pp. 729-737.
Kushmerick, Nicholas, Wrapper Induction: Efficiency and Expressiveness (Extended Abstract), 1998, Artificial Intelligence, vol. 118, No. 1-2, (2000) pp. 1-8.
Kushmerick, Nicholas, “Wrapper Induction for Information Extraction” Dissertation for the Doctor of Philosophy, Department of Computer science and Engineering, Nov. 26, 1997, University of Washington, 249 Pages.
Muslea, Ion et al., “Stalker: Learning Extraction Rules for Semistructured, Web-based Information Sources”, In Proceedings of AAAI-98 Workshop on AI and Information Integration, Technical Report WS-98-01, AAAI Press, Menlo Park, CA (1998). http://citeseer.comp.nus.edu.sg/25537.html, 8 Pages.
Noah, William W., “The Integration of the World Wide Web and Intranet Data Resources”, Proceedings of the Thirty-First Annual Hawaii International Conference on System Sciences-vol. 4, Jan. 1998, pp. 496-502.
Gruser, J.R.,et al . “Wrapper Generation for Web Accessible Data Sources.” CoopIS 1998. http://citeseer.ist.psu.edu/article/gruser98wrapper.html, pp. 14-23.
Faensen, D. et al., “Hermes—a notification service for digital libraries”, Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries, ACM 2001, pp. 373-380.
Kushmerick, Nicholas, “Regression testing for wrapper maintenance”, Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applicatio, AAAI-99, 1999, pp. 74-79 (1 -6).
Stonebraker, Michael et al., “Content integration for e-business”, ACM SIGMOD Record, vol. 30, Issue 2 (Jun. 2001), pp. 552-560.
Faensen, D. et al., “Hermes—A Notification Service for Digital Libraries”, ACM 2001, pp. 373-380.
Kushmerick, Nicholas “Regression testing for wrapper maintenance”, American Association for Artificial Intelligence, 1999, pp. 1-6.
Kushmerick, Nicholas “Gleaning the Web”, IEEE Intelligent Systems, pp. 20-22.
Stonebraker, Michael et al., “Content Integration for E-Business”, ACM Sigmod, May 21-24, 2001, pp. 552-560.
Chidlovskii, Boris, System and Method of Automatic Wrapper Grammer Generation, Jul. 6, 1999, U.S. Appl. No. 09/361/496.
Gruser, Jean-Robert, et. al., Wrapper Generation for Web Accessible Data Sources, University of Maryland, College Park Maryland {gruser, louiqa, mvidal, bright} @umiacs.umd.edu.
Kushmerick, Nicholas, Wrapper Induction: Efficiency and expressiveness, Artificial Intelligence 118, 2000 Elsevier Science, pp. 15-68.
Kashmerick, Nicholas, Wrapper Induction, Efficiency and expressiveness (Extended Abstract), School of Computer Applications, Dublin City University, Ireland, 25-37 (nick@compapp.dcu.ie).

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method for automatic wrapper repair does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method for automatic wrapper repair, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for automatic wrapper repair will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4009016

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.