Method for automatic wrapper repair

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Details

C707S793000, C715S252000, C714S005110

active

07139777

ABSTRACT:
A method for repairing a wrapper associated with an information source includes: defining a classifier based on content features of information previously extracted and labeled using the wrapper; using the classifier to extract content information from the file according to a set of classifier extraction rules; analyzing the extracted content information according to the content features and assigning a label to any extracted content information that satisfies the label's rules; and defining a repaired wrapper as the classifier together with those labels in the set that have been assigned to extracted content information. Additional content information and labels can be extracted by iteratively creating a classifier based on both content features and structure features of extracted strings.
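
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the feature set in content_features, the nearest-centroid classifier, the max_dist threshold, and the tag-splitting tokenizer are all hypothetical choices made for illustration, and the iterative pass that adds structure features is omitted.

```python
import re
from collections import defaultdict


def content_features(text):
    """Hypothetical content features: scaled length, digit share, uppercase share, '@' flag."""
    n = max(len(text), 1)
    return (
        min(len(text), 50) / 50.0,
        sum(c.isdigit() for c in text) / n,
        sum(c.isupper() for c in text) / n,
        1.0 if "@" in text else 0.0,
    )


def train_classifier(labeled):
    """Build one feature centroid per label from (string, label) pairs
    that the wrapper extracted while it still worked."""
    sums = {}
    counts = defaultdict(int)
    for text, label in labeled:
        feats = content_features(text)
        if label not in sums:
            sums[label] = [0.0] * len(feats)
        for i, value in enumerate(feats):
            sums[label][i] += value
        counts[label] += 1
    return {
        label: tuple(v / counts[label] for v in vec)
        for label, vec in sums.items()
    }


def classify(centroids, text, max_dist=0.25):
    """Return the label of the nearest centroid, or None when no centroid
    lies within max_dist (an assumed stand-in for 'the label's rules')."""
    feats = content_features(text)
    best_label, best_dist = None, max_dist
    for label, centroid in centroids.items():
        dist = sum((a - b) ** 2 for a, b in zip(feats, centroid)) ** 0.5
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def repair_wrapper(centroids, page):
    """Extract candidate strings from the changed page, label the ones the
    classifier accepts, and return the repaired wrapper as the classifier
    plus the labels that were actually assigned."""
    candidates = [t.strip() for t in re.split(r"<[^>]+>", page) if t.strip()]
    assigned = []
    for text in candidates:
        label = classify(centroids, text)
        if label is not None:
            assigned.append((text, label))
    return {
        "classifier": centroids,
        "labels": sorted({label for _, label in assigned}),
        "extracted": assigned,
    }


# Illustrative data only: strings labeled before the page layout changed.
history = [
    ("1999", "year"), ("2000", "year"), ("2001", "year"),
    ("nick@example.org", "email"), ("a.b@example.com", "email"),
]
changed_page = "<div><b>2004</b><i>jane@example.net</i><p>Some Unrelated Heading Text</p></div>"
repaired = repair_wrapper(train_classifier(history), changed_page)
```

Because the sketch relies only on what the strings look like, it keeps working when the surrounding markup changes, which is the situation a broken wrapper is in. Strings that match no label closely enough are left unlabeled, so only labels actually assigned end up in the repaired wrapper.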

REFERENCES:
patent: 5826258 (1998-10-01), Gupta et al.
patent: 5913214 (1999-06-01), Madnick et al.
patent: 6009441 (1999-12-01), Mathieu et al.
patent: 6085198 (2000-07-01), Skinner et al.
patent: 6304870 (2001-10-01), Kushmerick et al.
patent: 6424980 (2002-07-01), Iizuka et al.
patent: 6442749 (2002-08-01), Hirao et al.
patent: 6691264 (2004-02-01), Huang
patent: 6792576 (2004-09-01), Chidlovskii
patent: 7093156 (2006-08-01), Shubat et al.
patent: 2002/0062312 (2002-05-01), Gupta et al.
patent: 2004/0015784 (2004-01-01), Chidlovskii
Knoblock, Craig A. et al., “Accurately and Reliably Extracting Data from the Web: A Machine Learning Approach”, 2000 IEEE, Dec. 2000, pp. 1-13.
Muslea, Ion et al., “A Hierarchical Approach to Wrapper Induction”, 1999 ACM, pp. 190-197.
Faensen, D. et al., “Hermes—A Notification Service for Digital Libraries”, ACM 2001, pp. 373-380.
Kushmerick, Nicholas “Regression testing for wrapper maintenance”, American Association for Artificial Intelligence, 1999, pp. 1-6.
Kushmerick, Nicholas “Gleaning the Web”, IEEE Intelligent Systems, pp. 20-22.
Stonebraker, Michael et al., “Content Integration for E-Business”, ACM SIGMOD, May 21-24, 2001, pp. 552-560.
Chidlovskii, Boris, System and Method of Automatic Wrapper Grammar Generation, Jul. 6, 1999, U.S. Appl. No. 09/361,496.
Gruser, Jean-Robert, et al., Wrapper Generation for Web Accessible Data Sources, University of Maryland, College Park, Maryland (gruser, louiqa, mvidal, bright) @umiacs.umd.edu.
Kushmerick, Nicholas, Wrapper Induction: Efficiency and expressiveness, Artificial Intelligence 118, 2000 Elsevier Science, pp. 15-68.
Kushmerick, Nicholas, Wrapper Induction: Efficiency and expressiveness (Extended Abstract), School of Computer Applications, Dublin City University, Ireland, 25-37 (nick@compapp.dcu.ie).
