Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2005-12-21
2008-05-27
Wong, Don (Department: 2163)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000
Reexamination Certificate
active
07379932
ABSTRACT:
A method (100) of crawling the Web (620) is disclosed. The method (100) crawls (120) Web pages on the Web starting from a given (110) set of seed Universal Resource Locators (URLs). Crawled Web pages are partitioned (140) into sets of relevant and irrelevant pages. A set of exclusion and/or inclusion patterns are discovered (150) from the sets of relevant and irrelevant pages, and subsequent crawling of the Web is restricted through the set of exclusion and/or inclusion patterns.
REFERENCES:
patent: 6418433 (2002-07-01), Chakrabarti et al.
patent: 6516312 (2003-02-01), Kraft et al.
patent: 6611835 (2003-08-01), Huang et al.
patent: 7080073 (2006-07-01), Jiang et al.
patent: 7231405 (2007-06-01), Xia
patent: 2003/0149694 (2003-08-01), Ma et al.
patent: 2004/0030683 (2004-02-01), Evans et al.
patent: 2005/0086206 (2005-04-01), Balasubramanian et al.
patent: 2005/0138056 (2005-06-01), Stefix et al.
Soumen Chakrabarti, Martin Van Den Berg; Byron Dom; http://www.cs.berkeley.edu/˜soumen/doc/www1999f/pdf/prelim.pdf ; 1999; Computer Networks; Amsterdam, Netherlands.
Agrawal Neeraj
Balakrishnan Sreeram Viswanath
Joshi Sachindra
Filipczyk Marc R
Wong Don
LandOfFree
System and a method for focused re-crawling of Web sites does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and a method for focused re-crawling of Web sites, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and a method for focused re-crawling of Web sites will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3984266