Unsupervised, automated web host dynamicity detection, dead...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000, C707S793000, C715S205000, C725S037000

Reexamination Certificate

active

07610267

ABSTRACT:
Automated crawling of page links associated with a site domain that was previously crawled involves computing the dynamicity of a site based on totals of continuous dead links, live links and/or prerequisite pages encountered while crawling page links corresponding to the site. The degree to which links are crawled is optimized based on the dynamicity of the site. Some pages require that another particular page (i.e., a prerequisite page) is retrieved from the host prior to retrieving a given page, e.g., so that the prerequisite page can set a cookie. Prerequisite pages are determined based on stored information about pages that were retrieved, during a previous crawl, prior to retrieving a page. Prerequisite pages are identified to a search system so that when a user clicks on the URL for the page, the request is redirected to the prerequisite page to set the cookie appropriately.

REFERENCES:
patent: 6064952 (2000-05-01), Imanaka et al.
patent: 6219818 (2001-04-01), Freivald et al.
patent: 6738344 (2004-05-01), Bunton et al.
patent: 6871213 (2005-03-01), Graham et al.
patent: 7155489 (2006-12-01), Heilbron et al.
patent: 7464326 (2008-12-01), Kawai et al.
patent: 2002/0156779 (2002-10-01), Elliott et al.
patent: 2004/0070606 (2004-04-01), Yang et al.
patent: 2004/0083424 (2004-04-01), Kawai et al.
patent: 2005/0114319 (2005-05-01), Brent et al.
patent: 2005/0120060 (2005-06-01), Meng
patent: 2005/0192936 (2005-09-01), Meek et al.
patent: 2005/0262063 (2005-11-01), Conboy et al.
patent: 2005/0289446 (2005-12-01), Moncsko et al.
patent: 2006/0112089 (2006-05-01), Broder et al.
patent: 2006/0294052 (2006-12-01), Kulkami et al.
patent: 2008/0097958 (2008-04-01), Ntoulas et al.
patent: WO 0152078 (2001-07-01), None
J.L. Wolf et al, “Optimal crawling strategies for web search engines”, In proceedings of the 11th International World Wide Web Conference, pp. 136-147, 2002.
Dennis Fetterly et al, “A Large-Scale Study of the Evolution of Web Pages”, ACM, Budapest, Hungary, 2003, p. 669-678.
Brian E. Brewington et al, “How dynamic is the web?”, Thayer School of Engineering, Dartmouth College, 2000, p. 1-20.
Ziv Bar-Yossef et al, “Sic Transit Gloria Telae: Towards an Understanding of the Web's Decay”, ACM, 2004, p. 328-338.
Fred Douglis et al, “Rate of Change and other Metrics: a Live Study of the World Wide Web”, AT & T Labs Research, 1997.
Hedley, Y.L.; Younas, M.; James, A.; Sanderson, M.; Query-related data extraction of hidden web documents; Jul. 25-29, 2004; http://doi.acm.org/10.1145/1008992.1009119, p. 558-559.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Unsupervised, automated web host dynamicity detection, dead... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Unsupervised, automated web host dynamicity detection, dead..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Unsupervised, automated web host dynamicity detection, dead... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4084338

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.