System and method for crawl ordering by search impact

Data processing: database and file management or data structures – Database and file access – Search engines

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S711000, C707S748000, C707S751000

Reexamination Certificate

active

07899807

ABSTRACT:
An improved system and method for crawl ordering of a web crawler by impact upon search results of a search engine is provided. Content-independent features of uncrawled web pages may be obtained, and the impact of uncrawled web pages may be estimated for queries of a workload using the content-independent features. The impact of uncrawled web pages may be estimated for queries by computing an expected impact score for uncrawled web pages that match needy queries. Query sketches may be created for a subset of the queries by computing an expected impact score for crawled web pages and uncrawled web pages matching the queries. Web pages may then be selected to fetch using a combined query-based estimate and query-independent estimate of the impact of fetching the web pages on search query results.

REFERENCES:
patent: 6418433 (2002-07-01), Chakrabarti et al.
patent: 6961723 (2005-11-01), Faybishenko et al.
patent: 7269587 (2007-09-01), Page
patent: 7475069 (2009-01-01), Blackman et al.
patent: 7536389 (2009-05-01), Prabhakar et al.
patent: 2002/0078136 (2002-06-01), Brodsky et al.
patent: 2002/0111934 (2002-08-01), Narayan
patent: 2002/0194161 (2002-12-01), McNamee et al.
patent: 2005/0060297 (2005-03-01), Najork
patent: 2005/0168460 (2005-08-01), Razdan et al.
patent: 2006/0218138 (2006-09-01), Weare
patent: 2006/0294052 (2006-12-01), Kulkami et al.
patent: 2007/0022085 (2007-01-01), Kulkarni
patent: 2007/0038608 (2007-02-01), Chen
patent: 2007/0112761 (2007-05-01), Xu et al.
patent: 2007/0198345 (2007-08-01), Park
patent: 2007/0239701 (2007-10-01), Blackman et al.
patent: 2008/0147644 (2008-06-01), Aridor et al.
patent: 2008/0243812 (2008-10-01), Chien et al.
patent: 2008/0313247 (2008-12-01), Galvin
patent: 2009/0006365 (2009-01-01), Liu et al.
patent: 2009/0019354 (2009-01-01), Jaiswal et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and method for crawl ordering by search impact does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and method for crawl ordering by search impact, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for crawl ordering by search impact will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2726239

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.