Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2003-10-15
2009-06-23
Vo, Tim T. (Department: 2168)
C707S793000
active
07552109
ABSTRACT:
A collaborative focused crawler crawls documents on a network locating documents that match multiple focus topics. The collaborative crawler comprises a fetcher and a focus engine. The fetcher prioritizes which documents to crawl based on a set of rules, obtains documents from the network, and outputs crawled documents to the focus engine. The focus engine determines whether a fetched document is relevant to any of the multiple focus topics. The focus engine determines whether fetched documents are disallowed. If a fetched document is disallowed, the present system may place the URL for that web document in a blacklist, a list of URLs that may not be crawled. URLs may be disallowed if they match a disallowed topic or if they fail a set of rules designed for a web space focus, for example, domain rules, IP address rules, and prefix rules.
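The focus-engine behavior described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names (`FocusRules`, `FocusEngine`), the keyword-overlap relevance test, and the example URLs are all assumptions introduced here for illustration. The sketch only shows the general flow of checking a fetched document against disallowed topics and web-space rules (domain, IP address, prefix), blacklisting its URL on failure, and otherwise reporting which of several focus topics it matches.

```python
from dataclasses import dataclass, field
from urllib.parse import urlsplit


@dataclass
class FocusRules:
    """Hypothetical web-space focus rules; every field name is an assumption."""
    allowed_domains: set = field(default_factory=set)  # domain rules
    blocked_ips: set = field(default_factory=set)      # IP address rules
    allowed_prefixes: tuple = ()                       # prefix rules

    def passes(self, url, ip=None):
        host = (urlsplit(url).hostname or "").lower()
        # An empty rule set is treated as "no restriction".
        domain_ok = not self.allowed_domains or any(
            host == d or host.endswith("." + d) for d in self.allowed_domains
        )
        ip_ok = ip is None or ip not in self.blocked_ips
        prefix_ok = not self.allowed_prefixes or url.startswith(
            tuple(self.allowed_prefixes)
        )
        return domain_ok and ip_ok and prefix_ok


class FocusEngine:
    """Toy focus engine: keyword overlap stands in for a real topic classifier."""

    def __init__(self, focus_topics, disallowed_topics, rules):
        self.focus_topics = focus_topics            # topic name -> keyword set
        self.disallowed_topics = disallowed_topics  # topic name -> keyword set
        self.rules = rules
        self.blacklist = set()                      # URLs that may not be crawled

    @staticmethod
    def _matching_topics(text, topics):
        words = set(text.lower().split())
        return sorted(name for name, kws in topics.items() if words & kws)

    def evaluate(self, url, text, ip=None):
        """Return matched focus topics; [] if the URL is disallowed or irrelevant."""
        if url in self.blacklist:
            return []
        disallowed = self._matching_topics(text, self.disallowed_topics)
        if disallowed or not self.rules.passes(url, ip):
            self.blacklist.add(url)
            return []
        return self._matching_topics(text, self.focus_topics)


# Example configuration (all values are made up for illustration).
engine = FocusEngine(
    focus_topics={"crawling": {"crawler", "spider"}, "databases": {"sql", "index"}},
    disallowed_topics={"gambling": {"casino"}},
    rules=FocusRules(
        allowed_domains={"example.org"},
        allowed_prefixes=("http://www.example.org/docs/",),
    ),
)
```

Calling `engine.evaluate(url, text)` for each document handed over by a fetcher would then yield the list of matching focus topics, while any URL that fails a rule or matches a disallowed topic ends up in `engine.blacklist`. A real system would presumably replace the keyword test with a trained classifier; that choice is outside what the abstract specifies.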
REFERENCES:
patent: 6199081 (2001-03-01), Meyerzon et al.
patent: 6295559 (2001-09-01), Emens et al.
patent: 6418433 (2002-07-01), Chakrabarti et al.
patent: 6691108 (2004-02-01), Li
patent: 6754873 (2004-06-01), Law et al.
patent: 6993534 (2006-01-01), Denesuk et al.
patent: 7080073 (2006-07-01), Jiang et al.
patent: 7085753 (2006-08-01), Weiss et al.
patent: 2001/0044818 (2001-11-01), Liang
patent: 2002/0032869 (2002-03-01), Lamberton et al.
patent: 2002/0194161 (2002-12-01), McNamee et al.
patent: 2004/0049514 (2004-03-01), Burkov
patent: 2006/0277175 (2006-12-01), Jiang et al.
Article entitled “Mercator: A Scalable, Extensible Web Crawler”, dated Jun. 26, 1999, by Heydon et al.
R. Ghani, et al., “Relevance feedback rather than actually web mining,” available at http://www.cs.nyu.edu/courses/fall02/g22.3033-008/lec10.html, on Aug. 28, 2003.
S. Chakrabarti, “Focussed Crawling,” available at http://www.cs.Berkeley.edu/~soumen/focus/ on Aug. 28, 2003.
P. Perry, “Personal Search Crawlers,” available at http://www.paulperry.net/notes/search.asp on Aug. 28, 2003.
C. Aggarwal, et al., “Intelligent Crawling on the World Wide Web With Arbitrary Predicates,” available at http://www.10.org/cdrom/papers/110/ on Aug. 28, 2003.
“Website Promotion Scientific,” available at http://www.ranks.nl/resources/scientific.html on Aug. 28, 2003.
CSIRO—HAIL seminars—Abstract available at http://www.cmis.csiro.au/conferences-seminars/hail/abstracts/2000-past/DavidHawking.htm on Aug. 28, 2003.
“Smider the Smart spIDER,” available at http://frank.spieleck.de/metasuch/ on Aug. 28, 2003.
“The Homepage of Christopher James,” available at http://www.csee.umbc.edu/~cjames2/Research.htm on Aug. 28, 2003.
Balasubramanian Srinivasan
Chavet Laurent
Qi Runping
Cantor & Colburn LLP
Dwivedi Mahesh H
International Business Machines Corporation
Lambert Brian
Vo Tim T.