Computer method and apparatus for collecting people and...

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

06983282

ABSTRACT:
Computer processing method and apparatus for searching and retrieving Web pages to collect people and organization information are disclosed. A Web site of potential interest is accessed. A subset of Web pages from the accessed site are determined for processing. According to types of contents found on a subject Web page, extraction of people and organization information is enabled. Internal links of a Web site are collected and recorded in a links-to-visit table. To avoid duplicate processing of Web sites, unique identifiers or Web site signatures are utilized. Respective time thresholds (time-outs) for processing a Web site and for processing a Web page are employed. A database is maintained for storing indications of domain URLs, names of respective owners of the URLs as identified from the corresponding Web sites, type of each Web site, processing frequencies, dates of last processings, outcomes of last processings, size of each domain and number of data items found in the last processing of each Web site.

REFERENCES:
patent: 5319777 (1994-06-01), Perez
patent: 5764906 (1998-06-01), Edelstein et al.
patent: 5813006 (1998-09-01), Polnerow et al.
patent: 5835905 (1998-11-01), Pirolli et al.
patent: 5895470 (1999-04-01), Pirolli et al.
patent: 5918236 (1999-06-01), Wical
patent: 5923850 (1999-07-01), Barroux
patent: 5924090 (1999-07-01), Krellenstein
patent: 5974455 (1999-10-01), Monier
patent: 6065016 (2000-05-01), Stuntebeck et al.
patent: 6094653 (2000-07-01), Li et al.
patent: 6112203 (2000-08-01), Bharat et al.
patent: 6122647 (2000-09-01), Horowitz et al.
patent: 6128613 (2000-10-01), Wong et al.
patent: 6212552 (2001-04-01), Biliris et al.
patent: 6253198 (2001-06-01), Perkins
patent: 6260033 (2001-07-01), Tatsuoka
patent: 6266664 (2001-07-01), Russell-Falla et al.
patent: 6269369 (2001-07-01), Robertson
patent: 6301614 (2001-10-01), Najork et al.
patent: 6336108 (2002-01-01), Thiesson et al.
patent: 6336139 (2002-01-01), Feridun et al.
patent: 6349309 (2002-02-01), Aggarwal et al.
patent: 6377936 (2002-04-01), Henrick et al.
patent: 6389436 (2002-05-01), Chakrabarti et al.
patent: 6418432 (2002-07-01), Cohen et al.
patent: 6463430 (2002-10-01), Brady et al.
patent: 6466940 (2002-10-01), Mills
patent: 6493703 (2002-12-01), Knight et al.
patent: 6529891 (2003-03-01), Heckerman
patent: 6553364 (2003-04-01), Wu
patent: 6556964 (2003-04-01), Haug et al.
patent: 6618717 (2003-09-01), Karadimitriou et al.
patent: 6640224 (2003-10-01), Chakrabarti
patent: 6654768 (2003-11-01), Celik
patent: 6668256 (2003-12-01), Lynch
patent: 6675162 (2004-01-01), Russell-Falla et al.
patent: A-53031/98 (1998-08-01), None
patent: 10-320315 (1998-12-01), None
patent: WO 99/67728 (1999-12-01), None
patent: WO 00/33216 (2000-06-01), None
Lorrie Faith Cranor and Brian A. LaMacchia, “Spam!” Communications of the ACM, Aug. 1998. vol. 4, No. 8, pp. 74-83.
PCT International Search Report PCT/US01/22425.
A.K. Jain et al. “Data Clustering: A Review.” ACM Computing Surveys, vol. 31, No. 3, Sep. 1999, pp. 264-323.
Hall, Robert J. “How to Avoid Unwanted Email.” Communications of the ACM, Mar. 1998. vol. 41, No. 3, pp. 88-95.
International Search Report PCT/US01/23343, Mar. 19, 2003, 4 pp.
Guan, T. and K-F Wong, “KPS: a Web information mining algorithm,”Computer Networks 31:11-16(1495-1507) May 17, 1999, Elsevier Science Publishers B.V., Amsterdam.
Miller, R.C. and K. Bharat, “SPHINX: a framework for creating personal, site specific Web crawlers,”Computer Networks and ISDN Systems, 30:1-7(119-130) Apr. 4, 1998, North Holland Publishing, Amsterdam.
Powell, T.A. et al.,HTML Programmer's Reference,(Appendices A and B), Osborne/McGraw-Hill, 1998 (pp. 355-377).
PCT International Search Report PCT/US01/41515, Feb. 28, 2003, 4 pp.
Langer, A. and J.S. Rosenschein, “Using Distributed Problem Solving to Search the Web,”Proc. 4th Int. Conf. on Autonomous Agents, ACM, USA, Jun. 3-7, 2000, pp. 197-198.
PCT International Search Report PCT/US01/22430, Jan. 17, 2003, 4 pp.
PCT International Search Report PCT/US01/22381, Feb. 12, 2003, 3 pp.
PCT International Search Report PCT/US01/24162, Feb. 13, 2003, 4 pp.
Ball, T. and F. Douglis, “An Internet Difference Engine and its Applications, ”Proceedsings of COMPCON '96, IEEE Comp. Soc. Press, Feb. 25, 1996, p. 71-76.
Freitag, D., “Machine Learning for Information Extraction in Informal Domains,”Machine Learning 39:2/3(169-202), May/Jun. 2000, p. 169-202.
Kjell, B., “Authorship Attribution of Text Samples Using Neural Networks and Bayesian Classifiers,”IEEE Int. Conf. on Systems, Man, and Cybernetics, vol. 2, Oct. 5, 1994, pp. 1660-1664.
Singhal, M., “Update Transport: A New Technique for Update Synchronization in Replicated Database Systems, ”IEEE Transactions on Software Engineering 16:12(1325-1336), Dec. 1, 1990.
ABCNEWS.com, Apr. 28, 1999. http://web.archive.org/web/19990428185649/abcnews.go.com/.
COMPAQ, Apr. 22, 1999. http://web/archive.org/web/19990422222242/www.compaq.com/.
Dwi H. Widyantoro, Thomas R. Ioerger, John Yen. “An Adaptive Algorithm for Learning Changes in User Interests”. Nov. 1999. ACM. p. 405-412.
Soumen Chakrabarti, Byron Dom, Piotr Indyk. “Enhanced hypertext categorization using hyperlinks”. 1998 ACM. pp. 307-318.
Sahami, M. et al., “SONIA: A Service for Organizing Networked Information Autonomously,”3rd ACM Conference on Digital Libraries, Digital 98 Libraries, Jun. 23-26, 1998, pp. 200-209.
Nir Friedman, Moises Goldszmidt, “Building Classifiers using Bayesian Networks”. From Proceedings of the National Conference on Artificial Intelligence (AAAI96). pp. 1277-1284.
Pazzani, M. et al., “Learning from hotlists and coldlists: Towards a WWW information filtering and seeking agent,”Proc. International Conference on Tools with Artificial Intelligence, Los Alamitos, CA, 1994, pp. 492-495.
Lam, W. and K. Low, “Automatic Document Classification Based on Probabilistic Reasoning: Model and Performance Analysis,”1996 IEEE Conference on Computational Cybernetics and Simulation, Orlando, FL 1997, pp. 2719-2723.
PCT International Search Report PCT/US01/22385, Dec. 18, 2002 (4 pp).
Chakrabarti, S. et al., “Focused crawling: a new approach to topic-specific Web resource discovery, ”Proceedings of 8th International World Wide Web Conference, 1999 (pp. 545-562).
Cho, J. et al., “Efficient Crawling through URL Ordering,”Proceedings of Seventh International Web Conference, 1998 (20 pp.).
Rennie, J. and A. McCallum, “Using reinforcement learning to spider the Web efficiently,”Proceedings of ICML-99, 1999 (16 pp.).
McCallam, A. et al., “A Machine Learning Approach to Building Domain-Specific Search Engines,”Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999 (6 pp.).
McCallam, A. et al., “Building Domain-Specific Search Engines with Machine Learning Techniques,”Proceedings of AAAI-99 Spring Symposium on Intelligent Agents in Cyberspace, 1999 (12 pp.).

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Computer method and apparatus for collecting people and... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Computer method and apparatus for collecting people and..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Computer method and apparatus for collecting people and... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3545854

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.