Reexamination Certificate
2011-06-14
Timblin, Robert (Department: 2167)
Data processing: database and file management or data structures
Database and file access
Search engines
C707S710000
active
07962471
ABSTRACT:
Demographic information of an Internet user is predicted based on an analysis of accessed web pages. Web pages accessed by the Internet user are detected and mapped to a user path vector which is converted to a normalized weighted user path vector. A centroid vector identifies web page access patterns of users with a shared user profile attribute. The user profile attribute is assigned to the Internet user based on a comparison of the vectors. Bias values are also assigned to a set of web pages and a user profile attribute can be predicted for an Internet user based on the bias values of web pages accessed by the user. User attributes can also be predicted based on the results of an expectation maximization process. Demographic information can be predicted based on the combined results of a vector comparison, bias determination, or expectation maximization process.
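The vector comparison described in the abstract can be illustrated with a minimal sketch. It is not the patented method. It assumes visit counts as the user path vector, optional per-page weights, L2 normalization, and cosine similarity as the comparison, none of which the abstract specifies. The function names, page names, and attribute labels "A" and "B" are hypothetical.

```python
import math
from collections import Counter


def path_vector(visited_pages, vocabulary, weights=None):
    """Map a list of visited pages to a weighted count vector over `vocabulary`.

    `weights` is a hypothetical per-page weight table; pages default to 1.0.
    """
    counts = Counter(visited_pages)
    weights = weights or {}
    return [counts[page] * weights.get(page, 1.0) for page in vocabulary]


def normalize(vec):
    """Scale a vector to unit L2 length (assumed normalization)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def centroid(vectors):
    """Component-wise mean of vectors from users sharing one attribute."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def predict_attribute(user_vector, centroids):
    """Return the attribute label whose centroid is most similar to the user."""
    return max(centroids, key=lambda label: cosine(user_vector, centroids[label]))


# Hypothetical training data: page visits of users with a known attribute.
vocabulary = ["sports.example", "news.example", "crafts.example"]
training = {
    "A": [["sports.example", "sports.example", "news.example"], ["sports.example"]],
    "B": [["crafts.example", "news.example"], ["crafts.example", "crafts.example"]],
}
centroids = {
    label: centroid([normalize(path_vector(v, vocabulary)) for v in visits])
    for label, visits in training.items()
}

new_user = normalize(path_vector(["sports.example", "news.example"], vocabulary))
predicted = predict_attribute(new_user, centroids)
print(predicted)
```

In this toy data the new user's visits overlap mostly with the pages seen by group "A", so that group's centroid is the closer one. The bias-value and expectation-maximization approaches named in the abstract are not covered by this sketch.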
REFERENCES:
patent: 5565684 (1996-10-01), Gulberg et al.
patent: 5724567 (1998-03-01), Rose et al.
patent: 5774586 (1998-06-01), LeCun
patent: 5991735 (1999-11-01), Gerace
patent: 6009410 (1999-12-01), LeMole et al.
patent: 6029195 (2000-02-01), Herz
patent: 6134532 (2000-10-01), Lazarus et al.
patent: 6182122 (2001-01-01), Berstis
patent: 6260038 (2001-07-01), Martin et al.
patent: 6304864 (2001-10-01), Liddy et al.
patent: 6408288 (2002-06-01), Ariyoshi
patent: 6446035 (2002-09-01), Grefenstette et al.
patent: 6529891 (2003-03-01), Heckerman
patent: 6574378 (2003-06-01), Lim
patent: 6681247 (2004-01-01), Payton
patent: 6687696 (2004-02-01), Hofmann et al.
patent: 6738678 (2004-05-01), Bharat et al.
patent: 6742003 (2004-05-01), Heckerman et al.
patent: 6757691 (2004-06-01), Welsh et al.
patent: 6757740 (2004-06-01), Parekh et al.
patent: 6839680 (2005-01-01), Liu et al.
patent: 7072795 (2006-07-01), Haft et al.
patent: 2003/0018636 (2003-01-01), Chi et al.
patent: 2006/0080321 (2006-04-01), Horn et al.
“The Binomial Distribution,” http://www.stat.yale.edu/Courses/1997-98/binom.htm, May 18, 2001.
“Web Usage Analysis and User Profiling,” http://www.acm.org/sigkdd/proceedings/webkdd99/, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
“Web Usage Analysis and User Profiling,” http://www.acm.org/sigkdd/proceedings/webkdd99/toconline.htm, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Bilmes, J.A., “A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models,” International Computer Science Institute, TR-97-021, Apr. 1998.
Borges, J., et al., “Data Mining of User Navigation Patterns,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper7-borges.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Büchner, A.G., et al., “Navigation Pattern Discovery from Internet Data,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper13-buchner.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Chan, P.K., “A non-invasive learning approach to building Web user profiles,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper22-chan.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Cooley, R., et al., “WebSIFT: The Web Site Information Filter System,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper11-cooley.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Dempster, et al., “Maximum Likelihood from Incomplete Data via the EM Algorithm,” J. Royal Statist. Soc. B39, pp. 1-38, 1977 (Read before the Royal Statistical Society, Dec. 8, 1976).
Etzioni, O., “The World-Wide Web: Quagmire or gold mine?” CACM, 39(11):65-68, Nov. 1996.
Fu, Y., et al., “Clustering of Web Users Based on Access Patterns,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper10-yfu.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Getoor, L., et al., “Using Probabilistic Relational Models for Collaborative Filtering,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper21-getoor.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Gomory, S., et al., “Analysis and Visualization of Metrics for Online Merchandising,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper8-jylee.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Gordon, A.D., “Classification: 2nd Edition,” Chapman & Hall, 1999.
Green, H., “The Information Gold Mine,” Business Week, Jul. 30, 1999.
Hofmann, T., “Probabilistic Latent Semantic Indexing,” Proc. SIGIR '99, pp. 50-57, Aug. 1999.
James, M., “Classification Algorithms,” John Wiley & Sons, Inc., 1985.
Lan, B., et al., “Making Web Servers Pushier,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper12-blan.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Leon-Garcia, A., “Probability and Random Processes for Electrical Engineering,” 2nd Ed., Addison-Wesley Publishing Company, Reading, MA, pp. 53-54, 61-64, 126-136, Sep. 1993.
Martin, D. C., “Abstract for the Invited Talk: The IBM SurfAid Project: Transactive Analysis and Prediction,” http://acm.org/sigkdd/proceedings/webkdd99/invited.htm, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Masand, B., et al., “Foreword,” http://acm.org/sigkdd/proceedings/webkdd99/forewordonline.htm, Web Usage Analysis and User Profiling. KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Meng, et al., “Maximum likelihood estimation via the ECM algorithm: A general framework,” Biometrika, 80, 2, pp. 267-278, 1993.
Murray, D., et al., “Inferring Demographic Attributes of Anonymous Internet Users,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper2-murray.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Russell, S., “The EM Algorithm,” Machine Learning, CS 281, Spring 1998.
Spiliopoulou, M., et al., “Improving the Effectiveness of a Web Site with Web Usage Mining,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper18-myra.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Toutanova, et al., “Text Classification in a Hierarchical Mixture Model for Small Training Sets,” http://www.stanford.edu/~krist/papers/cikm2001.pdf, see entire document and Sections 2.2 and 3.2.2 in particular, Aug. 29, 2001.
Vasconcelos, et al., “A Bayesian framework for semantic content characterization,” In Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, CVPR '98, pp. 566-571, Santa Barbara, Jun. 1998.
Weisstein, E. W., “Bayes' Formula,” http://br.crashed.net/~akrowne/crc/math/b/b076.htm, Sep. 1996, May 26, 1999.
Adamic Lada A.
Adar Eytan
Chen Francine R.
Palo Alto Research Center Incorporated
Park Vaughan Fleming & Dowler LLP
Reyes Mariela
Timblin Robert
Yao Shun
Title: User profile classification by web usage analysis
Profile ID: LFUS-PAI-O-2728273