User profile classification by web usage analysis

Data processing: database and file management or data structures – Database and file access – Post processing of search results

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C705S014660

Reexamination Certificate

active

08005833

ABSTRACT:
Demographic information of an Internet user is predicted based on an analysis of accessed web pages. Web pages accessed by the Internet user are detected and mapped to a user path vector which is converted to a normalized weighted user path vector. A centroid vector identifies web page access patterns of users with a shared user profile attribute. The user profile attribute is assigned to the Internet user based on a comparison of the vectors. Bias values are also assigned to a set of web pages and a user profile attribute can be predicted for an Internet user based on the bias values of web pages accessed by the user. User attributes can also be predicted based on the results of an expectation maximization process. Demographic information can be predicted based on the combined results of a vector comparison, bias determination, or expectation maximization process.

REFERENCES:
patent: 5565684 (1996-10-01), Gullberg et al.
patent: 5774586 (1998-06-01), LeCun
patent: 5991735 (1999-11-01), Gerace
patent: 6018738 (2000-01-01), Breese et al.
patent: 6029195 (2000-02-01), Herz
patent: 6134532 (2000-10-01), Lazarus et al.
patent: 6260038 (2001-07-01), Martin et al.
patent: 6408288 (2002-06-01), Ariyoshi
patent: 6446035 (2002-09-01), Grefenstette et al.
patent: 6529891 (2003-03-01), Heckerman
patent: 6574378 (2003-06-01), Lim
patent: 6633852 (2003-10-01), Heckerman et al.
patent: 6681247 (2004-01-01), Payton
patent: 6687696 (2004-02-01), Hofmann et al.
patent: 6742003 (2004-05-01), Heckerman et al.
patent: 6757691 (2004-06-01), Welsh et al.
patent: 6757740 (2004-06-01), Parekh et al.
patent: 6839680 (2005-01-01), Liu et al.
patent: 7072795 (2006-07-01), Haft et al.
patent: 7072888 (2006-07-01), Perkins
patent: 2002/0029162 (2002-03-01), Mascarenhas
patent: 2002/0091820 (2002-07-01), Hirai
patent: 2003/0018636 (2003-01-01), Chi et al.
Srivastava, J. et al., “Web Usage Mining: Discovery and Application of Usage Patterns from Web Data”, SIGKDD Exploration, ACM, Volumn 1, Issue 2, pp. 12-23, Jan. 2000.
Mobasher, B. et al., “Improving the Effectiveness of Collaborative Filtering on Anonymous Web Usage Data”, In Proceedings of the IJCAI 2001 Workshop on Intelligent Techniques for Web Personalization (ITWP01), Aug. 2001, Seattle.
Shahabi, C., et al., “Knowledge Discovery from Users Web-Page Navigation”, 7th International workshop on Research Issues in Data Engineering, pp. 20-29, Apr. 7-8, 1997.
Glance, N., et al. “Making Recommendaer Systems Work for Organization”, In Proceedings of PAAM'99 London, England, Apr. 1999.
“The Binomial Distribution,” http://www.stat.yale.edu/Courses/1997-98/binom.htm, May 18, 2001.
“Web Usage Analysis and User Profiling,” http://www.acm.org/sigkdd/proceedings/webkdd99/, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
“Web Usage Analysis and User Profiling,” http://www.acm.org/sigkdd/proceedings/webkdd99/toconline.htm, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Bilmes, J.A., “A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models,” International Computer Science Institute, TR-97-021, Apr. 1998.
Borges, J., et al., “Data Mining of User Navigation Patterns,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper7-borges.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Büchner, A.G., et al., “Navigation Pattern Discovery from Internet Data,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper13-buchner.ps, KDD-99 Workshop Program, San Digeo, CA, Aug. 15, 1999.
Chan, P.K., “A non-invasive learning approach to building Web user profiles,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper22-chan.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Cooley, R., et al., “WebSIFT: The Web Site Information Filter System,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper11-cooley.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Dempster, et al.., “Maximum Likelihood from Incomplete Data via theEMAlgorithm,” J. Royal Statist. Soc. B39, pp. 1-38, 1977 (Read before the Royal Statistical Society, Dec. 8, 1976).
Etzioni, O., “The World-Wide Web: Quagmire or gold mine?” CACM, 39(11):65-68, Nov. 1996.
Fu, Y., et al., “Clustering of Web Users Based on Access Patterns,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper10-yfu.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Getoor, L., et al., “Using Probabilistic Relational Models for Collaborative Filtering,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper21-getoor.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Gomory, S., et al., “Analysis and Visualization of Metrics for Online Merchandising,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper8-jylee.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Gordon, A.D., “Classification: 2ndEdition,” Chapman & Hall, 1999.
Green, H., “The information Gold Mine,” Business Week, Jul. 30, 1999.
Hofmann, T., “Probabilistic Latent Semantic Indexing,” Proc. SIGIR '99, pp. 50-57, Aug. 1999.
James, M., “Classification Algorithms,” John Wiley & Sons, Inc., 1985.
Lan, B., et al., “Making Web Servers Pushier,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper12-blan.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Leon-Garcia, A., “Probability and Random Processes for Electrical Engineering,” 2ndEd., Addison-Wesley Publishing Company, Reading, MA, pp. 53-54, 61-64, 126-136, Sep. 1993.
Martin, D. C., “Abstract for the Invited Talk: The IBM SurfAid Project: Transactive Analysis and Prediction,” http://www.acm.org/sigkdd/proceedings/webkdd99/invited.htm, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Masand, B., et al., “Foreword,” http://www.acm.org/sigkdd/proceedings/webkdd99/forewordonline.htm, Web Usage Analysis and User Profiling. KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Meng, et al., “Maximum likelihood estimation via the ECM algorithm: A general framework,” Biometrika, 80, 2, pp. 267-278, 1993.
Murray, D., et al., “Inferring Demographic Attributes of Anonymous Internet Users,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper2-murray.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Russell, S., “The EM Algorithm,” Machine Learning, CS 281, Spring 1998.
Spiliopoulou, M., et al., “Improving the Effectiveness of a Web Site with Web Usage Mining,” http://www.acm.org/sigkdd/proceedings/webkdd99/papers/paper18-myra.ps, KDD-99 Workshop Program, San Diego, CA, Aug. 15, 1999.
Toutanova, et al., “Text Classification in a Hierarchical Mixture Model for Small Training Sets,” http://www.stanford.edu/˜krist/papers/cikm2001.pdf, see entire document and Sections 2.2 and 3.2.2 in particular, Aug. 29, 2001.
Vasconcelos, et al., “A Bayesian framework for semantic content characterization,” In Proc. Of IEEE Conf. On Computer Vision and Pattern Recognition, CVPR '98, pp. 566-571, Santa Barbara, Jun. 1998.
Weisstein, E. W., “Bayes' Formula,” http://br.crashed.net/˜akrowne/crc/math/b/b076.htm, Sep. 1996, May 26, 1999.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

User profile classification by web usage analysis does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with User profile classification by web usage analysis, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and User profile classification by web usage analysis will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2787979

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.