Data processing: speech signal processing – linguistics – language – Linguistics – Natural language
Reexamination Certificate
2006-03-30
2010-11-30
Hudspeth, David R (Department: 2626)
Data processing: speech signal processing, linguistics, language
Linguistics
Natural language
C704S001000, C704S010000, C707S706000, C707S707000, C707S708000
Reexamination Certificate
active
07844449
ABSTRACT:
A scalable two-pass scalable probabilistic latent semantic analysis (PLSA) methodology is disclosed that may perform more efficiently, and in some cases more accurately, than traditional PLSA, especially where large and/or sparse data sets are provided for analysis. The improved methodology can greatly reduce the storage and/or computational costs of training a PLSA model. In the first pass of the two-pass methodology, objects are clustered into groups, and PLSA is performed on the groups instead of the original individual objects. In the second pass, the conditional probability of a latent class, given an object, is obtained. This may be done by extending the training results of the first pass. During the second pass, the most likely latent classes for each object are identified.
REFERENCES:
patent: 5619709 (1997-04-01), Caid et al.
patent: 5794178 (1998-08-01), Caid et al.
patent: 5839106 (1998-11-01), Bellegarda
patent: 6374217 (2002-04-01), Bellegarda
patent: 6687696 (2004-02-01), Hofmann et al.
patent: 6728695 (2004-04-01), Pathria et al.
patent: 6954750 (2005-10-01), Bradford
patent: 7031970 (2006-04-01), Blitzer
patent: 7117437 (2006-10-01), Chen et al.
patent: 7328216 (2008-02-01), Hofmann et al.
patent: 7376893 (2008-05-01), Chen et al.
patent: 7451395 (2008-11-01), Brants et al.
patent: 7529765 (2009-05-01), Brants et al.
patent: 2006/0242140 (2006-10-01), Wnek
patent: 2006/0242190 (2006-10-01), Wnek
patent: 2007/0067281 (2007-03-01), Matveeva et al.
Lin et al., Scalable Collaborative Filtering Using Cluster-based Smoothing, Proceedings of the 28thannual Internation ACM SIGIR conference on Research and development in information retrieval, 2005, pp. 114-121.
Landauer et al., An Introduction to Latent Semantic Analysis, 1998, Discourse Processes, 25 pp. 259-284.
Brants et al., Topic-Based Segmentation with Probabilistic Latent Semantic Analysis, 2002, ACM, pp. 211-218.
Sun et al, Supervised Latent Semantic Indexing for Document Categorization, IEEE (ICDM '04), 2004, pp. 535-538.
Xu et al., Using probabilistic latent semantic analysis for Web page grouping, IEEE, 2005, pp. 29-36.
Hofmann, Thomas, “Latent Semantic Models for Collaborative Filtering,” ACM Transactions on Information Systems, vol. 22, No. 1, Jan. 2004, pp. 89-115.
Hofmann, Thomas, “Probabilistic Latent Semantic Analysis,” Uncertainity in Artificial Intelligence, UAI '99, Stockholm, first date of publication unknown, 8 pages.
Chen Zheng
Han Jie
Lin Chenxi
Wang Jian
Xue Guirong
Hudspeth David R
Lee & Hayes PLLC
Microsoft Corporation
Spooner Lamont M
LandOfFree
Scalable probabilistic latent semantic analysis does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Scalable probabilistic latent semantic analysis, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Scalable probabilistic latent semantic analysis will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4183152