Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2004-11-12
2008-10-21
Wong, Leslie (Department: 2161)
C707S793000
active
07440947
ABSTRACT:
A system and method for identifying query-related keywords in documents found in a search using latent semantic analysis. The documents are represented as a document term matrix M containing one or more document term-weight vectors d, which may be term-frequency (tf) vectors or term-frequency inverse-document-frequency (tf-idf) vectors. This matrix is subjected to a truncated singular value decomposition. The resulting transform matrix U can be used to project a query term-weight vector q into the reduced N-dimensional space, followed by its expansion back into the full vector space using the inverse of U. To perform a search, the similarity of q_expanded is measured relative to each candidate document vector in this space. Exemplary similarity functions are dot product and cosine similarity. Keywords are selected with the highest values in q_expanded that are also comprised in at least one document. Matching keywords from the query may be highlighted in the search results.
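The pipeline described in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the patented implementation: the toy corpus, the function names (`truncated_u`, `expand_query`, `cosine`, `select_keywords`), and the choice N = 2 are all hypothetical. Because the truncated U is not square, the sketch uses its transpose as a pseudo-inverse, which is valid since its columns are orthonormal.

```python
import numpy as np

# Hypothetical toy corpus: rows are terms, columns are documents (tf weights).
terms = ["car", "automobile", "engine", "flower", "petal"]
M = np.array([
    [2, 0, 1, 0],  # car
    [0, 2, 1, 0],  # automobile
    [1, 1, 2, 0],  # engine
    [0, 0, 0, 2],  # flower
    [0, 0, 0, 1],  # petal
], dtype=float)


def truncated_u(M, n_dims):
    """Return the first n_dims left singular vectors of M (terms x n_dims)."""
    U, _s, _Vt = np.linalg.svd(M, full_matrices=False)
    return U[:, :n_dims]


def expand_query(U_n, q):
    """Project q into the reduced space, then expand back to term space.

    Assumption: U_n has orthonormal columns, so U_n.T serves as its
    pseudo-inverse for the projection step.
    """
    return U_n @ (U_n.T @ q)


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


def select_keywords(q_expanded, doc, terms, k):
    """Pick the k terms present in doc with the highest q_expanded weights."""
    present = np.flatnonzero(doc > 0)
    top = sorted(present, key=lambda i: q_expanded[i], reverse=True)[:k]
    return [terms[i] for i in top]


U_n = truncated_u(M, n_dims=2)
q = np.array([1.0, 0.0, 0.0, 0.0, 0.0])  # query containing only "car"
q_expanded = expand_query(U_n, q)

# Score every document against the expanded query in the full term space.
scores = [cosine(q_expanded, M[:, j]) for j in range(M.shape[1])]
for j, score in enumerate(scores):
    print(j, round(score, 3), select_keywords(q_expanded, M[:, j], terms, k=2))
```

In this toy example the query contains only "car", yet the expanded query gives positive weight to "automobile" and "engine", so the second document (which never mentions "car") still scores well and yields keywords that could be highlighted in its search result.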
REFERENCES:
patent: 5694594 (1997-12-01), Chang
patent: 5703655 (1997-12-01), Corey et al.
patent: 6847966 (2005-01-01), Sommer et al.
patent: 2002/0107735 (2002-08-01), Henkin et al.
patent: 2003/0055810 (2003-03-01), Cragun et al.
patent: 2004/0117725 (2004-06-01), Chen et al.
patent: 2004/0122657 (2004-06-01), Brants et al.
patent: 2005/0022114 (2005-01-01), Shanahan et al.
patent: 2005/0210006 (2005-09-01), Robertson et al.
Mani, Inderjeet and Eric Bloedorn, “Summarizing Similarities and Differences Among Related Documents”, Information Retrieval, vol. 1, Nos. 1-2, pp. 35-67 (1999).
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., Harshman, R., “Indexing by Latent Semantic Analysis”, Journal of the American Society for Information Science, 41-6, pp. 391-407 (1990).
Berry, M. W., S. T. Dumais and G. W. O'Brien, “Using Linear Algebra for Intelligent Information Retrieval”, SIAM Review 37:4, pp. 573-595 (1995).
Landauer, T. K. and S. T. Dumais, “A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction and Representation of Knowledge”, Psychological Review, 104, pp. 211-240 (1997).
Adcock John E.
Cooper Matthew
Girgensohn Andreas
Wilcox Lynn D.
Chau Dung K
Fliesler & Meyer LLP
Fuji Xerox Co., Ltd.
Wong Leslie